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FOREVVORD 


The present book is meant as a basic text for a one-year course in algebra, 
at the graduate level. 


A perspective on algebra 

As I see it, the graduate course in algebra must primarily prepare students 
to handle the algebra which they will meet in all of mathematics: topology, 
partial differential equations, differential geometry, algebraic geometry, analysis, 
and representation theory, not to speak of algebra itself and algebraic number 
theory with all its ramifications. Hence I have inserted throughout references to 
papers and books which have appeared during the last decades, to indicate some 
of the directions in which the algebraic foundations provided by this book are 
used; I have accompanied these references with some motivating comments, to 
explain how the topics of the present book fit into the mathematics that is to 
come subsequently in various fields; and I have also mentioned some unsolved 
problems of mathematics in algebra and number theory. The abc conjecture is 
perhaps the most spectacular of these. 

Often when such comments and examples occur out of the logical order, 
especially with examples from other branches of mathematics, of necessity some 
terms may not be defined, or may be defined only later in the book. I have tried 
to help the reader not only by making cross-references within the book, but also 
by referring to other books or papers which I mention explicitly. 

I have also added a number of exercises. On the whole, I have tried to make 
the exercises complement the examples, and to give them aesthetic appeal. I 
have tried to use the exercises also to drive readers toward variations and appli- 
cations of the main text, as well as toward working out special cases, and as 
openings toward applications beyond this book. 


Organization 

Unfortunately, a book must be projected in a totally ordered way on the page 
axis, but that’s not the way mathematics “is”, so readers have to make choices 
how to reset certain topics in parallel for themselves, rather than in succession. 


V 
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I have inserted cross-references to help them do this, but different people will 
make different choices at different times depending on different circumstances. 

The book splits naturally into several parts. The first part introduces the basic 
notions of algebra. After these basic notions, the book splits in two major 
directions: the direction of algebraic equations including the Galois theory in 
Part II; and the direction of linear and multilinear algebra in Parts III and IV. 
There is some sporadic feedback between them, but their unification takes place 
at the next level of mathematics, which is suggested, for instance, in §15 of 
Chapter VI. Indeed, the study of algebraic extensions of the rationals can be 
carried out from two points of view which are complementary and interrelated: 
representing the Galois group of the algebraic closure in groups of matrices (the 
linear approach), and giving an explicit determination of the irrationalities gen- 
erating algebraic extensions (the equations approach). At the moment, repre- 
sentations in GL 2 are at the center of attention from various quarters, and readers 
will see GL 2 appear several times throughout the book. For instance, I have 
found it appropriate to add a section describing all irreducible characters of 
GL 2 (F) when F is a finite field. Ultimately, GL 2 will appear as the simplest but 
typical case of groups of Lie types, occurring both in a differential context and 
over finite fields or more general arithmetic rings for arithmetic applications. 

After almost a decade since the second edition, I find that the basic topics 
of algebra have become stable, with one exception. I have added two sections 
on elimination theory, complementing the existing section on the resultant. 
Algebraic geometry having progressed in many ways, it is now sometimes return- 
ing to older and harder problems, such as searching for the effective construction 
of polynomials vanishing on certain algebraic sets, and the older elimination 
procedures of last century serve as an introduction to those problems. 

Except for this addition, the main topics of the book are unchanged from the 
second edition, but I have tried to improve the book in several ways. 

First, some topics have been reordered. I was informed by readers and review- 
ers of the tension existing between having a textbook usable for relatively inex- 
perienced students, and a reference book where results could easily be found in 
a systematic arrangement. I have tried to reduce this tension by moving all the 
homological algebra to a fourth part, and by integrating the commutative algebra 
with the chapter on algebraic sets and elimination theory, thus giving an intro- 
duction to different points of view leading toward algebraic geometry. 

The book as a text and a reference 

In teaching the course, one might wish to push into the study of algebraic 
equations through Part II, or one may choose to go first into the linear algebra 
of Parts III and IV. One semester could be devoted to each, for instance. The 
chapters have been so written as to allow maximal flexibility in this respect, and 
I have frequently committed the crime of lèse-Bourbaki by repeating short argu- 
ments or defìnitions to make certain sections or chapters logically independent 
of each other. 



FOREVVORD VÌÌ 


Granting the material which under no circumstances can be omitted from a 
basic course, there exist several options for leading the course in various direc- 
tions. It is impossible to treat all of them with the same degree of thoroughness. 
The precise point at which one is willing to stop in any given direction will 
depend on time, place, and mood. However, any book with the aims of the 
present one must include a choice of topics, pushing ahead in deeper waters, 
while stopping short of full involvement. 

There can be no universal agreement on these matters, not even between the 
author and himself. Thus the concrete decisions as to what to include and what 
not to include are finally taken on grounds of general coherence and aesthetic 
balance. Anyone teaching the course will want to impress their own personality 
on the material, and may push certain topics with more vigor than I have, at the 
expense of others. Nothing in the present book is meant to inhibit this. 

Unfortunately, the goal to present a fairly comprehensive perspective on 
algebra required a substantial increase in size from the first to the second edition, 
and a moderate increase in this third edition. These increases require some 
decisions as to what to omit in a given course. 

Many shortcuts can be taken in the presentation of the topics, which 
admits many variations. For instance, one can proceed into field theory and 
Galois theory immediately after giving the basic definitions for groups, rings, 
fields, polynomials in one variable, and vector spaces. Since the Galois theory 
gives very quickly an impression of depth, this is very satisfactory in many 
respects. 

It is appropriate here to recall my original indebtedness to Artin, who first 
taught me algebra. The treatment of the basics of Galois theory is much 
influenced by the presentation in his own monograph. 

Audience and background 

As I already stated in the forewords of previous editions, the present book 
is meant for the graduate level, and I expect most of those coming to it to have 
had suitable exposure to some algebra in an undergraduate course, or to have 
appropriate mathematical maturity. I expect students taking a graduate course 
to have had some exposure to vector spaces, linear maps, matrices, and they 
will no doubt have seen polynomials at the very least in calculus courses. 

My books Undergraduate Algebra and Linear Algebra provide more than 
enough background for a graduate course. Such elementary texts bring out in 
parallel the two basic aspects of algebra, and are organized differently from the 
present book, where both aspects are deepened. Of course, some aspects of the 
linear algebra in Part III of the present book are more “elementary” than some 
aspects of Part II, which deals with Galois theory and the theory of polynomial 
equations in several variables. Because Part II has gone deeper into the study 
of algebraic equations, of necessity the parallel linear algebra occurs only later 
in the total ordering of the book. Readers should view both parts as running 
simultaneously. 
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Unfortunately, the amount of algebra which one should ideally absorb during 
this fìrst year in order to have a proper background (irrespective of the subject 
in which one eventually specializes) exceeds the amount which can be covered 
physically by a lecturer during a one-year course. Hence more material must be 
included than can actually be handled in class. I find it essential to bring this 
material to the attention of graduate students. 

I hope that the various additions and changes make the book easier to use as 
a text. By these additions, I have tried to expand the general mathematical 
perspective of the reader, insofar as algebra relates to other parts of mathematics. 
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Logical Prerequisites 


We assume that the reader is familiar with sets, and with the symbols fì, U, 
D, C, e. If A, B are sets, we use the symbol A C B to mean that A is contained 
in B but may be equal to B. Similarly for A D B. 

If f:A-*Bìsa. mapping of one set into another, we write 

x i * f(x) 

to denote the effect of / on an element x of A. We distinguish between the 
arrows -► and i—►. We denote by f(A) the set of all elements/(x), with x e A. 

Let /: A -► B be a mapping (also called a map). We say that / is injective 
if x # y implies f(x) # / (y). We say / is surjective if given b e B there exists 
a e A such that /(a) = b. We say that / is bijective if it is both surjective and 
injective. 

A subset A of a set B is said to be proper if A # B. 

Let /: A -► B be a map, and A' a subset of A. The restriction of / to A' is 
a map of A' into B denoted by /1 A'. 

If /: A -► B and g : B -* C are maps, then we have a composite map ^o/ 
such that (g o f)(x) = g(f(x)) for all x e A. 

Let/: A -+ B be a map, and fl' a subset of B. By f~\B') we mean the subset 
of A consisting of all x e A such that /(x) e B'. We call it the inverse image of 
B'. We call /(A) the image of /. 

A diagram 



C 


is said to be commutative if g © / = h. Similarly, a diagram 

/ 1 — f —+B 

<P 9 

C -► D 

* 


ix 
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is said to be commutative if g o f = \j/ © <p. We deal sometimes with more 
complicated diagrams, consisting of arrows between various objects. Such 
diagrams are called commutative if, whenever it is possible to go from one 
object to another by means of two sequences of arrows, say 

A à ì A à > . . . èzl , A 


and 


then 


a x ^b 2 ^~ 


9m— 1 


B m = A n> 


fn-l ° •“ °/l = 9m -1 ° ° 9l> 


in other words, the composite maps are equal. Most of our diagrams are 
composed of triangles or squares as above, and to verify that a diagram con- 
sisting of triangles or squares is commutative, it suffices to verify that each 
triangle and square in it is commutative. 

We assume that the reader is acquainted with the integers and rational 
numbers, denoted respectively by Z and Q. For many of our examples, we also 
assume that the reader knows the real and complex numbers, denoted.by R 
and C. 

Let A and / be two sets. By a family of elements of A , indexed by /, one 
means a map /: / -► A. Thus for each i e I we are given an element /(i) e A. 
Although a family does not differ from a map, we think of it as determining a 
collection of objects from A , and write it often as 

{/( 0 } í6 / 


or 


K }iel > 

writing di instead of f(i). We call / the indexing set. 

We assume that the reader knows what an equivalence relation is. Let A 
be a set with an equivalence relation, let E be an equivalence class of elements 
of A. We sometimes try to define a map of the equivalence classes into some 
set B. To define such a map / on the class £, we sometimes first give its value 
on an element xeE (called a representative of E), and then show that it is 
independent of the choice of representative x e E. In that case we say that / 
is well defined. 

We have products of sets, say finite products A x B, or A x x • • • x A n , and 
products of families of sets. 

We shall use Zorn’s lemma, which we describe in Appendix 2. 

We let #(S) denote the number of elements of a set S , also called the 
cardinality of S. The notation is usually employed when S is finite. We also 
write #(S) = card(5). 
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Part One 

THE BASIC 
OBJECTS OF 
ALGEBRA 


This part introduces the basic notions of algebra, and the main difficulty 
for the beginner is to absorb a reasonable vocabulary in a short time. None 
of the concepts is difficult, but there is an accumulation of new concepts which 
may sometimes seem heavy. 

To understand the next parts of the book, the reader needs to know 
essentially only the basic definitions of this first part. Of course, a theorem 
may be used later for some specific and isolated applications, but on the 
whole, we have avoided making long logical chains of interdependence. 





CHAPTERI 

Groups 


§1. MONOIDS 

Let S be a set. A mapping 


S x S -* S 

is sometimes called a law of composition (of S into itself). If x, y are elements of 
5, the image of the pair (x, y) under this mapping is also called their product 
under the law of composition, and will be denoted by xy. (Sometimes, we also 
write x • y, and in many cases it is also convenient to use an additive notation, 
and thus to write x + y. In that case, we call this element the sum of x and y. 
It is customary to use the notation x + y only when the relation x + y = 
y + x holds.) 

Let S be a set with a law of composition. If x, y, z are elements of S, then we 
may form their product in two ways: ( xy)z and x{yz). If ( xy)z = x(yz) for all 
x, _v, z in S then we say that the law of composition is associative. 

An element e of S such that ex = x = xe for all xe S is called a unit 
element. (When the law of composition is written additively, the unit element 
is denoted by 0, and is called a zero element.) A unit element is unique, for if 
e' is another unit element, we have 

e = ee’ = e' 

by assumption. In most cases, the unit element is written simply 1 (instead of e). 
For most of this chapter, however, we shall write e so as to avoid confusion in 
proving the most basic properties. 

A monoid is a set G, with a law of composition which is associative, and 
having a unit element (so that in particular, G is not empty). 
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Let G be a monoid, and x u ..., x„ elements of G (where n is an integer > 1). 
We define their product inductively: 

n 

= ' X n (^i * * * X n - l)^n* 

v= 1 

We then have the following rule: 

m n m + n 

n*,- n- m+ v = n^v, 

H=l v = 1 v = 1 

which essentially asserts that we can insert parentheses in any manner in our 
product without changing its value. The proof is easy by induction, and we shall 
leave it as an exercise. 

One also writes 

m + n n 

II instead of Yl x m + , 

m + 1 v — 1 

and we define 


o 

n x v = e. 

V = 1 

As a matter of convention, we agree also that the empty product is equal 
to the unit element. 

It would be possible to define more general laws of composition, i.e. maps 
Sj x S 2 S 3 using arbitrary sets. One can then express associativity and 
commutativity in any setting for which they make sense. For instance, for 
commutativity we need a law of composition 

f:S x S^T 

where the two sets of departure are the same. Commutativity then means 
/ (*» y)=f (y» x X = y * if we omit the mapping / from the notation. For 

associativity, we leave it to the reader to formulate the most general combination 
of sets under which it will work. We shall meet special cases later, for instance 
arising from maps 


5 x S -► S and 5 x T -► T. 

Then a product {xy)z makes sense with xeS, yeS , and zeT. The product 
x(yz) also makes sense for such elements x, y, z and thus it makes sense to say 
that our law of composition is associative, namely to say that for all x, y, z as 
above we have (xy)z = x(yz). 

If the law of composition of G is commutative, we also say that G is com- 
mutative (or abelian). 
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Let G be a commutative monoid, and x u ..., x„ elements of G. Let \j/ be a 
bijection of the set of integers (1,..., n) onto itself. Then 

n n 

fk<v> = n*v 

V — 1 v=1 

We prove this by induction, it being obvious for n = 1. We assume it for 
n — 1. Let k be an integer such that il/(k) = n. Then 


n 

k- 1 


n-k 

n x *<v> 

= 

• x 1,(k) 

‘ 0 X ^(* + v) 

i 

i 


1 


k~ 1 

n-k 



= n x *(vr 

n x > 

p(k + v) ‘ X ìlf(k) 


1 i 

Define a map <pof(l,...,n— 1) into itself by the rule 
cpiy) = iA(v) if v < /c, 

<p(v) = i/<(v +1) if v = k. 

Then 

n k- 1 n-k 

n x ^(v> = n x <p(v) o x <p(fc -1+v) ■ x n 

i i i 

n- 1 

= rK(v)- x *> 

i 

which, by induction, is equal to x t • • x n , as desired. 

Let G be a commutative monoid, let / be a set, and let /: / -► G be a 
mapping such that /(/) = e for almost all i e I. (Here and thereafter, almost 
all will mean all but a finite number.) Let / 0 be the subset of 1 consisting of 
those ì such that f(i) # e. By 

n/(o 

iel 

we shall mean the product 

n/(o 

iel o 

taken in any order (the value does not depend on the order, according to the 
preceding remark). It is understood that the empty prpduct is equal to e. 

When G is written additively, then instead of a product sign, we write the 
sum sign X. 

There are a number of formal rules for dealing with products which it would 
be tedious to list completely. We give one example. Let /, J be two sets, and 
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/:/ x J -* G a mapping into a commutative monoid which takes the value e 
for almost all pairs (i, j). Then 

nín /(*»/) 1 = n ín /(*.»]. 

ie/ L/'eJ J jeJ L*e/ J 

We leave the proof as an exercise. 

As a matter of notation, we sometimes write f] /(i), omitting the signs 
i g /, if the reference to the indexing set is clear. 

Let x be an element of a monoid G. For every integer n ^ 0 we define x" 
to be 

n 

n* 

i 

so that in particular we have x° = e, x 1 = x, x 2 = xx,.... We obviously have 
x (n + m) _ x n x m an( j _ ^nm p ur thermore, from our preceding rules of 
associativity and commutativity, if x, y are elements of G such that xy = yx, 
then (xy) n = x n y n . We leave the formal proof as an exercise. 

If S, S' are two subsets of a monoid G, then we define SS' to be the subset 
consisting of all elements xy, with xe S and y e S'. Inductively, we can define 
the product of a finite number of subsets, and we have associativity. For in- 
stance, if S, S', S" are subsets of G, then (SS')S" = S(S'S"). Observe that GG = G 
(because G has a unit element). If x e G, then we define xS to be {x}S, where 
{x} is the set consisting of the single element x. Thus xS consists of all elements 
xy, with yeS. 

By a submonoid of G, we shall mean a subset H of G containing the unit 
element e , and such that, if x, y e // then xy e H (we say that // is closed under 
the law of composition). It is then clear that H is itself a monoid, under the law 
of composition induced by that of G. 

If x is an element of a monoid G, then the subset of powers x n (n = 0, 1,...) 
is a submonoid of G. 

The set of integers ^ 0 under addition is a monoid. 

Later we shall define rings. If R is a commutative ring, we shall deal with 
multiplicative subsets 5, that is subsets containing the unit element, and such 
that if x, y e S then xy e S. Such subsets are monoids. 

A routine example. Let N be the natural numbers, i.e. the integers ^ 0. 
Then N is an additive monoid. In some applications, it is useful to deal with a 
multiplicative version. See the definition of polynomials in Chapter II, §3, where 
a higher-dimensional version is also used for polynomials in several variables. 

An interesting example. We assume that the reader is familiar with the 
terminology of elementary topology. Let M be the set of homeomorphism 
classes of compact (connected) surfaces. We shall define an addition in M. 
Let S, S' be compact surfaces. Let D be a small disc in S, and D ' a small disc in 
S'. Let C, C' be the circles which form the boundaries of D and D' respectively. 
Let D 09 D'o be the interiors of D and D' respectively, and glue S—D 0 to S'—D' 0 by 
identifying C with C'. It can be shown that the resulting surface is independent, 
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up to homeomorphism, of the various choices made in the preceding construc- 
tion. If cr, cr' denote the homeomorphism classes of S and S' respectively, we 
define cr -f cr' to be the class of the surface obtained by the preceding gluing 
process. It can be shown that this addition defines a monoid structure on M, 
whose unit element is the class of the ordinary 2-sphere. Furthermore, if r 
denotes the class of the torus, and n denotes the class of the projective plane, 
then every element cr of M has a unique expression of the form 

cr = nx -f mn 

where n is an integer ^ 0 and m = 0, 1, or 2. We have 3n = x -F n. 

(The reasons for inserting the preceding example are twofold: First to 
relieve the essential dullness of the section. Second to show the reader that 
monoids exist in nature. Needless to say, the example will not be used in any 
way throughout the rest of the book.) 

Still other examples. At the end of Chapter III, §4, we shall remark that 
isomorphism classes of modules over a ring form a monoid under the direct sum. 
In Chapter XV, § 1, we shall consider a monoid consisting of equivalence classes 
of quadratic forms. 


§2. GROUPS 

A group G is a monoid, such that for every element xeG there exists an 
element yeG such that xy = yx = e. Such an element y is called an inverse for 
x. Such an inverse is unique, because if y' is also an inverse for x, then 

y’ = ýe = ý(xy) = (ýx)y = ey = y. 

We denote this inverse by x~ l (or by — x when the law of composition is 
written additively). 

For any positive integer n , we let x~ n = (x -1 )”. Then the usual rules for 
exponentiation hold for all integers, not only for integers ^ 0 (as we pointed out 
for monoids in §1). The trivial proofs are left to the reader. 

In the definitions of unit elements and inverses, we could also define left 
units and left inverses (in the obvious way). One can easily prove that these 
are also units and inverses respectively under suitable conditions. Namely: 

Let G be a set with an associative law of composition , let e be a left unit for 
that law , and assume that every element has a left inverse. Then e is a unit , 
and each left inverse is also an inverse. In particular , G is a group. 

To prove this, let a e G and let b e G be such that ba = e. Then 

bab = eb = b. 

Multiplying on the left by a left inverse for b yields 

ab = e, 

or in other words, b is also a right inverse for a. One sees also that a is a left 
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inverse for b. Furthermore, 

ae = aba = ea = a, 

whence e is a right unit. 

Example. Let G be a group and S a nonempty set. The set of maps M(S, G) 
is itself a group; namely for two maps /, g of S into G we define fg to be the 
map such that 

(fg)(x) = f(x)g(x), 

and we define/ -1 to be the map such that/ _1 (x) = f(x)~ l . It is then trivial 
to verify that M(S , G) is a group. If G is commutative, so is M(S, G ), and when 
the law of composition in G is written additively, so is the law of composition 
in M(S, G), so that we would write f + g instead of fg, and —/ instead of /“ L 

Example. Let S be a non-empty set. Let G be the set of bijective mappings 
of S onto itself. Then G is a group, the law of composition being ordinary com- 
position of mappings. The unit element of G is the identity map of S, and the 
other group properties are trivially verified. The elements of G are called 
permutations of S. We also denote G by Perm(S). For more information on 
Perm(5) when S is finite, see §5 below. 

Example. Let us assume here the basic notions of linear algebra. Let k be 
a fìeld and V a vector space over k. Let GL(V) denote the set of invertible k- 
linear maps of V onto itself. Then GL(V) is a group under composition of 
mappings. Similarly, let k be a field and let GL(n, k) be the set of invertible 
n X n matrices with components in k. Then GL(n, k) is a group under the 
multiplication of matrices. For n ^ 2, this group is not commutative. 

Example. The group of automorphisms. We recommend that the reader 
now refer immediately to §11, where the notion of a category is defined, and 
where several examples are given. For any object A in a category, its auto- 
morphisms form a group denoted by Aut(A). Permutations of a set and the linear 
automorphisms of a vector space are merely examples of this more general 
structure. 

Example. The set of rational numbers forms a group under addition. The 
set of non-zero rational numbers forms a group under multiplication. Similar 
statements hold for the real and complex numbers. 

Example. Cyclic groups. The integers Z form an additive group. A group 
is defined to be cyclic if there exists an element a e G such that every element 
of G (written multiplicatively) is of the form a n for some integer n. If G is written 
additively, then every element of a cyclic group is of the form na. One calls a 
a cyclic generator. Thus Z is an additive cyclic group with generator 1, and 
also with generator -1. There are no other generators. Given a positive integer 
n, the n-th roots of unity in the complex numbers form a cyclic group of order 
n. In terms of the usual notation, e 2m/n is a generator for this group. So is e 2nir/n 
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with r e Z and r prime to n. A generator for this group is called a primitive 
n-th root of unity. 

Example. The direct product. Let Gj, G 2 be groups. Let G x x G 2 be 
the direct product as sets, so G x x G 2 is the set of all pairs (jc^, jc 2 ) with 
jc z e G,-. We define the product componentwise by 

(*1> > 2 ) = Ul3'l, 

Then Gj x G 2 is a group, whose unit element is (e x , e 2 ) (where e t is the unit 
element of G z ). Similarly, for n groups we define G x x • • • x G n to be the set 
of n-tuples with jc z - e G t (i = 1, ... , n), and componentwise multiplication. 
Even more generally, let / be a set, and for each i e /, let G f be a group. Let 
G = n G/ be the set-theoretic product of the sets G,-. Then G is the set of all 
families (jc z ) le/ with jc z - e G z . We can define a group structure on G by compo- 
nentwise multiplication, namely, if (jc z ) ze/ and (y z ) ze/ are two elements of G, we 
define their product to be (jc z y z ) ze/ . We define the inverse of (jc z ) ze/ to be (jc z rl ) ze/ . 
It is then obvious that G is a group called the direct product of the family. 

Let G be a group. A subgroup H of G is a subset of G containing the unit 
element, and such that H is closed under the law of composition and inverse 
(i.e. it is a submonoid, such that if xe H then x" 1 e H). A subgroup is called 
trivial if it consists of the unit element alone. The intersection of an arbitrary 
non-empty family of subgroups is a subgroup (trivial verification). 

Let G be a group and S a subset of G. We shall say that S generates G, 
or that S is a set of generators for G, if every element of G can be expressed as a 
product of elements of S or inverses of elements of S, i.e. as a product x x • • • x n 
where each x f or x t ~ 1 is in S. It is clear that the set of all such products is a 
subgroup of G (the empty product is the unit element), and is the smallest sub- 
group of G containing S. Thus 5 generates G if and only if the smallest subgroup 
of G containing S is G itself. If G is generated by S, then we write G = (S). By 
definition, a cyclic group is a group which has one generator. Given elements 
X\ 9 ... , x n e G, these elements generate a subgroup (x x , . . . , x n ), namely the 
set of all elements of G of the form 

x k il ’ ’ * x k r r with k r e Z. 

A single element jc e G generates a cyclic subgroup. 

Example. There are two non-abelian groups of order 8. One is the group 
of symmetries of the square, generated by two elements o-, r such that 

(T 4 = t 2 = e and tot - 1 = o 3 . 

The other is the quaternion group, generated by two elements, i, j such that 
if we put k = ij and m = i 2 , then 

i 4 = j 4 = k 4 = e 9 i 2 = j 2 = k 2 = m, ij = mji. 

After you know enough facts about groups, you can easily do Exercise 35. 
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Let G, G' be monoids. A monoid-homomorphism (or simply homomorphism) 
of G into G' is a mapping/: G -> G' such that/(xy) = f(x) f(y) for all x, y e G, 
and mapping the unit element of G into that of G'. If G, G' are groups, a group- 
homomorphism of G into G' is simply a monoid-homomorphism. 

We sometimes say: “Let/:G -► G' be a group-homomorphism” to mean: 
“Let G, G' be groups, and let/be a homomorphism from G into GV’ 

Let/: G -► G' be a group-homomorphism. Then 

/(* _1 ) =/W _1 

because if e, e' are the unit elements of G, G' respectively, then 
e' =/(e) =/(xx _1 ) =/(x)/(x _1 ). 

Furthermore, if G, G' are groups and/: G -► G' is a map such that 

f(xy) =f(x)f(y) 

for all x, y in G, then / (e) = e' because /(ee) = /(e) and also = f(e)f(e). 
Multiplying by the inverse of f(e) shows that f(e) = e'. 

Let G, G' bemonoids. A homomorphism/: G -► G' iscalled an isomorphism 
if there exists a homomorphism g:G' -> G such that / ° g and g © / are the 
identity mappings (in G' and G respectively). It is trivially verified that / is 
an isomorphism if and only if / is bijective. The existence of an isomorphism 
between two groups G and G' is sometimes denoted by G « G'. If G = G', 
we say that isomorphism is an automorphism. A homomorphism of G into 
itself is also called an endomorphism. 

Example. Let G be a monoid and x an element of G. Let N denote the 
(additive) monoid of integers ^ 0. Then the map/: N -► G such that/ (n) = x n 
is a homomorphism. If G is a group, we can extend/to a homomorphism of Z 
into G (x w is defined for all n e Z, as pointed out previously). The trivial proofs 
are left to the reader. 

Let n be a fixed integer and let G be a commutative group. Then one verifies 
easily that the map 

x i—► x n 

from G into itself is a homomorphism. So is the map xi—►x” 1 . The map 
x i—► x n is called the n-th power map. 

Example. Let / = {/} be an indexing set, and let {Gj be a family of groups. 
Let G = \\Gi be their direct product. Let 

Pî : G —► Gi 

be the projection on the i-th factor. Then p t is a homomorphism. 

Let G be a group , S a set of generators for G, and G' another group. Let 
f:S G' be a map. If there exists a homomorphism f of G into G' whose 
restriction to S is f then there is only one. 
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In other words, / has at most one extension to a homomorphism of G 
into G'. This is obvious, but will be used many times in the sequel. 

Let f :G-+G' and g :G' -* G" be two group-homomorphisms. Then the 
composite map g o/is a group-homomorphism. If/ g are isomorphisms then 
so is g o/ Furthermore /" 1 : G' -► G is also an isomorphism. In particular, 
the set of all automorphisms of G is itself a group, denoted by Aut(G). 

Let/:G -* G' be a group-homomorphism. Let e, c' be the respective unit 
elements of G, G'. We define the kernel of / to be the subset of G consisting 
of all x such that f(x) = e'. From the definitions, it follows at once that the 
kernel H of / is a subgroup of G. (Let us prove for instance that H is closed 
under the inverse mapping. Let xe H. Then 

f(x- l )f(x)=m = e'. 

Since f(x) = e\ we have f(x~ l ) = e' , whence x” 1 eH. We leave the other 
verifications to the reader.) 

Let /: G -► G' be a group-homomorphism again. Let H' be the image of/. 
Then H' is a subgroup of G\ because it contains e\ and if f(x\f(y) e H' , then 
f(xy) = f(x)f(y) lies also in H'. Furthermore,/(x” l ) = f(x)~ l is in H' , and 
hence H' is a subgroup of G'. 

The kernel and image of/are sometimes denoted by Ker/and Im/ 

A homomorphism /: G -► G' which establishes an isomorphism between 
G and its image in G' will also be called an embedding. 

A homomorphism whose kernel is trivial is injective. 

To prove this, suppose that the kernel of/is trivial, and let f(x) = f(y) for 
some x, yeG. Multiplying by f(y~ ^) we obtain 

fixy' 1 ) =/W/0 ;_1 ) = e’- 

Hence xy~ 1 lies in the kernel, hence xy~ 1 = e, and x = y. If in particular/is 
also surjective, then / is an isomorphism. Thus a surjective homomorphism 
whose kernel is trivial must be an isomorphism. We note that an injective 
homomorphism is an embedding. 

An injective homomorphism is often denoted by a special arrow, such as 

/: G <=^G'. 

There is a useful criterion for a group to be a direct product of subgroups: 

Proposition 2.1. Let G be a group and let H , K be two subgroups such that 

H n K = e, HK = G, and such that xy = yxfor all xeH and yeK. Then 

the map 

H x K-G 

such that (x, y) > xy is an isomorphism. 

Proof. It is obviously a homomorphism, which is surjective since HK = G. 
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If (x, y ) is in its kernel, then x = y \ whence x lies in both H and K , and x = e, 
so that y = e also, and our map is an isomorphism. 

We observe that Proposition 2.1 generalizes by induction to a finite number 
of subgroups H u ... 9 H n whose elements commute with each other, such that 

H\ ' • H n = G, 

and such that 

H i + 1 n(H 1 -..H l ) = *. 

In that case, G is isomorphic to the direct product 

Hj x .. x H n . 

Let G be a group and H a subgroup. A left coset of H in G is a subset of 
G of type aH , for some element úf of G. An element of aH is called a coset 
representative of aH. The map x f-> ax induces a bijection of H onto aH. 
Hence any two left cosets have the same cardinality. 

Observe that if a, b are elements of G and aH , bH are cosets having one 
element in common, then they are equal. Indeed, let ax = by with x, y e H. 
Then a = byx~ l . But yx -1 eH. Hence aH = b(yx~ l )H = bH , because for 
any z e H we have zH = H. 

We conclude that G is the disjoint union of the left cosets of H. A similar 
remark applies to right cosets (i.e. subsets of G of type Ha). The number of left 
cosets of H in G is denoted by (G : H), and is called the (left) index of H in G. 
The index of the trivial subgroup is called the order of G and is written (G: 1). 
From the above conclusion, we get: 

Proposition 2.2. Let G be a group and H a subgroup. Then 

(G: H)(H: 1) = (G: 1), 

in the sense that if two ofthese indices are finite , so is the third and equality 
holds as stated. If (G : 1) is finite , the order of H divides the order of G. 

More generally , let H, K be subgroups of G and let H => K. Let {xj be a 
set of (left) coset representatives of K in H and let {yj} be a set of coset repre- 
sentatives of H in G. Then we contend that {yyX,} is a set of coset representa- 
tives of K in G. 

Proof. Note that 

H = (J x t K (disjoint), 

í 

G = U yj H (disjoint). 

j 

Hence 

G = (J y jXi K. 

ÌJ 

We must show that this union is disjoint, i.e. that the y^x, represent distinct 
cosets. Suppose 
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yjXiK = y r x v K 

for a pair of indices (/ i) and (/, i'). Multiplying by H on the right, and noting 
that x, , x v are in H , we get 

yj H = y r H , 

whence y } = y y . From this it follows that x t K = x r K and therefore that 
x, = x r , as was to be shown. 

The formula of Proposition 2.2 may therefore be generalized by writing 
(G : K) = (G : H)(H : K), 

with the understanding that if two of the three indices appearing in this formula 
are finite, then so is the third and the formula holds. 

The above results are concerned systematically with left cosets. For the right 
cosets, see Exercise 10. 

Example. A group of prime order is cyclic. Indeed, let G have order p and 
let a E G, a # e. Let H be the subgroup generated by a. Then #(//) divides p 
and is # 1, so #(//) = p and so H = G, which is therefore cyclic. 

Example. Let J n = {1, ... , n}. Let S n be the group of permutations of 
J n . We define a transposition to be a permutation r such that there exist 
two elements r # s in J n for which r(r) = 5, 7 ( 5 ) = r, and r(k) = k for all 
k # r, s. Note that the transpositions generate S n . Indeed, say cr is a permutation, 
cr(n) = k # n. Let r be the transposition interchanging k , n. Then rcr leaves n 
fìxed, and by induction, we can write rcr as a product of transpositions in 
Perm^y^-!), thus proving that transpositions generate S n . 

Next we note that #(S n ) = n\. Indeed, let H be the subgroup of S n consisting 
of those elements which leave n fixed. Then H may be identified with S n -i. If 
(Ti (i = 1, .. . , n) is an element of S n such that a^n) = i , then it is immediately 
verified that cr x , ... , a n are coset representatives of H. Hence by induction 

(S n : 1) = n(H : 1) = n\. 

Observe that for cr, we could have taken the transposition r h which interchanges 
i and n (except for i = n, where we could take cr n to be the identity). 


§3. NORMAL SUBGROUPS 

We have already observed that the kernel of a group-homomorphism is a 
subgroup. We now wish to characterize such subgroups. 

Let/: G -► G' be a group-homomorphism, and let H be its kernel. If x is an 
element of G, then xH = Hx, because both are equal to f~ l (f(x)). We can 
also rewrite this relation as xHx ~ 1 = H. 
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Conversely, let G be a group, and let H be a subgroup. Assume that for all 
elements x of G we have xH c Hx (or equivalently, xHx~ l cz H). If we 
write x _1 instead of x, we get i/ cz xHx ~ l , whence xHx~ l = /f. Thus our 
condition is equivalent to the condition x//x“ 1 = H for all x e G. A subgroup 
H satisfying this condition will be called normal. We shall now see that a normal 
subgroup is the kernel of a homomorphism. 

Let G' be the set of cosets of H. (By assumption, a left coset is equal to a right 
coset, so we need not distinguish between them.) If xH and yH are cosets, then 
their product ( xH)(yH ) is also a coset, because 

xHyH = xyHH = xyH. 

By means of this product, we have therefore defined a law of composition on G' 
which is associative. It is clear that the coset H itself is a unit element for this 
law of composition, and that x~ l H is an inverse for the coset xH. Hence G' is a 
group. 

Let f .G-+G' be the mapping such that /(x) is the coset xH. Then / is 
clearly a homomorphism, and (the subgroup) H is contained in its kernel. If 
/(x) = H , then xH = H. Since H contains the unit element, it follows that 
xeH. Thus H is equal to the kernel, and we have obtained our desired homo- 
morphism. 

The group of cosets of a normal subgroup H is denoted by G/H (which we 
read G modulo //, or G mod H). The map/ of G onto G/H constructed above 
is called the canonical map, and G/H is called the factor group of G by H. 

Remarks 

1. Let {Hi} ieI be a family of normal subgroups of G. Then the subgroup 

h = n h, 

i G / 

is a normal subgroup. Indeed, if y e //, and xeG, then xyx ~ 1 lies in each H h 
whence in H. 

2. Let S be a subset of G and let N = N s be the set of all elements xeG 
such that x5x _1 = S. Then N is obviously a subgroup of G, called the 
normalizer of 5 1 . If S consists of one element a , then N is also called the 
centralizer of a. More generally, let Z s be the set of all elements x e G such that 
xyx~ l = yîova\\ye S. Then Z s is called the centralizer of S. The centralizer 
of G itself is called the center of G. It is the subgroup of G consisting of all 
elements of G commuting with all other elements, and is obviously a normal 
subgroup of G. 

Examples. We shall give more examples of normal subgroups later when 
we have more theorems to prove the normality. Here we give only two examples. 

First, from linear algebra, note that the determinant is a homomorphism from 
the multiplicative group of square matrices into the multiplicative group of a 
field. The kernel is called the special linear group, and is normal. 
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Second, let G be the set of all maps T ab : R R such that 
T a b {x) = ax + b, with a + 0 and b arbitrary. Then G is a group under composition 
of mappings. Let A be the multiplicative group of maps of the form T a 0 (iso- 
morphic to R*, the non-zero elements of R), and let N be the group of translations 
T x b with b e R. Then the reader will verify at once that T a b a is a homo- 
morphism of G onto the multiplicative group, whose kernel is the group of 
translations, which is therefore normal. Furthermore, we have G = AN = NA, 
and N fl A = {id}. In the terminology of Exercise 12, G is the semidirect 
product of A and N. 

Let H be a subgroup of G. Then H is obviously a normal subgroup of its 
normalizer N H . We leave the following statements as exercises: 

If K is any subgroup of G containing H and such that H is normal in K, then 
K cz N h . 

If K is a subgroup of N h , then KH is a group and H is normal in KH. 

The normalizer of H is the largest subgroup of G in which H is normal. 

Let G be a group and H a normal subgroup. Let x, yeG. We shall write 

x = y (mod H) 

if x and y lie in the same coset of H, or equivalently if xy ~ 1 (or y~ l x) lie in H. 
We read this relation “x and y are congruent modulo //.” 

When G is an additive group, then 

x = 0 (mod H) 

means that x lies in H, and 

x = y (mod H) 

means that x — y (or y — x) lies in H. This notation of congruence is used 
mostly for additive groups. 



be a sequence of homomorphisms. We shall say that this sequence is exact if 
Im/ = Ker g. For example, if H is a normal subgroup of G then the sequence 

h-úg^g/h 

is exact (where j = inclusion and cp = canonical map). A sequence of homo- 
morphisms having more than one term, like 

G J' 1 .— r J 2 /“■ f n - 1 /'■ 

l -> v 2 -* <j 3 ->-* o„, 

is called exact if it is exact at each joint, i.e. if 

Imy; = Ker/ + j 

for each i = 1,..., n — 2. For example to say that 

0 -► G' 4 G 4 G" -> 0 
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is exact means that/is injective, that Im/ = Ker g , and that g is surjective. If 
H = Ker g then this sequence is essentially the same as the exact sequence 

0-+H-+G^G/H-+0. 

More precisely, there exists a commutative diagram 

0-►G'—-—♦ G ——► G" -♦ 0 


0-*• H -♦ G -► G/H -♦ 0 

in which the vertical maps are isomorphisms, and the rows are exact. 

Next we describe some homomorphisms, all of which are called canonical. 

(i) Let G, G' be groups and f\G-*G' a homomorphism whose kernel 
is H. Let cp: G -* G/H be the canonical map. Then there exists a unique 
homomorphism f* : G/H -> G' such that f = f*° (p, and /* is injective. 

To define /*, let xH be a coset of H. Since f(xy) = f(x) for all yeH, we 
define fJ(xH) to be f(x). This value is independent of the choice of coset 
representative x, and it is then trivially verified that f* is a homomorphism, is 
injective, and is the unique homomorphism satisfying our requirements. We 
shall say that /* is induced by / 

Our homomorphism f* induces an isomorphism 

Â: G/H -► Im/ 

of G/H onto the image of/ and thus/can be factored into the following succes- 
sion of homomorphisms: 

G 4 G/// 4 Im/4 G'. 

Here, j is the inclusion of Im / in G'. 

(ii) Let G be a group and H a subgroup. Let N be the intersection of all 
normal subgroups containing H. Then N is normal, and hence is the smallest 
normal subgroup of G containing H. Let/: G -> G' be a homomorphism whose 
kernel contains H. Then the kernel of/ contains N , and there exists a unique 
homomorphism /* : G/N G', said to be induced by / making the following 
diagram commutative: 



G/N 


As before, cp is the canonical map. 

We can define/* as in (1) by the rule 

UxN) =/(x). 

This is well defined, and is trivially verified to satisfy all our requirements. 
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(iii) Let G be group and H z> K two normal subgroups of G. Then K is normal 
in //, and we can define a map of G/K onto G/H by associating with each coset 
xK the coset xH. It is immediately verified that this map is a homomorphism, 
and that its kernel consists of all cosets xK such that x e H. Thus we have a 
canonical isomorphism 


0 G/K)/(H/K ) % G/H. 


One could also describe this isomorphism using (i) and (ii). We leave it to the 
reader to show that we have a commutative diagram 


0 -> H -> G -> G/H 


can 


can 


id 


0-> H/K -► G/K -> G/H 


0 

0 


where the rows are exact. 

(iv) Let G be a group and let //, K be two subgroups. Assume that H 
is contained in the normalizer of K. Then H n K is obviously a normal 
subgroup of H , and equally obviously HK = KH is a subgroup of G. There 
is a surjective homomorphism 


H - HK/K 


associating with each xe H the coset xK of K in the group HK. The reader 
will verify at once that the kernel of this homomorphism is exactly H n K. 
Thus we have a canonical isomorphism 


H/(H nK ) % HK/K. 


(v) Let f:G-+G' be a group homomorphism, let H' be a normal sub- 
group of G', and let H = f~\H'). 

G -► G' 


f~\H ')- >H' 

Then/“ \H') is normal in G. [ Proof : Ifx e G, then/ (xHx~ l ) =f(x)f (H)f(x )~ 1 
is contained in H ', so xHx~ 1 c= //.] We then obtain a homomorphism 

G -> G' - G'/H' 

composing / with the canonical map of G' onto G'/H ', and the kernel of this 
composite is H. Hence we get an injective homomorphism 

/:G///- G'/H' 
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again called canonical, giving rise to the commutative diagram 



If/is surjective, then/is an isomorphism. 

We shall now describe some applications of our homomorphism statements. 
Let G be a group. A sequence of subgroups 

G = G 0 => Gì ^ G 2 = • =3 G m 

is called a tower of subgroups. The tower is said to be normal if each G i+ j is 
normal in G, (i = 0,... , m — 1). It is said to be abelian (resp. cyclic) if it is 
normal and if each factor group Gj/G I + ^ is abelian (resp. cyclic). 

Let/: G -> G' be a homomorphism and let 

G' = G' 0 d Gi d • • o G' m 

be a normal tower in G'. Let G, = /" ^(G;). Then the G, (i = 0,..., m) form a 
normal tower. If the G\ form an abelian tower (resp. cyclic tower) then the G, 
form an abelian tower (resp. cyclic tower), because we have an injective homo- 
morphism 

g,/g í+1 - g;/g; +1 

for each i, and because a subgroup of an abelian group (resp. a cyclic group) is 
abelian (resp. cyclic). 

A refinement of a tower 

G = G 0 => Gj =>•••=> G m 

is a tower which can be obtained by inserting a finite number of subgroups in 
the given tower. A group is said to be solvable if it has an abelian tower, whose 
last element is the trivial subgroup (i.e. G m = {e} in the above notation). 

Proposition 3.1. Let G be a finite group. An abelian tower of G admits a 
cyclic refinement. Let G be a finite solvable group. Then G admits a cyclic 
tower , whose last element is {e}. 

Proof. The second assertion is an immediate consequence of the first, and 
it clearly suffices to prove that if G is finite, abelian, then G admits a cyclic tower. 
We use induction on the order of G. Let x be an element of G. We may assume 
that x # e. Let X be the cyclic group generated by x. Let G' = G/X. By 
induction, we can find a cyclic tower in G', and its inverse image is a cyclic tower 
in G whose last element is X. If we refine this tower by inserting {e} at the end, 
we obtain the desired cyclic tower. 

Example. In Theorem 6.4 it will be proved that a group whose order is a 
prime power is solvable. 
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Example. One of the major results of group theory is the Feit-Thompson 
theorem that all finite groups of odd order are solvable. Cf. [Go 68]. 

Example. Solvable groups will occur in field theory as the Galois groups 
of solvable extensions. See Chapter VI, Theorem 7.2. 


Example. We assume the reader knows the basic notions of linear algebra. 
Let k be a field. Let G = GL(n , k) be the group of invertible n x n matrices in 
k. Let T = T(n, k) be the upper triangular group; that is, the subgroup of matrices 
which are 0 below the diagonal. Let D be the diagonal group of diagonal matrices 
with non-zero components on the diagonal. Let N be the additive group of matrices 
which are 0 on and below the diagonal, and let U = I + N, where / is the unit 
n x n matrix. Then U is a subgroup of G. (Note that N consists of nilpotent 
matrices, i.e. matrices A such that A m = 0 for some positive integer m. Then 
(I — A)~ l = / + ì4+ì4 2 +...+ A m ~ x is computed using the geometric series.) 
Given a matrix Ae f, let diag(A) be the diagonal matrix which has the same 
diagonal components as A. Then the reader will verify that we get a surjective 
homomorphism r „ D given by A „ iiag(A) 


The kernel of this homomorphism is precisely U. More generally, observe that 
for r ^ 2, the set N r ~ l consists of all matrices of the form 


/o o ••• 0 a lr . a ln 

I 0 0 • * • 0 0 02,r+l ' * ' a 2n 


M 


0 0 
0 0 

0 0 


a n-r+ 1,/î 


0 

0 


/ 


Let U r = / + N r . Then U\ = U and U r D U r+ \. Furthermore, U r+] is normal 
in U r , and the factor group is isomorphic to the additive group (!) k n ~ r , under the 
the mapping which sends / + M to the n — r-tuple ( a lr+ì , ... , a n - r n ) e k n ~ r . 
This n - r-tuple could be called the r-th upper diagonal. Thus we obtain an 
abelian tower 


TDU=U\Z)U 2 D...DU n = {/}. 

Theorem 3.2. Let G bea group andH a normal subgroup. Then G is solvable 
if and only if H and G/H are solvable. 

Proof. We prove that G solvable implies that H is solvable. Let 
G = G 0 D G\ D ... D G r = {e} be a tower of groups with G i+Ì normal in G t 
and such that Gi/G i+Ì is abelian. Let H ( = H fl G z . Then H i+Ì is normal in H i9 
and we have an embedding // z /// z+1 -+ G z /G z+1 , whence // z /// z+1 is abelian, 
whence proving that H is solvable. We leave the proofs of the other statements 
to the reader. 
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Let G be a group. A commutator in G is a group element of the form xyx~ x y ~ 1 
with x, y e G. Let G c be the subgroup of G generated by the commutators. We 
call G c the commutator subgroup of G. As an exercise, prove that G c is normal 
in G, and that every homomorphism f: G —> G' into a commutative group G' 
contains G c in its kernel, and consequently factors through the factor commutator 
group G/G c . Observe that G/G c itself is commutative. Indeed, if x denotes the 
image of x in G/G c , then by definition we have xŷx~ l ŷ~ l = ê, so x 
and ŷ commute. In light of the definition of solvability, it is clear that the 
commutator group is at the heart of solvability and non-solvability problems. 

A group G is said to be simple if it is non-trivial, and has no normal subgroups 
other than {e} and G itself. 

Examples. An abelian group is simple if and only if it is cyclic of prime 
order. Indeed, suppose A abelian and non-trivial. Let a e A, a e. If a generates 
an infinite cyclic group, then a 2 generates a proper subgroup and so A is not 
simple. If a has finite period, and A is simple, then A = (a). Let n be the period 
and suppose n not prime. Write n = rs with r, s > 1. Then a r + e and a r 
generates a proper subgroup, contradicting the simplicity of A, so a has prime 
period and A is cyclic of order p. 

Examples. Using commutators, we shall give examples of simple groups 
in Theorem 5.5 (the alternating group), and in Theorem 9.2 of Chapter XIII 
iPSL n (F ), a group of matrices to be defined in that chapter). Since a non-cyclic 
simple group is not solvable, we get thereby examples of non-solvable groups. 

A major program of finite group theory is the classification of all finite 
simple groups. Essentially most of them (if not all) have natural representa- 
tions as subgroups of linear maps of suitable vector spaces over suitable fields, 
in a suitably natural way. See [Go 82], [Go 86], [Sol 01] for surveys. Gaps in 
purported proofs have been found. As of 2001, these are still incomplete. 

Next we are concerned with towers of subgroups such that the factor groups 
Gi/G i+Ì are simple. The next lemma is for use in the proof of the Jordan-Hôlder 
and Schreier theorems. 

Lemma 3.3. (Butterfly Lemma.) (Zassenhaus) Let U, V be subgroups 

of a group. Let u, v be normal subgroups ofU and V, respectively. Then 

u(U n v) is normal in u(U n V\ 

(u n V)v is normal in (U n V)v, 

and the factor groups are isomorphic , i.e. 

u(U n V)/u(U n v) « (U n V)v/(u n V)v. 

Proof. The combination of groups and factor groups becomes clear if 
one visualizes the following diagram of subgroups (which gives its name to the 
lemma): 
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U V 



In this diagram, we are given U, u, V, v. All the other points in the diagram 
correspond to certain groups which can be determined as follows. The inter- 
section of two line segments going downwards represents the intersection of 
groups. Two lines going upwards meet in a point which represents the product 
of two subgroups (i.e. the smallest subgroup containing both of them). 

We consider the two parallelograms representing the wings of the butterfly, 
and we shall give isomoqphisms of the factor groups as follows: 

u(u n v) __ unv ^ (_u_n v)v 
U (u nv) (ii n V)(u nv) (u n V)v * 

In fact, the vertical side common to both parallelograms has U n V as its 
top end point, and (u nV)(U n v) as its bottom end point. We have an iso- 
morphism 

(U n V)/(u n V)(U n v) « u(U n V)/u(U n v). 

This is obtained from the isomorphism theorem 

H/(H n Af) « HN/N 

by setting H = U H V and N = u(U H v). This gives us the isomorphism on 
the left. By symmetry we obtain the corresponding isomorphism on the right, 
which proves the Butterfly lemma. 

Let G be a group, and let 

G = Gj => G 2 => • • => G r = { e }, 


G = //p// 2 D..o// s =W 

be normal towers of subgroups, ending with the trivial group. We shall say 
that these towers are equivalent if r = s and if there exists a permutation of the 
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indices i = 1,..., r — 1, written i i—► i\ such that 

GJG i+ì ^H r /H v+í . 

In other words, the sequences of factor groups in our two towers are the same, 
up to isomorphisms, and a permutation of the indices. 

Theorem3.4. (Schreier) LetG bea group. Two normal towers of subgroups 
ending with the trivial group have equivalent refinements. 

Proof. Let the two towers be as above. For each i = 1, ..., r — 1 ánd 
j — 1,..., s we define 

Gìj = G i+ì (Hj n G/). 

Then G is = G i+U and we have a refinement of the first tower: 

G = G n ==> G 12 ^ G 1>s _! G 2 

= G 2 iDG 22 D--oG r _ 1(1 D*-oG r . 1(S . 1 D {e}. 

Similarly, we define 

Hjì = H J+í (Gi n H 

for j = 1, ..., s - 1 and i = 1, ..., r. This yields a refinement of the second 
tower. By the butterfly lemma, for i = 1, ..., r - 1 and j = 1, ..., s - 1 we 
have isomorphisms 

Gij/Gi,j + \ ~ HjjH j i+ j. 

We view each one of our refined towers as having (r — l)(s — 1) + 1 elements, 
namely G i} (i = 1,..., r — 1 = 1,..., s — 1) and {e} in the first case, and 

{e} in the second case. The preceding isomorphism for each pair of indices 
(ij) shows that our refined towers are equivalent, as was to be proved. 

A group G is said to be simple if it is non-trivial, and has no normal sub- 
groups other than {e} and G itself. 

Theorem 3.5. (Jordan-Hòlder) Let G be a group, and let 
G = Gi =3 G 2 => • • • =3 G r = {e} 

be a normal tower such that each group Gi/G i+Í is simple , and G f ^ G i+Ì 
for i = 1,..., r — 1. Then any other normal tower of G having the same prop - 
erties is equivalent to this one. 

Proof. Given any refinement {G 0 } as before for our tower, we observe 
that for each i, there exists precisely one index j such that Gj/G i+ ^ = G l7 /G i j + v 
Thus the sequence of non-trivial factors for the original tower, or the refined 
tower, is the same. This proves our theorem. 
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§4. CYCLIC GROUPS 

The integers Z form an additive group. We shall determine its subgroups. 
Let H be a subgroup of Z. If H is not trivial, let a be the smallest positive integer 
in H. We contend that H consists of all elements na, with neZ. To prove this, 
let yeH. There exist integers n, r with 0 ^ r < a such that 

y = na + r. 

Since H is a subgroup and r = y — na, we have reH, whence r = 0, and our 
assertion follows. 

Let G be a group. We shall say that G is cyclic if there exists an element 
a of G such that every element x of G can be written in the form a n for some 
neZ (in other words, if the map/:Z -> G such that/ (n) = a n is surjective). 
Such an element a of G is then called a generator of G. 

Let G be a group and a e G. The subset of all elements a n (n e Z) is 
obviously a subgroup of G, which is cyclic. If m is an integer such that a m = e 
and m > 0 then we shall call m an exponent of a. We shall say that m > 0 is 
an exponent of G if x" 1 = e for all x e G. 

Let G be a group and aeG. Let/: Z -► G be the homomorphism such that 
f(n) = a n and let H be the kernel of/ Two cases arise: 

1. The kernel is trivial. Then/is an isomorphism of Z onto the cyclic subgroup 
of G generated by a, and this subgroup is infinite cyclic. If a generates G, then 
G is cyclic. We also say that a has infinite period. 

2. The kernel is not trivial. Let d be the smallest positive integer in the 
kernel. Then d is called the period of a. If m is an integer such that a m = e then 
m = ds for some integer s. We observe that the elements e, a,..., a d ~ l are 
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distinct. Indeed, if a r = a s with 0 ^ r, s ^ d — 1, and say r ^ s, then a s ~ r = 
e. Since 0^is-r<dwe must have s — r = 0. The cyclic subgroup generated 
by a has order d. Hence by Proposition 2.2: 

Proposition 4.1. Let G be a finite group oforder n > 1. Let a be an element 

ofG, a ± e. Then the period ofa divides n. Ifthe order ofG is a prime number 

p, then G is cyclic and the period ofany generator is equal to p. 

Furthermore: 

Proposition 4.2. Let Gbea cyclic group. Then every subgroup ofG is cyclic. 

Iffisa homomorphism ofG, then the image offis cyclic. 

Proof. If G is infinite cyclic, it is isomorphic to Z, and we determined above 
all subgroups of Z, finding that they are all cyclic. If /: G —» G' is a homo- 
morphism, and a is a generator of G, then f(a) is obviously a generator of/(G), 
which is therefore cyclic, so the image of/is cyclic. Next let H be a subgroup 
of G. We want to show H cyclic. Let a be a generator of G. Then we have a 
surjective homomorphism/: Z G such that f(n) = a n . The inverse image 
f~ l (H) is a subgroup of Z, and therefore equal to mZ for some positive integer 
m. Since / is surjective, we also have a surjective homomorphism mZ H. 
Since mZ is cyclic (generated additively by m ), it follows that H is cyclic, thus 
proving the proposition. 

We observe that two cyclic groups of the same order m are isomorphic. 
Indeed, if G is cyclic of order m with generator a , then we have a surjective 
homomorphism /: Z —» G such that f(n) = a n , and if kZ is the kernel, 
with k positive, then we have an isomorphism Z/kZ ~ G, so k = m. 
If u: Gj —> Z/mZ and v: G 2 ~* Z/mZ are isomorphisms of two cyclic groups 
with Z/mZ, then v~ x ° u: G x G 2 is an isomorphism. 

Proposition 4.3. 

(i) An infinite cyclic group has exactly two generators (if a is a generator , then 
a~ l is the only other generator). 

(ii) Let Gbea finite cyclic group of order n , and let xbe a generator. The set 
of generators of G consists of those powers x (iv) v of x such that v is relatively 
prime to n. 

(iii) Let Gbe a cyclic group , and let a , b be two generators. Then there exists 
an automorphism of G mapping a onto b. Conversely , any automorphism 
of G maps a on some generator of G. 

(iv) Let G be a cyclic group oforder n. Let dbe a positive integer dividing n. 
Then there exists a unique subgroup of G of order d. 

(v) Let Gj, G 2 be cyclic of orders m, n respectively. If m, n are relatively 
prime then Gj x G 2 is cyclic. 
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(vi) Let G be a finite abelian group. IfG is not cyclic, then there exists a prime 
p and a subgroup of G isomorphic to C x C, where C is cyclic of order 
P • 

Proof. We leave the first three statements to the reader, and prove the others. 

(iv) Let d\n. Let m = n/d. Let/: Z G be a surjective homomorphism. 
Then /(mZ) is a subgroup of G , and from the isomorphism Z/mZ ~ G/f(mZ) 
we conclude that f(mZ) has indexm in G, whence/(mZ) has order d. Conversely, 
let H be a subgroup of order d. Then f~ x (H) = mZ for some positive integer 

m, so H = /(mZ), Z/mZ ~ G///, so n = mJ, m = n/d and // is uniquely 
determined. 

(v) Let A = (a) and /? = (è) be cyclic groups of orders m, n , relatively prime. 
Consider the homomorphism Z A x B such that /: i-> (#*, ò*). An element 
in its kernel must be divisible both by m and n, hence by their product since m, 
n are relatively prime. Conversely, it is clear that mnZ is contained in the kernel, 
so the kernel is mnZ. The image of Z —» A x 5 is surjective by the Chinese 
remainder theorem. This proves (v). (A reader who does not know the Chinese 
remainder theorem can see a proof in the more general context of Chapter II, 
Theorem 2.2.) 

(vi) This characterization of cyclic groups is an immediate consequence of 
the structure theorem which will be proved in §8, because if G is not cyclic, 
then by Theorem 8.1 and (v) we are reduced to the case when G is a p-group, 
and by Theorem 8.2 there are at least two factors in the direct product (or sum) 
decomposition, and each contains a cyclic subgroup of orderp, whence G contains 
their direct product (or sum). Statement (vi) is, of course, easier to prove than 
the full structure theorem, and it is a good exercise for the reader to formulate 
the simpler arguments which yield (vi) directly. 

Note. For the group of automorphisms of a cyclic group, see the end of 
Chapter II, §2. 


§5. OPERATIONS OF A GROUP ON A SET 

Let G be a group and let S be a set. An operation or an action of G on S 
is a homomorphism 

77 : G * Perm(5) 

of G into the group of permutations of S. We then call S a G-set. We denote 
the permutation associated with an element x e G by tt x . Thus the homomorphism 
is denoted by x i-» 77*. Given íeS, the image of 5 under the permutation 77 * is 
77*(s). From such an operation we obtain a mapping 


G x S —> S , 
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which to each pair (jc, s) with x e G and s e S associates the element 7 ^( 5 ). We 
often abbreviate the notation and write simply xs instead of 7 ^( 5 ). With the 
simpler notation, we have the two properties: 

For all x, y e G and s e S, we have Jt(ys) = (xy)s. 

If e is the unit element of G, then es = s for all s e S. 

Conversely, if we are given a mapping G x S —> S, denoted by (x, s) i-> xs, 
satisfying these two properties, then for each x e G the map s i-> xs is permutation 
of S, which we then denote by 77 ^( 5 ). Then jc tt x is a homomorphism of G 
into Perm(S). So an operation of G on S could also be defined as a mapping 
G x S —» S satisfying the above two properties. The most important examples 
of representations of G as a group of permutations are the following. 

1. Conjugation. For each xe G, let c x : G G be the map such that 
c x (>>) = xyx~ l . Then it is immediately verified that the association x ^ c x is a 
homomorphism G —► Aut(G), and so this map gives an operation of G on itself, 
called conjugation. The kernel of the homomorphism x ^ c x is a normal sub- 
group of G, which consists of all x e G such that xyx~ l = y for all y e G, i.e. all 
x e G which commute with every element of G. This kemel is called the center 
of G. Automorphisms of G of the form c x are called inner. 

To avoid confusion about the operation on the left, we don’t write xy for 
c x (y). Sometimes, one writes 

C x -<{y) = x~ l yx = y x , 

i.e. one uses an exponential notation, so that we have the rules 

y (jC2) = (y x ) z and y e = y 

for all x, y, z e G. Similarly, x y = xyx~ l and z (*y) = zx y. 

We note that G also operates by conjugation on the set of subsets of G. 
Indeed, let S be the set of subsets of G, and let A e S be a subset of G. Then 
xAx~ l is also a subset of G which may be denoted by c x (A), and one verifies 
trivially that the map 

(x, A) 1—► xAx ~ 1 

ofG x S -> S is an operation of G on S. We note in addition that if A is a sub- 
group of G then xAx~ l is also a subgroup, so that G operates on the set of 
subgroups by conjugation. 

If A, B are two subsets of G, we say that they are conjugate if there exists 
xeG such that B = xAx~ l . 

2. Translation. For each jc e G we define the translation T x : G —> G by 
T x (y) = xy. Then the map 

(x, y) h-> xy = T x (y) 

defines an operation of G on itself. Warning: T x is not a group-homomorphism! 
Only a permutation of G. 
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Similarly, G operates by translation on the set of subsets, for if A is a 
subset of G, then xA = T X (A) is also a subset. If H is a subgroup of G, then 
T X (H) = xH is in general not a subgroup but a coset of H, and hence we see 
that G operates by translation on the set of cosets of H. We denote the set of 
left cosets of H by G/H. Thus even though H need not be normal, G/H is a 
G-set. It has become customary to denote the set of right cosets by H\G. 

The above two representations of G as a group of permutations will be used 
frequently in the sequel. In particular, the representation by conjugation will be 
used throughout the next section, in the proof of the Sylow theorems. 

3. Example from linear algebra. We assume the reader knows basic 
notions of linear algebra. Let k be a field and let V be a vector space over k. Let 
G = GL(V) be the group of linear automorphisms of V. For A e G and 
v e V, the map (A, v) i-> Av defines an operation of G on V. Of course, G is 
a subgroup of the group of permutations Perm(V0- Similarly, let V = k n be the 
vector space of (vertical) n-tuples of elements of k, and let G be the group of 
invertible n x n matrices with components in k. Then G operates on k n by 
(A, X) i—> AX for A e G and X e k n . 

Let S, S' be two G-sets, and f : SS' a map. We say that/is a morphism 
of G-sets, or a G-map, if 

f(xs) = xf(s) 

for all x e G and seS. (We shall soon define categories, and see that G-sets form 
a category.) 

We now return to the general situation, and consider a group operating on 
a set S. Let seS. The set ofelements xeG such that xs = s is obviously a sub- 
group of G, called the isotropy group of s in G, and denoted by G s . 

When G operates on itself by conjugation, then the isotropy group of an 
element is none other than the normalizer of this element. Similarly, when G 
operates on the set of subgroups by conjugation, the isotropy group of a sub- 
group is again its normalizer. 

Let G operate on a set S. Let s, s' be elements of S, and y an element of G 
such that ys = s'. Then 


G s = yG s y 1 

Indeed, one sees at once that yG s y~ l leaves s' fixed. Conversely, if 
x's' = s' then x'ys = ys, so y~ l x'y e G s and x' e yG s y~ l . Thus the isotropy 
gro.ups of s and s' are conjugate. 

Let K be the kernel of the representation G —» Perm(5). Then directly from 
the definitions, we obtain that 

K = Dg s = intersection of all isotropy groups. 

jeS 
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An action or operation of G is said to be faithful if K = {e}\ that is, the kernel 
of G —> Perm(5) is trivial. A fixed point of G is an element s e S such that 
xs = s for all x e G or in other words, G = G s . 

Let G operate on a set 5. Let seS. The subset of S consisting of all elements 
xs (with x g G) is denoted by Gs, and is called the orbit of s under G. If x and y 
are in the same coset of the subgroup H = G SÌ then xs = ys, and conversely 
(obvious). In this manner, we get a mapping 

/: G/H -* S 

given by f(xH) = xs, and it is clear that this map is a morphism of G-sets. In 
fact, one sees at once that it induces a bijection of G/H onto the orbit Gs. 
Consequently: 

Proposition 5.1. IfG is a group operating on a set S , and s e S, then the order 

of the orbit Gs is equal to the index (G : G s ). 

In particular, when G operates by conjugation on the set of subgroups, and 
H is a subgroup, then: 

Proposition 5.2. The number of conjugate subgroups to H is equal to the 

index of the normalizer of H. 

Example. Let G be a group and H a subgroup of index 2. Then H is normal 
in G. 

Proof. Note that H is contained in its normalizer N Hy so the index of N H 
in G is 1 or 2. If it is 1, then we are done. Suppose it is 2. Let G operate by con- 
jugation on the set of subgroups. The orbit of H has 2 elements, and G operates 
on this orbit. In this way we get a homomorphism of G into the group of 
permutations of 2 elements. Since there is one conjugate of H unequal to H , 
then the kernel of our homomorphism is normal, of index 2, hence equal to H 9 
which is normal, a contradiction which concludes the proof. 

For a generalization and other examples, see Lemma 6.7. 

In general, an operation of G on S is said to be transitive if there is only 
one orbit. 

Examples. The symmetric group S n operates transitively on {1, 2,. .., n}. 
In Proposition 2.1 of Chapter VII, we shall see a non-trivial example of transitive 
action of a Galois group operating on the primes lying above a given prime in 
the ground ring. In topology, suppose we have a universal covering space 
p: X ' —> X, where X is connected. Given x e X, the fundamental group ^(X) 
operates transitively on the inverse image p~ l (x). 
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Example. Let íp be the upper half-plane; that is, the set of complex numbers 
z = x + iy such that y > 0. Let G = SL 2 ( R) (2x2 matrices with determinant 
1). For 

( a b\ az + b 

a = I 6 G, we let az = --— 

\c d cz + d 


Readers will verify by brute force that this defines an operation of G on íp. The 
isotropy group of i is the group of matrices 


/ cos 6 sin & 
\—sin 0 cos 0, 


with 6 real. 


This group is usually denoted by K. The group G operates transitively. You can 
verify all these statements as easy exercises. 

Let G operate on a set S. Then two orbits of G are either disjoint or are 
equal. Indeed, if Gs^ and Gs 2 are two orbits with an element s in common, 
thens = xs ^ forsomex e G, and hence Gs = Gxs^ = Gs v Similarly, Gs = Gs 2 . 
Hence S is the disjoint union of the distinct orbits, and we can write 

S = 1J Gsì (disjoint), also denoted S = IJ Gs h 

iel *' e/ 


where / is some indexing set, and the s,- are elements of distinct orbits. If S is 
finite, this gives a decomposition of the order of S as a sum of orders of orbits, 
which we call the orbit decomposition formula, namely 


card(S) = £ (G : G s ). 

i € I 


Let x, y be elements of a group (or monoid) G. They are said to commute 
if xy = yx. If G is agroup, the set of allelements xeG whichcommute with all 
elements of G is a subgroup of G which we called the center of G. Let G act on 
itself by conjugation. Then x is in the center if and only if the orbit of x is x 
itself, and thus has one element. In general, the order of the orbit of x is equal 
to the index of the normalizer of x. Thus when G is a finite group, the above 
formula reads 


(G:l)= X(G:G x ) 

xeC 


where C is a set of representatives for the distinct conjugacy classes, and the 
sum is taken over all xeC. This formula is also called the class formula. 
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The class formula and the orbit decomposition formula will be used systematically 
in the next section on Sylow groups, which may be viewed as providing examples 
for these formulas. 

Readers interested in Sylow groups may jump immediately to the next section. 

The rest ofthis section deals with special properties ofthe symmetric group, 
which may serve as examples of the general notions we have developed. 

The symmetric group. Let S n be the group of permutations of a set 
with n elements. This set may be taken to be the set of integers 
J n = {1, 2,..., n}. Given any cr e S n , and any integer /, 1 ^ i ^ n 9 we may 
form the orbit of i under the cyclic group generated by or. Such an orbit is called 
a cycle for cr, and may be written 

[Ì\Ì2 * * * irì’ so O’O’l) = il, • • . , <r(i r - l) = in O’Ur) = 1 1 * 

Then {1,. .. , n} may be decomposed into a disjoint union of orbits for the cyclic 
group generated by cr , and therefore into disjoint cycles. Thus the effect of a 
on {1,. . . , n} is represented by a product of disjoint cycles. 

Example. The cycle [132] represents the permutation cr such that 

(7(1) = 3, (7(3) = 2, and (7(2) = 1. 

We have cr 2 (l) = 2, cr 3 ( 1) = 1. Thus {1, 3, 2} is the orbit of 1 under the cyclic 
group generated by o. 

Example. In Exercise 38, one will see how to generate S n by special types 
of generators. Perhaps the most important part of that exercise is that if n is 
prime, cr is an n-cycle and r is a transposition, then cr , r generate S n . As an 
application in Galois theory, if one tries to prove that a Galois group is all 
of S n (as a group of permutations of the roots), it suffices to prove that the 
Galois group contains an n-cycle and a transposition. See Example 6 of 
Chapter VI, §2. 

We want to associate a sign ±1 to each permutation. We do this in the 
standard way. Let /be a function of n variables, say /: Z n —> Z, so we can 
evaluate f{x x ,. . ., x n ). Let cr be a permutation of J n . We define the function 
tt(í7)/ by 

t T{o)f(x x , ..., x n ) = /Uo-d), . .., x a(n) ). 

Then for a, r e S n we have 7r(ar) = 7r(o-)7r(r). Indeed, we use the definition 
applied to the function g = 7r(r)/to get 


7T(&)7T(T)f(x u . . . ,x n ) = ('rr(T)f)(x 0 . i \ ) , . . . , x a(n) ) 

~ /C*ctt( 1)> • • • > *CTT(n)) 

= t T(ar)f(x\,. .., x n ). 
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Since the identity in S n operates as the identity on functions, it follows that we 
have obtained an operation of S n on the set of functions. We shall write more 
simply qfinstead of 7 T(&)f. It is immediately verified that for two functions /, 
g we have 

v(f + 9) = <rf + <79 and a(fg) = (af)((rg ). 

If c is constant, then cr(cf) = ca(f). 

Proposition 5.3. There exists a unique homomorphism e: S n —» {±1} such 
that for every transposition r we have e(r) = — 1 . 

Proof. Let A be the function 

A (x 1 ,. .. , x n ) 11 (x; Xj ), 

i<j 

the product being taken for all pairs of integers /, j satisfying 1 ^ / < j ^ n. 
Let r be a transposition, interchanging the two integers r and 5 . Say r < s. We 
wish to determine 

tAUi, . . . , X n ) = n UtO') _ x r (/>)• 

i<j 

For one factor involving j = s, i = r , we see that r changes the factor 
(x s - x r ) to -(x s - x r ). All other factors can be considered in pairs as follows: 

(x k - x s )(x k - x r ) if k > s, 

(x s - jc^Xjc* — jc r ) if r < k < s 9 
(x s — jc*)(jc r — jc*) if k < r. 

Each one of these pairs remains unchanged when we apply r. Hence we see that 

rA = — A. 

Let e(o-) be the sign 1 or -1 such that ctA = e(a)A for a permutation cr. 
Since 7 t(ctt) = 7t(o-)7t(t), it follows at once that e is a homomorphism, and the 
proposition is proved. 

In particular, if cr = t x • • • T m is a product of transpositions, then 
e(&) = (— \) m . As a matter of terminology, we call o-even if e(cr) = 1, and odd 
if e(cf) = — 1. The even permutations constitute the kernel of s, which is called 

the alternating group A n . 

Theorem 5.4. Ifn ^ 5 then S n is not solvable. 

Proof. We shall first prove that if H , N are two subgroups of S n such that 
N C H and N is normal in //, if H contains every 3-cycle, and if H/N is abelian, 
then N contains every 3-cycle. To see this, let / ,/ k , r, s be five distinct integers 
in J n , and let cr = [ijk\ and t = [krs\. Then a direct computation gives their 
commutator 


(iTcr 1 t 1 = [rki\. 
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Since the choice of i,j, k, r, s was arbitrary, we see that the cycles [rki] all lie 
in N for all choices of distinct r, k, i, thereby proving what we wanted. 

Now suppose that we have a tower of subgroups 

5 n = H 0 D H x D H 2 D • * • 3 H m = M 

such that H v is normal in H v - X for v = 1,... , m, and H v /H v - X is abelian. Since 
S n contains every 3-cycle, we conclude that H x contains every 3-cycle. By 
induction, we conclude that H m = { e } contains e very 3-cycle, which is impossible, 
thus proving the theorem. 

Remark concerning the sign e(o). A priori, we defined the sign for a 
given n, so we should write e n (&). However, suppose n < m. Then the restriction 
of e m to S n (viewed as a permutation of J n leaving the elements of J m not in J n 
fixed) gives a homomorphism satisfying the conditions of Proposition 5.3, so 
this restriction is equal to e n . Thus A m fì S n = A n . 

Next we prove some properties of the alternating group. 

(a) A n is generated by the 3 -cycles. Proof: Consider the product of two trans- 
positions [ij][rs]. If they have an element in common, the product is either the 
identity or a 3-cycle. If they have no element in common, then 

[ij][rs] = [ijr][jrs], 

so the product of two transpositions is also a product of 3-cycles. Since an even 
permutation is a product of an even number of transpositions, we are done. 

(b) ìf n ^ 5, all 3 -cycles are conjugate in A n . Proof: If y is a permutation, 
then for a cycle [i x ... i m ] we have 

y[i\ • • • «mly’ 1 = lyih) • • • y(û)]- 

Given 3-cycles [ijk] and [i'j'k'] there is a permutation y such that y(i) = /', 
j(j) ~ j' ’ an d yih) = . Thus two 3-cycles are conjugate in S n by some element 

y. If y is even, we are done. Otherwise, by assumption n ^ 5 there exist r, s 
not equal to any one of the three elements i,j, k. Then [rs] commutes with [ijk], 
and we replace y by y[rs] to prove (b). 

Theorem 5.5. If n ^ 5 then the alternating group A n is simple. 

Proof. Let /V be a non-trivial normal subgroup of A n . We prove that N 
contains some 3-cycle, whence the theorem follows by (b). Let cte N, cr =£ id, 
be an element which has the maximal number of fixed points; that is, integers 
/ such that cr(/) = /. It will suffice to prove that cr is a 3-cycle or the identity. 
Decompose J n into disjoint orbits of (cr).Then some orbits have more than one 
element. Suppose all orbits have 2 elements (except for the fixed points). Since 
cr is even, there are at least two such orbits. On their union, cr is represented as 
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a product of two transpositions [(/][rs]. Let k ¥ i, j, r, s. Let r = [rrá]. Let 
cr' = Tar~ l a~\ Then a' is a product of a conjugate of a and a~\so a' E N. 
But a' leaves i, j fixed, and any element t E J n , t ¥ i, j, r, s, k left fixed by a 
is also fixed by a', so a' has more fixed points than a, contradicting our 
hypothesis. 

So we are reduced to the case when at least one orbit of (a) has ^3 elements, 
say i,j,k,... . If a is not the 3-cycle [ijk\, then a must move at least two other 
elements of J n , otherwise a is an odd permutation [ijkr\ for some r e J n , which 
is impossible. Then let a move r, s other than i, j, k, and let r = [krs\. Let a' 
be the commutator as before. Then a' e. N and a'(i) = i, and all fixed points 
of a are also fixed points of a' whence a' has more fixed points than a, a 
contradiction which proves the theorem. 

Example. For n = 4, the group A 4 is not simple. As an exercise, show 
that A 4 contains a unique subgroup of order 4, which is not cyclic, and which 
is normal. This subgroup is also normal in S 4 . Write down explicitly its elements 
as products of transpositions. 


§6. SYLOW SUBGROUPS 

Let p be a prime number. By a p-group, we mean a finite group whose 
order is a power of p (i.e. p n for some integer n ^ 0). Let G be a finite group 
and H a subgroup. We call H a p-subgroup of G if H is a p-g roup. We call H 
a p-Sylow subgroup if the order of H is p n and if p n is the highest power of p 
dividing the order of G. We shall prove below that such subgroups always 
exist. For this we need a lemma. 

Lemma 6.1. Let G be a finite abelian group of order m, let p be a prime 

number dividing m. Then G has a subgroup of order p. 

Proof. We first prove by induction that if G has exponent n then the 
order of G divides some power of n. Let beG, b ¥ 1, and let H be the cyclic 
subgroup generated by b. Then the order of H divides n since b n = 1, and n 
is an exponent for G/H. Hence the order of G/H divides a power of n by 
induction, and consequently so does the order of G because 

(G: 1) = (G: H)(H: 1). 

Let G have order divisible by p. By what we have just seen, there exists an 
element x in G whose period is divisible by p. Let this period be ps for some 
integer 5 . Then x s ¥ 1 and obviously x s has period p, and generates a subgroup 
of order p, as was to be shown. 
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Theorem 6.2. Let G be a finite group and p a prime number dividing the 

order of G. Then there exists a p-Sylow subgroup of G. 

Proof By induction on the order of G. If the order of G is prime, our 
assertion is obvious. We now assume given a finite group G, and assume the 
theorem proved for all groups of order smaller than that of G. If there exists a 
proper subgroup H of G whose index is prime to p, then a p-Sylow subgroup of 
H will also be one ofG, and our assertion follows by induction. We may therefore 
assume that every proper subgroup has an index divisible by p. We now let G 
act on itself by conjugation. From the class formula we obtain 

(G:1) = (Z:1) + X(G:GJ. 

Here, Z is the center of G, and the term (Z : 1) corresponds to the orbits having 
one element, namely the elements of Z. The sum on the right is taken over the 
other orbits, and each index (G : G x ) is then > 1, hence divisible by p. Since p 
divides the order of G, it follows that p divides the order of Z, hence in particular 
that G has a non-trivial center. 

Let a be an element of order p in Z, and let H be the cyclic group generated 
by a. Since H is contained in Z, it is normal. Let f:G-+ G/H be the canonical 
map. Let p n be the highest power of p dividing (G : 1). Then p n ~ 1 divides the 
order of G/H. Let K' be a p-Sylow subgroup of G/H (by induction) and let 
K = f~ l (K'). Then K H and / maps K onto K'. Hence we have an iso- 
morphism K/H « K'. Hence K has order p n ~ l p = p", as desired. 

For the rest of the theorems, we systematically use the notion of a fixed point. 
Let G be a group operating on a set S. Recall that a fixed point 5 of G in S is 
an element s of S such that xs — s for all x e G. 

Lemma 6.3. Let H be a p-group acting on a finite set S. Then: 

(a) The number offixed points ofH is = #(S) mod p. 

(b) IfH has exactly one fixed point, then #(S) = 1 mod p. 

(c) If p | #(S), then the number of fixed points of H is = 0 mod p. 

Proof. We repeatedly use the orbit formula 

#(5) = 

For each fixed point s t we have H s . = H. For s ( not fixed, the index 
(H : H s .) is divisible by p, so (a) follows at once. Parts (b) and (c) are special 
cases of (a), thus proving the lemma. 

Remark. In Lemma 6.3(c), if H has one fixed point, then H has at least p 
fixed points. 

Theorem 6.4. Let G be a finite group. 

(i) IfH is a p-subgroup ofG, then H is contained in some p-Sylow subgroup. 
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(ii) All p-Sylow subgroups are conjugate. 

(iii) The number of p-Sylow subgroups ofG is =1 mod p. 

Proof. Let P be a p- Sylow subgroup of G. Suppose first that H is contained 
in the normalizer of P. We prove that H C P. Indeed, HP is then a subgroup 
of the normalizer, and P is normal in HP. But 

(HP : P) = (//:// D P), 

so if HP P , then HP has order a power of p, and the order is larger than #(P), 
contradicting the hypothesis that P is a Sylow group. Hence HP = P and 
HCP. 

Next, let S be the set of all conjugates of P in G. Then G operates on S by 
conjugation. Since the normalizer of P contains P, and has therefore index prime 
to p, it follows that #(5) is not divisible by p. Now let H be any p-subgroup. 
Then H also acts on S by conjugation. By Lemma 6.3(a), we know that// cannot 
have 0 fixed points. Let Q be a fixed point. By definition this means that H is 
contained in the normalizer of Q, and hence by the first part of the proof, that 
H C Q, which proves the first part of the theorem. The second part follows 
immediately by taking H to be a p- Sylow group, so #(//) = #((?), whence 
H = Q. In particular, when H is a /?-Sylow group, we see that H has only one 
fixed point, so that (iii) follows from Lemma 6.3(b). This proves the theorem. 

Theorem6.5. Let G be afinite p-group. Then G is solvable. If its order is 

> 1, then G has a non-triviaì center. 

Proof The first assertion follows from the second, since if G has center 
Z, and we have an abelian tower for G/Z by induction, we can lift this abelian 
tower to G to show that G is solvable. To prove the second assertion, we use 
the class equation 

(G:l) = card(Z) + I(G:G x ), 

the sum being taken over certain x for which (G: G x ) # 1. Then p divides 
(G : 1) and also divides every term in the sum, so that p divides the order of the 
center, as was to be shown. 

Corollary 6.6. Let G be a p-group \vhich is not of order 1. Then there 

exists a sequence of subgroups 


{f} = GocG,cG 2 c...cG, = G 

such that Gi is normal in G and G i+ JG^ is cyclic of order p. 

Proof. Since G has a non-trivial center, there exists an element a # e in 
the center of G, and such that a has order p. Let H be the cyclic group generated 
by a. By induction, if G # H, we can find a sequence of subgroups as stated 
above in the factor group G/H. Taking the inverse image of this tower in G 
gives us the desired sequence in G. 
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We now give some examples to show how to put some of the group theory 
together. 

Lemma 6.7. Let G be afinite group and let p be the smallest prime dividing 

the order ofG. Let H be a subgroup of index p. Then H is normal. 

Proof. Let N(H) = N be the normalizer of H. Then N = G or N = H. If 
N = G we are done. Suppose N = H. Then the orbit of H under conjugation 
has p = (G : H) elements, and the representation of G on this orbit gives a 
homomorphism of G into the symmetric group on p elements, whose order is 
p\. Let K be the kernel. Then K is the intersection of the isotropy groups, and 
the isotropy group of H is H by assumption, so K C H. If K =£ //, then from 

(G : K) = (G : H)(H : K) = p(H : K), 

and the fact that only the first power of p divides p\, we conclude that some 
prime dividing (p - 1)! also divides (H : K), which contradicts the assumption 
that p is the smallest prime dividing the order of G, and proves the lemma. 

Proposition 6.8. Let p, q be distinct primes and let G be a group of order 

pq. Then G is solvable. 

Proof. Say p < q. Let Q be a Sylow subgroup of order q. Then Q has index 
p , so by the lemma, Q is normal and the factor group has order p. But a group 
of prime order is cyclic, whence the proposition follows. 

Example. Let G be a group of order 35. We claim that G is cyclic. 

Proof. Let H n be the Sylow subgroup of order 7. Then // 7 is normal by 
Lemma 6.7. Let H 5 be a 5-Sylow subgroup, which is of order 5. Then H 5 
operates by conjugation on // 7 , so we get a homomorphism H 5 —> Aut (H 7 ). But 
Aut (H 7 ) is cyclic of order 6, so H 5 Aut (H 7 ) is trivial, so every element of 
H 5 commutes with elements of// 7 . Let H 5 = (x) and Z/ 7 = (y). Thtnx,y commute 
with each other and with themselves, so G is abelian, and so G is cyclic by 
Proposition 4.3(v). 

Example. The techniques which have been developed are sufficient to treat 
many cases of the above types. For instance every group of order < 60 is solvable, 
as you will prove in Exercise 27. 


§7. DIRECT SUMS AND FREE ABELIAN GROUPS 

Let {A,} íe/ be a family of abelian groups. We define their direct sum 

* = ®*/ 
ie/ 

to be the subset of the direct product fl consisting of all families (x^)^ with 
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x ( e Ai such that = 0 for all but a finite number of indices i. Then it is clear 
that A is a subgroup of the product. For each index j e /, we map 

Àj: Aj -> A 

by letting A/x) be the element whose ;-th component is x, and having all other 
components equal to 0. Then is an injective homomorphism. 

Proposition 7.1. Let {/: A t B} be a family of homomorphisms into an 
abelian group B. Let A = © A ( . There exists a unique homomorphism 

f:A-»B , 

such that f°Xj= f for all j. 

Proof. We can define a map /: A —> B by the rule 

/((*.)«=/) = X Mx,y 

iel 

The sum on the right is actually finite since all but a finite number of terms are 0. 
It is immediately verified that our map / is a homomorphism. Furthermore, 
we clearly have /° A/x) = /(x) for each j and each xeAj. Thus / has the 
desired commutativity property. It is also clear that the map / is uniquely 
determined, as was to be shown. 

The property expressed in Proposition 7.1 is called the universal property 
of the direct sum. Cf. §11. 

Example. Let A be an abelian group, and let {Ai] ieI be a family of sub- 
groups. Then we get a homomorphism 

@ Ai —* A such that (jq) i-> 2*/- 

ze/ 

Theorem 8.1 will provide an important specific application. 

Let A be an abelian group and B , C subgroups. If B + C = A and 
B fl C = {0} then the map 

B x C -* A 

given by (x, y) i—► x + y is an isomorphism (as we already noted in the non- 
commutative case). Instead of writing A = B x C we shall write 

A = £® C 

and say that A is the direct sum of B and C. We use a similar notation for the 
direct sum of a finite number of subgroups B ì9 . .., B n such that 

Bi + • • • + B n = A 

and 


&i+ i n (#i + • • • + B f ) = 0. 
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In that case we write 

A = Bi © • • • © B n . 

Let A be an abelian group. Let {^} (/ e /) be a family of elements of A. We 
say that this family is a basis for A if the family is not empty, and if every 
element of A has a unique expression as a linear combination 

x — 

with xi e Z and almost all x t = 0. Thus the sum is actually a finite sum. An 
abelian group is said to be free if it has a basis. If that is the case, it is immediate 
that if we let Z z = Z for all /, then A is isomorphic to the direct sum 

A-0Z». 

ie/ 

Next let 5 be a set. We shall define the free abelian group generated by 5 as 
follows. Let Z(S) be the set of all maps <p : S —» Z such that ç(x) = 0 for almost 
all x e S. Then Z (S) is an abelian group (addition being the usual addition of 
maps). If k is an integer and x is an element of S , we denote by k • x the map 
<p such that <p(x) = k and <p(y) = 0 if y x. Then it is obvious that every element 
<p of Z (S) can be written in the form 

q> = /Ci + ••• + k n -x n 

for some integers and elements x f eS (/ = 1,..., n), all the x^ being distinct. 
Furthermore, <p admits a unique such expression , because if we have 

<p= Y, k x- x = 

xeS xeS 

then 

0= 

xeS 


whence k' x = k x for all xeS. 

We map 5 into Z<5> by the map f s = f such that f(x) = 1 x. It is 
then clear that / is injective, and that f(S) generates Z<5>. If g: 5 -► B is a 
mapping of 5 into some abelian group B, then we can define a map 

Q* : Z<5> -► B 

such that 

ôf*(z fc * x ) = 'L k x9(x)- 

\xeS 1 xeS 

This map is a homomorphism (trivial) and we have g* °/ = g (also trivial). It 
is the only homomorphism which has this property, for any such homomorphism 
g * must be such that ^^(1 • x) = g(x). 
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It is customary to identify S in Z<S>, and we sometimes omit the dot when 
we write k x x or a sum £ /c x x. 

If X:S ^ S' is a mapping of sets , í/icrc is a unique homomorphism 1 mahing the 

follomng diagram commutative: 

s z<s> 

A J 

S'-^Z<S'> 

In fact, 1 is none other than (f s . o A)*, with the notation of the preceding para- 
graph. The proof of this statement is left as a trivial exercise. 

We shall denote Z (S) also by F ab (5), and call F ab (S) the free abelian group 
generated by S. We call elements of S its free generators. 

As an exercise, show that every abelian group A is a factor group of a free 
abelian group F. If A is finitely generated, show that one can select F to be 
finitely generated also. 

If the set S above consists of n elements, then we say that the free abelian 
group F ab (5) is the free abelian group on n generators. If S is the set of n 
letters x u ...,x n , we say that F ab (5) is the free abelian group with free 
generators x ì9 ..., x n . 

An abelian group is free if and only if it is isomorphic to a free abelian group 
F ab (5) for some set S. Let A be an abelian group, and let S be a basis for A. 
Then it is clear that A is isomorphic to the free abelian group F ab (S). 

As a matter of notation, if A is an abelian group and T a subset of elements 
of A, we denote by (T) the subgroup generated by the elements of T, i.e., the 
smallest subgroup of A containing T. 

Example. The Grothendieck group. Let M be a commutative monoid, 
written additively. There exists a commutative group K(M) and a monoid- 
homomorphism 

y:M -+ K(M) 

having the following universal property. If f: M A is a homomorphism into 
an abelian group A, then there exists a unique homomorphism /*: K(M) —> A 
making the following diagram commutative: 

M —1—* K(M) 



Proof. Let F ab (M) be the free abelian group generated by M. We denote 
the generator of F ah (M) corresponding to an element xeMby [x]. Let B be 
the subgroup generated by all elements of type 

[* + )']- M - DO 
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where x, ye M. We let K(M) = F ab (M)/B, and let 

y : M - K(M) 

be the map obtained by composing the injection of M into F ab (M) given by 
x i—► [x], and the canonical map 

FabíM) F ab (M)/B. 

It is then clear that y is a homomorphism, and satisfies the desired universal 
property. 

The universal group K(M) is called the Grothendieck group. 

We shall say that the cancellation law holds in M if, whenever x, y, z e M, 
and x + z = y + z, we have x = y. 

We then have an important criterion when the universal map y above is 
injective: 

If the cancellation law holds in M, then the canonical map y of M into its 
Grothendieck group is injective. 

Proof. This is essentially the same proof as when one constructs the nega- 
tive integers from the natural numbers. We consider pairs (x, y) with x,yeM 
and say that (x, y) is equivalent to (x', y') if y -f x' = x + y'. We define addition 
of pairs componentwise. Then the equivalence classes of pairs form a group, 
whose 0 element is the class of (0,0) [or the class of (x, x) for any x e M]. The 
negative of an element (x, y) is (y, x). We have a homomorphism 

x h-► class of (0, x) 

which is injective, as one sees immediately by applying the cancellation law. 
Thus we have constructed a homomorphism of M into a group, which is 
injective. It follows that the universal homomorphism must also be injective. 

Examples. See the example of projective modules in Chapter III, §4. For 
a relatively fancy context, see: K. Kato, Logarithmic structures of Fontaine- 
Illusie, Algebraic Geometry, Analysis and Number Theory, Proc. JAMI Confer- 
ence, J. Igusa (Ed.), Johns Hopkins Press (1989) pp. 195-224. 

Given an abelian group A and a subgroup B, it is sometimes desirable to 
find a subgroup C such that A = B © C. The next lemma gives us a condition 
under which this is true. 

Lemma 7.2. Let A ^ A' be a surjective homomorphism of abelian groups, 
and assume that A' is free. Let B be the kernel of f. Then there exists a 
subgroup C of A such that the restriction of f to C induces an isomorphism 
of C with A', and such that A = B © C. 

Proof. Let {xí} ie/ be a basis of A', and for each i e I, let x t be an element of 
A such that f(x/) = x' f . Let C be the subgroup of A generated by all elements 
x h ie I. If we have a relation 

= 0 

i e / 
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with integers n h almost all of which are equal to 0, then applying/yields 

o = E m ./( x ì) = 'L n > x 'i’ 

iel iel 

whence all n t = 0. Hence our family {x f } ie/ is a basis of C. Similarly, one sees 
that if zeC and f(z) = 0 then z = 0. Hence B nC = 0. Let xe A. Since 
f(x ) e A there exist integers n h ie /, such that 

f(x) = £ n i x 'i- 

i G / 

Applying f to x — £ HjX,, we find that this element lies in the kernel of /, 

I 6 / 

say 

x — Yj n i x i = b e B. 

i e / 

From this we see that x e 5 + C, and hence finally that /1 = B ® C is a direct 
sum, as contended. 

Theorem 7.3. Lcí Abe a free abelian group , and /eí Bbe a subgroup. Then 
B is also a free abelian group , arni í/ic cardinality of a basis of B is ^ the 
cardinality of a basis for A. Any two bases of B have the same cardinality. 

Proof. We shall give the proof only when A is finitely generated, say by a 
basis {xj,..., x„} (n ^ 1), and give the proof by induction on n. We have an 
expression of A as direct sum: 


A = Zx i © • • • © Zx w . 

Let/: A -+ Zx x be the projection, i.e. the homomorphism such that 
/(mjXj + ••• + m n x n ) = rnjXi 

whenever m t e Z. Let B t be the kernel of/| B. Then B x is contained in the free 
subgroup <x 2 ,..., x„>. By induction, B t is free and has a basis with ^ n — 1 
elements. By the lemma, there exists a subgroup Cj isomorphic to a subgroup 
of Zxj (namely the image oîf\B) such that 

B = B { © Cj. 

Since /(B) is either 0 or infinite cyclic, i.e. free on one generator, this proves 
that B is free. 

(When A is not finitely generated, one can use a similar transfinite argument. 
See Appendix 2, §2, the example after Zorn’s Lemma.) 

We also observe that our proof shows that there exists at least one basis 
of B whose cardinality is ^ n. We shall therefore be finished when we prove 
the last statement, that any two bases of B have the same cardinality. Let S 
be one basis, with a finite number of elements m. Let T be another basis, and 
suppose that T has at least r elements. It will suffice to prove that r ^ m (one 
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can then use symmetry). Let p be a prime number. Then B/pB is a direct 
sum of cyclic groups of order p, with m terms in the sum. Hence its order 
is p m . Using the basis T instead of S, we conclude that B/pB contains an r-fold 
product of cyclic groups of order p, whence p r ^ p m , and r ^ m, as was to 
be shown. (Note that we did not assume a priori that T was finite.) 

The number of elements in a basis of a free abelian group A will be called 
the rank of A. 


§8. FINITELY GENERATED ABELIAN GROUPS 

The groups referred to in the title of this section occur so frequently that it is 
worth while to state a theorem which describes their structure completely. 
Throughout this section we write our abelian groups additively. 

Let A be an abelian group. An element a e A is said to be a torsion element 
if it has finite period. The subset of all torsion elements of A is a subgroup of A 
called the torsion subgroup of A. (If a has period m and b has period n then, 
writing the group law additively, we see that a ± b has a period dividing mn.) 

The torsion subgroup of A is denoted by A tor , or simply A t . An abelian 
group is called a torsion group if A = A tor , that is all elements of A are of finite 
order. 

A finitely generated torsion abelian group is obviously finite. We shall begin 
by studying torsion abelian groups. If A is an abelian group and p a prime number, 
we denote by A(p) the subgroup of all elements x e A whose period is a power 
of p. Then A(p) is a torsion group, and is a p-group if it is finite. 

Theorem 8.1 Let A be a torsion abelian group. Then A is the direct sum of 

its subgroups A(p) for all primes p such that A(p) 4 0. 

Proof. There is a homomorphism 

© Mp) - A 

P 

which to each element (x p ) in the direct sum associates the element ^x p in A. 
We prove that this homomorphism is both surjective and injective. Suppose x 
is in the kernel, so ^x p = 0. Let q be a prime. Then 

x„ = 2 (-*„)• 

p*q 

Let m be the least common multiple of the periods of elements x p on the right- 
hand side, with x q 4 0 and p 4 q. Then mx q = 0. But also q r x q = 0 for some 
positive integer r. If d is the greatest common divisor of m, q r then dx q = 0, 
but d = 1, so x q = 0. Hence the kernel is trivial, and the homomorphism is 
injective. 
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As for the surjectivity, for each positive integer m, denote by A m the kernel 
of multiplication by m, i.e. the subgroup of x e A such that mx = 0. We prove: 

Ifm = rs with r , s positive relative prime integers, then A m = A r + A s . 

Indeed, there exist integers u, v such that ur + vs = 1. Then x = urx + vsx, 
and urx e A s while vsx e A n and our assertion is proved. Repeating this process 
inductively, we conclude: 

Ifm= n P e{p) then A m = 2 A p e( P ). 

p\m p\m 

Hence the map (|) A(p) -> A is surjective, and the theorem is proved. 

Example. Let A = Q/Z. Then Q/Z is a torsion abelian group, isomorphic 
to the direct sum of its subgroups (Q/Z)(/?). Each (Q/Z )(p) consists of those 
elements which can be represented by a rational number a/p k with a e Z and k 
some positive integer, i.e. a rational number having only a /?-power in the 
denominator. See also Chapter IV, Theorem 5.1. 

In what follows we shall deal with finite abelian groups, so only a finite 
number of primes (dividing the order of the group) will come into play. In this 
case, the direct sum is “the same as” the direct product. 

Our next task is to describe the structure of finite abelian p-groups. Let 
r ì9 ..., r s be integers ^ 1. A finite /7-group A is said to be of type (p rt 9 ... ,p rs ) 
if A is isomorphic to the product of cyclic groups of orders p ri (i = 1,..., s). 
We shall need the following remark. 

Remark. Let A be a finite abelian p-gvoup. Let b be an element of 
A, b ± 0. Let k be an integer ^ 0 such that p k b ± 0, and let p m be the period 
of p k b. Then b has period p k + m . [Proof. We certainly have p k+m b = 0, and if 
p n b = 0 then first n ^ k, and second n ^ k +m, otherwise the period of p k b 
would be smaller than p m .] 

Theorem 8.2. Every finite abelian p-group is isomorphic to a product of 

cyclic p-groups. If it is of type (p r \ ..., p rs ) with 

r \ = r 2 = = r s = 

then the sequence of integers (r ìf ... t r s ) is uniquely determined. 

Proof. We shall prove the existence of the desired product by induction. 
Let a x e A be an element of maximal period. We may assume without loss of 
generality that A is not cyclic. Let A x be the cyclic subgroup generated by a x , 
say of period p r K We need a lemma. 


Lemma 8.3. Let b be an element of A/A x , ofperiodp r . Then there exists a 
representative a of b in A which also has period p r . 
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Proof. Let b be any representative of h in A. Then p r b lies in A u say 
p r b = na { with some integer n ^ 0. We note that the period of E is ^ the period 
of b. If n = 0 we are done. Otherwise write n = p k fi where pi is prime to /?. 
Then ixa x is also a generator of A x , and hence has period p r '. We may assume 
k ^ r,. Then has period /? ri_ *. By our previous remarks, the element b 

has period 

* p r+r '~ k 

whence by hypothesis, r + r { — k ^ r x and r ^ k. This proves that there exists 
an element ce A x such that p r b = p r c. Let a = b — c. Then a is a representative 
for B in A and p r a = 0. Since period (a) ^ p r we conclude that a has period 
equal to p r . 

We return to the main proof. By induction, the factor group AjA x has a 
product expression 

A/A ^ = ^2 X • • ■ X Â s 

into cyclic subgroups of orders p r \ ..., p rs respectively, and we may assume 
r 2 ;> • • • ^ r s . Let be a generator for +, (i = 2,..., s) and let ^ be a 
representative in ,4 of the same period as Let A { be the cyclic subgroup 
generated by a t . We contend that A is the direct sum of A ì9 . . ., A s . 

Given x e A, let x denote its residue class in A/A v There exist integers 
nti ^ 0 (i = 2,..., s) such that 

x = m 2 â 2 -f • • • + m s â s . 

Hence x — m 2 a 2 — • • • - m s a s lies in A u and there exists an integer ^ 0 
such that 

x = mjflj -f m 2 a 2 -f • • ■ + m s a s . 

Hence A x -f • • • -f A s = A. 

Conversely, suppose that m u ... ,m s are integers ^ 0 such that 
0 = m x a x + • • • + m s a s . 

Since a- x has period p n (i = 1,..., s), we may suppose that m t < p n . Putting 
a bar on this equation yields 

0 = m 2 â 2 + • • • + m s â s . 

Since A/A x is a direct product of Â 2 ,..., À s we conclude that each m t = 0 for 
i = 2,..., s. But then m x = 0 also, and hence all m t = 0 (i = 1,..., s). From 
this it follows at once that 

(A x + • • • + A/) n A i+Í = 0 

for each i ^ 1, and hence that A is the direct product of A u ..., A S9 as desired. 

We prove uniqueness, by induction. Suppose that A is written in two ways 
as a direct sum of cyclic groups, say of type 

(p r \... 9 p r *) and (p m \... 9 p m -) 
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with ri = m '' = fs = 1 and ^ ^ m k ^ 1. Then p,4 is also a p-group, 

of order strictly less than the order of A , and is of type 

(P ri " 1 ,...,p rs ~ 1 ) and 

it being understood that if some exponent r, or mj is equal to 1, then the factor 
corresponding to 

p r,_1 or p mj _ 1 

in p/1 is simply the trivial group 0. By induction, the subsequence of 

(r t — 1,..., r s — 1) 

consisting of those integers ^ 1 is uniquely détermined, and is the same as 
the corresponding subsequence of 

(^i - 1,..., m k - 1). 

In other words, we have r t — 1 = m t — 1 for all those integers i such that 
r, — 1 or — 1 ^ 1. Hence r t = m { for all these integers i, and the two se- 
quences 

(p ri ,..., p rs ) and (p mi ,..., p mk ) 

can differ only in their last components which can be equal to p. These cor- 
respond to factors of type (p,..., p) occurring say v times in the first sequences 
and p times in the second sequence. Thus for some integer n, A is of type 

(P ri , • • •, P rn , P,..., P ) and (p ri ,..., p r \ p,..., p). 

v times n times 

Thus the order of /1 is equal to 

p ri+- +r np v _ p n + ‘ +r np n, 

whence v = p, and our theorem is proved. 

A group G is said to be torsion free, or without torsion, if whenever an 
element x of G has finite period, then x is the unit element. 

Theorem 8.4. Let A be a finitely generated torsion-free abelian group. Then 
A isfree. 

Proof Assume A # 0. Let S be a finite set of generators, and let x í9 ..., x n 
be a maximal subset of S having the property that whenever v l5 ..., v n are 
integers such that 

VjXj + • • * + v n x n = 0, 

then v,- = 0 for all j. (Note that n ^ 1 since A ^ 0). Let B be the subgroup 
generated by x l9 ..., x n . Then B is free. Given yeA there exist integers 
m í9 ..., m n9 m not all zero such that 

my + /iîjXì + • • • + m n x n = 0, 
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by the assumption of maximality on x ì9 ...,x n . Furthermore, m # 0; other- 
wise all mj = 0. Hence my lies in B. This is true for every one of a finite set of 
generators y of A , whence there exists an integer m # 0 such that mA cz B. 
The map 

xi—► mx 

of A into itself is a homomorphism, having trivial kernel since A is torsion free. 
Hence it is an isomorphism of A onto a subgroup of B. By Theorem7.3 of the 
preceding section, we conclude that mA is free, whence A is free. 

Theorem 8.5. Let A be a finitely generated abelian group , and let A tor be 
the subgroup consisting ofall elements of A having finite period. Then A tor is 
finite, andA/A tor is free. There exists a free subgroup B ofA such that A is the 
direct sum of A tor and B. 

Proof. We recall that a finitely generated torsion abelian group is obviously 
finite. Let A be finitely generated by n elements, and let F be the free abelian 
group on n generators. By the universal property, there exists a surjective 
homomorphism 

F A 

of F onto A. The subgroup (p~ l (A tor ) of F is finitely generated by Theorem 7.3. 
Hence A tor itself is finitely generated, hence finite. 

Next, we prove that A/A tor has no torsion. Let x be an element of A/A tor 
such that mx = 0 for some integer m 0. Then for any representative of x of 
x in A, we have mx e A tor , whence qmx = 0 for some integer q 0. Then 
x e A tor , so x = 0, and A/A tor is torsion free. By Theorem 8.4, A/A tor is free. 
We now use the lemma of Theorem 7.3 to conclude the proof. 

The rank of A/A tor is also called the rank of A. 

For other contexts concerning Theorem 8.5, see the structure theorem for 
modules over principal rings in Chapter III, §7, and Exercises 5, 6, and 7 of 
Chapter III. 


§9. THE DUAL GROUP 

Let A be an abelian group of exponent m ^ 1. This means that for each 
element x e A we have mx = 0. Let Z m be a cyclic group of order m. We denote 
by A A , or Hom(A, Z m ) the group of homomorphisms of A into Z m , and call it 
the dual of A. 

Let /: A-+ B be a homomorphism of abelian groups, and assume both have 
exponent m. Then /induces a homomorphism 

/a :£ A -* A A . 
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Namely, for each if/ e B A we define/ A (tA) = <A ° /• It is trivially verified that/ A 
is a homomorphism. The properties 

id A = id and (f° g) A = g A ° / A 

are trivially verified. 

Theorem 9.1. If A is a finite abelian group, expressed as a product 
A = B x C, then A A is isomorphic to B A x C A (under the mapping described 
below). A finite abelian group is isomorphic to its own dual. 

Proof. Consider the two projections 

B x C 


B C 

of B x C on its two components. We get homomorphisms 

(B x C) A 


5 A C A 

and we contend that these homomorphisms induce an isomorphism of B A x C A 
onto (B x C) A . 

In fact, let if/ 2 be in Hom(5, Z^) and Hom(C, Z m ) respectively. Then 
(ỳi, i// 2 ) £ B A X C A , and we have a corresponding element of (B X C) A by 
defining 

(</fi> íA 2)U» jO = <AiM + 

for (x, y) e B x C. In this way we get a homomorphism 

B A x C A (B x C) A . 

Conversely, let ỳ e (B x C) A . Then 

il/(x 9 y) = il/(x , 0 ) + i//(0 9 y). 

The function t/^ on B such that ỳfx) = ils(x,0) is in Z? A , and similarly the 
function ifj 2 on C such that il/ 2 (y) = i/<0, y) is in C A . Thus we get a homomorphism 

(5 x C) A B A x C A , 

which is obviously inverse to the one we defined previously. Hence we obtain 
an isomorphism, thereby proving the first assertion in our theorem. 

We can write any finite abelian group as a product of cyclic groups. Thus 
to prove the second assertion, it will suffice to deal with a cyclic group. 

Let A be cyclic, generated by one element x of period n. Then n\m 9 and Z m 
has precisely one subgroup of order n, Z n , which is cyclic (Proposition 4.3(iv)). 
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If ỳ : A —> Z m is a homomorphism, and x is a generator for A, then the period 
of x is an exponent for il/(x), so that and hence is contained in Z n . 

Let y be a generator for Z n . We have an isomorphism 

1 A 1 : A -* Z n 

such that ^j(x) = y. For each integer k with 0 ^ k < n we have the homo- 
morphism /c^j such that 

(/c^iXx) = /c-iA i(*) = ^i(M- 

In this way we get a cyclic subgroup of consisting of the n elements kil/\ 
(0 ^ k < n). Conversely, any element i/j of A A is uniquely determined by its 
effect on the generator x, and must map x on one of the n elements 
kx (0 ^ k < n) of Z n . Hence if/ is equal to one of the maps &i/q. These maps 
constitute the full group A A 9 which is therefore cyclic of order n , generated by 
if/\. This proves our theorem. 

In considering the dual group, we take various cyclic groups Z m . There are 
many applications where such groups occur, for instance the group of m-th roots 
of unity in the complex numbers, the subgroup of order m of Q/Z, etc. 

Let A, A' be two abelian groups. A bilinear map of A x A' into an abelian 
group C is a map 

A x A' ► C 

denoted by 

(x, x') i—► <x, x') 

having the following property. For each xeA the function x'(x, x') 
is a homomorphism, and similarly for each x' e Á the function x ► <x, x') is a 
homomorphism. 

As a special case of a bilinear map, we have the one given by 
A x Hom(A, C) -► C 

which to each pair (x,/) with xeA and fe Hom(/4, C) associates the element 
/(x) in C. 

A bilinear map is also called a pairing 

An element xeA is said to be orthogonal (or perpendicular) to a subset S' 
of A' if <x, x') = 0for allx'e S'. It isclear that theset ofxeA orthogonal to S' 
is a subgroup of A. We make similar definitions for elements of A', orthogonal 
io subsets of A. 

The kernel of our bilinear map on the left is the subgroup of A which is 
orthogonal to all of A'. We define its kernel on the right similarly. 

Given a bilinear map A x A' -► C, let B, B' be the respective kernels of our 
bilinear map on the left and right. An element x' of A' gives rise to an element of 
Hom/4, C) given by xi—► <x, x'), which we shall denote by i p x >. Since ij/ X ' 
vanishes on B we see that \l/ x > is in fact a homomorphism of A/B into C. 
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Furthermore, i j/ x . = ip^ if x', y' are elements of A' such that 

x' = y' (mod B'). 

Hence i j/ is in fact a homomorphism 

0 -► A'/B' -* Hom(^4/B, C), 

vvhich is injective since we defined B' to be the group orthogonal to A. 
Similarly, we get an injective homomorphism 

0 ->■ A/B -> Wom(A'/B', C). 

Assume that C is cyclic of order m. Then for any x' e A' we have 

mip x ■ = lAmx' = 0 , 

whence A'/B' has exponent m. Similarly, A/B has exponent m. 

Theorem 9.2. Let A x A' -*■ C be a bilinear map oftwo abelian groups into 
a cyclic group C oforder m. Let B, B' be its respective kernels on the left and 
right. Assume thatA'/B' isfinite. ThenA/B isfìnite, andA'/B' is isomorphic 
to the dual group ofA/B (under our map ip). 

Proof. The injection of A/B into Hom (A'/B', C) shows that A/B is finite. 
Furthermore, we get the inequalities 

ord A/B S ord(A'/fl') A = ordA'/B' 

and 

ord A'/B' S ord(A/fl) A = ord A/B. 

From this it follows that our map ip is bijective, hence an isomorphism. 

Corollary 9.3. Let A be afinite abelian group, B a subgroup, A A the dual 
group, and B L the set of tp e A A such that <p(B) = 0. Then we have a natural 
isomorphism ofA A /B L with B A . 

Proof. This is a special case of Theorem 9.2. 


§10. INVERSE LIMIT AND COMPLETION 

Consider a sequence of groups {G„} (n = 0, 1, 2,...), and suppose given 
for all n § 1 homomorphisms 

fn'- G n-\- 

Suppose first that these homomorphisms are surjective. We form infinite 
sequences 

x = (* 0 , x x , x 2 ,...) such that = f„(x„). 
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By the assumption of surjectivity, given x n e G n we can always lift x n to G n+Ì 
via f n+ so such infinite sequences exist, projecting to any given x 0 . We can 
define multiplication of such sequences componentwise, and it is then imme- 
diately verified that the set of sequences is a group, called the inverse limit 
of the family {(G n ,f n )}. We denote the inverse limit by lim (G n ,f n ), or simply 
lim G n if the reference to f n is clear. 

Example. Let A be an additive abelian group. Let p be a prime number. 
Let p A : A ^ A denote multiplication by p. We say that A is /?-divisible if p A is 
surjective. We may then form the inverse limit by taking A n = A for all n, and 
fn ~ Pa f° r n ' The inverse limit is denoted by V p (A). We let T p (A) be the 
subset of V p (A) consisting of those infinite sequences as above such that 
x 0 = 0. Let A[p n ] be the kernel of p A . Then 

T p (A) = lim A[p n+l ). 

The group T p (A) is called the Tate group associated with the p-divisible group 
A. It arose in fairly sophísticated contexts of algebraic geometry due to Deuring 
and Weil, in the theory of elliptic curves and abelian varieties developed in the 
1940s, which are far afield from this book. Interested readers can consult books 
on those subjects. 

The most commonp-divisible groups are obtained as follows. First, let A be 
the subgroup of Q/Z consisting of those rational numbers (mod Z) which can 
be expressed in the form a/p k with some positive integer k, and a e Z. Then A 
is p-divisible. 

Second, let p. [p n ] be the group ofp w -th roots of unity in thecomplex numbers. 
Let \i[p°°] be the union of all \i[p n ] for all n. Then plp 00 ] is p-divisible, and 
isomorphic to the group A of the preceding paragraph. Thus 

T p (\i) = lim \i[p n ). 

These groups are quite important in number theory and algebraic geometry. We 
shall make further comments about them in Chapter III, §10, in a broader context. 

Example. Suppose given a group G. Let {H n } be a sequence of normal 
subgroups such that H n D H n+l for all n. Let 

fn'- G/H n -* G/H n - { 

be the canonical homomorphisms. Then we may form the inverse limitlim G/H n . 
Observe that G has a natural homomorphism 

q: G -» lim G/H n , 

which sends an element x to the sequence (..., x n , ...), where x n = image of 
x in G/H n . 

Example. Let G n = Z /p n+ì Z for each n ^ 0. Let 
/„: Z/p n+l Z-+ Z/p n Z 

be the canonical homomorphism. Then f n is surjective, and the limit is called 
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the group of p-adic integers, denoted by Z p . We return to this in Chapter III, 
§10, where we shall see that Z p is also a ring. 

After these examples, we want to consider the more general situation when 
one deals not with a sequence but with a more general type of family of groups, 
which may not be commutative. We therefore defìne inverse limits of groups in 
general. 

Let / be a set of indices. Suppose given a relation of partial ordering in /, 
namely for some pairs (ij) we have a relation i ^ j satisfying the conditions: 
For all i,y, k in /, we have i ^ /; if i g j and j ^ k then i ^ /c; if i ^ j and j ^ i 
then i = j. We say that / is directed if given i , j e /, there exists k such that 
i ^ k and j ^ k. Assume that / is directed. By an (inversely) directed family 
of groups, we mean a family {G,}^/ and for each pair i = j a homomorphism 

/i : Gj —» Gt 

such that, whenever k ^ ì ^ j we have 

/l°/i=/i and /í = id. 

Let G = riG, be the product of the family. Let T be the subset of G consisting 
of all elements (jt,) with jc, e G, such that for all i and j ^ i we have 

/{(*>) = 

Then T contains the unit element, and is immediately verified to be a subgroup 
of G. We call T the inverse limit of the family, and write 

r = lim G r 

Exampie. Let G be a group. Let be the family of normal subgroups of 
finite index. If H , K are normal of finite index, then so is H D K , so ÍF is a 
directed family. We may then form the inverse limit lim G/H with H e J. There 
is a variation on this theme. Instead of fF, let p be a prime number, and let 5 p 
be the family of normal subgroups of finite index equal to a power of p. Then 
the inverse limit with respect to subgroups H e $ p can also be taken. (Verify 
that if H, K are normal of finite p- power index, so is their intersection.) 

A group which is an inverse limit of finite groups is called profinite. 

Example from applications. Such inverse limits arise in Galois theory. 
Let k be a field and let A be an infinite Galois extension. For example, k = Q 
and A is an algebraic closure of Q. Let G be the Galois group; that is, the group 
of automorphisms of A over k. Then G is the inverse limit of the factor groups 
G/H , where H ranges over the Galois groups of A over K , with K ranging over 
all finite extensions of k contained in A. See the Shafarevich conjecture in the 
chapter on Galois theory, Conjecture 14.2 of Chapter VI. 

Similarly, consider a compact Riemann surface X of genus ^ 2. Let 
p : X' —> X be the universal covering space. Let C(X) = F and C(X f ) = F f be 
the function fields. Then there is an embedding n\(X) Gal (F f /F). It is 
shown in complex analysis that n\(X) is a free group with one commutator 
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relation. The full Galois group of F'/F is the inverse limit with respect to the 
subgroups of finite index, as in the above general situation. 


Completion of a group 

Suppose now that we are given a group G, and first, for simplicity, suppose 
given a sequence of normal subgroups {H r } with H r D H r+Ì for all n , and such 
that these subgroups have finite index. A sequence {jc n } in G will be called a 
Cauchy sequence if given H r there exists N such that for all m, n ^ N we have 
x n x~ l e H r . We say that {x n } is a null sequence if given r there exists N such 
that for all n ^ N we have x n e H r . As an exercise, prove that the Cauchy 
sequences form a group under termwise product, and that the null sequences 
form a normal subgroup. The factor group is called the completion of G (with 
respect to the sequence of normal subgroups). 

Observe that there is a natural homomorphism of G into its completion; 
namely, an element x e G maps to the sequence (x, x, jc, . . .) modulo null 
sequences. The kernel of this homomorphism is the intersection H// r , so if this 
intersection is the unit element of G, then the map of G into its completion is 
an embedding. 

Theorem 10.1. The completion and the inverse limit lim G/H r are isomorphic 

under natural mappings. 

Proof. We give the maps. Let jc = {jc„} be a Cauchy sequence. Given r, 
for all n sufficiently large, by the definition of Cauchy sequence, the class of x n 
mod H r is independent of n. Let this class be jc(r). Then the sequence 
(jc(1), jc(2), . . .) defines an element of the inverse limit. Conversely, given an 
element (x Ì9 x 2 ,. . .) in the inverse limit, with x n e G/H n , let jc„ be a representa- 
tive in G. Then the sequence {jc„} is Cauchy. We leave to the reader to verify 
that the Cauchy sequence {jc„} is well-defined modulo null sequences, and that 
the maps we have defined are inverse isomorphisms between the completion and 
the direct limit. 

We used sequences and denumerability to make the analogy with the con- 
structionof the real numbers clearer. In general, given the family ÍF, one considers 
families {x H } H ^<$ of elements x H e G. Then the condition for a Cauchy family 
reads: given H 0 e & there exists H x e 5 such that if K , K f are contained in H x , 
then x K x K l e H 0 . In practice, one can work with sequences, because groups that 
arise naturally are such that the set of subgroups of finite index is denumerable. 
This occurs when the group G is countably generated. 

More generally, a family {H f } of normal subgroups of finite index is called 
cofìnal if given H e 5 there exists i such that H t C H. Suppose that there exists 
such a family which is denumerable; that is, i = 1,2,... ranges over the positive 
integers. Then it is an exercise to show that there is an isomorphism 

lim G/H t « lim G/H, 

Ì HeS 
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or equivalently, that the completion of G with respect to the sequence {//•} is 
“the same” as the completion with respect to the full family $. We leave this 
verification to the reader. 

The process of completion is frequent in mathematics. For instance, we shall 
mention completions of rings in Chapter III, §10; and in Chapter XII we shall 
deal with completions of fields. 


§11. CATEGORIES AND FUNCTORS 

Before proceeding further, it will now be convenient to introduce some new 
terminology. We have met already several kinds of objects: sets, monoids, 
groups. We shall meet many more, and for each such kind of objects we define 
special kinds of maps between them (e.g. homomorphisms). Some formal 
behavior will be common to all of these, namely the existence of identity maps 
of an object onto itself, and the associativity of maps when such maps occur in 
succession. We introduce the notion of category to give a general setting for all 
of these. 

A category fí consists of a collection of objects Ob(tì); and for two objects 
A, B e Ob(â) a set Mor/4, B ) called the set of morphisms of A into B ; and for 
three objects A, B,C e Ob(û) a law of composition (i.e. a map) 

Mor(£, C) x Mor(A, B) - Mor(A, C) 
satisfying the following axioms: 

CAT 1. Two sets Mor (A,B) and Mor(A',B') are disjoint unless A = A' 
and B = B', in which case they are equal. 

CAT 2. For each object A of Ofc there is a morphism id x g Mor(A, A) 
which acts as left and right identity for the elements of Mor(A, B) and 
Mor(B, A) respectively, for all objects B e Ob(G). 

CAT 3. The law of composition is associative (when defined), i.e. given 
/g Mor(A, B), g e Mor (B, C) and h e Mor(C, D) then 


(hog)of= ho(gof\ 


for all objects A, B, C, D of (3. 

Here we write the composition of an element g in Mor(£, C) and an element 
/in Mor(/l, B) as g © f to suggest composition of mappings. In practice, in this 
book we shall see that most of our morphisms are actually mappings, or closely 
related to mappings. 

The collection of all morphisms in a category Q will be denoted by Ar(Gfc) 
(“arrows of Cfc”). We shall sometimes use the symbols “/g Ar(Cfc)” to mean 
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that / is a morphism of G, i.e. an element of some set Mor(/l, B) for some 
A, BeOb(d). 

By abuse of language, we sometimes refer to the collection of objects as the 
category itself, if it is clear what the morphisms are meant to be. 

An element/e Mor(A, B) is also written f:A^B or 

A morphism / is called an isomorphism if there exists a morphism g: B -* A 
such that g o/and/© g are the identities in Mor(A, A) and Mor(£, B) respec- 
tively. If A = B, then we also say that the isomorphism is an automorphism. 

A morphism of an object A into itself is called an endomorphism. The set of 
endomorphisms of A is denoted by End(/4). It follows at once from our axioms 
that End/4) is a monoid. 

Let A be an object of a category û. We denote by Aut(/4) the set of auto- 
morphisms of A. This set is in fact a group, because all of our definitions are 
so adjusted so as to see immediately that the group axioms are satisfied (associa- 
tivity, unit element, and existence of inverse). Thus we now begin to see some 
feedbaclc between abstract categories and more concrete ones. 

Examples. Let S be the category whose objects are sets, and whose 
morphisms are maps between sets. We say simply that S is the category of sets. 
The three axioms CAT 1, 2, 3 are trivially satisfied. 

Let Grp be the category of groups, i.e. the category whose objects are groups 
and whose morphisms are group-homomorphisms. Here again the three axioms 
are trivially satisfied. Similarly, we have a category of monoids, denoted by 
Mon. 

Later, when we define rings and modules, it will be clear that rings form a 
category, and so do modules over a ring. 

It is important to emphasize here that there are categories for which the set 
of morphisms is not an abelian group. Some of the most important examples 
are: 

The category e°, whose objects are open sets in R n and whose morphisms 
are continuous maps. 

The category C 00 with the same objects, but whose morphisms are the C 00 
maps. 

The category Hol, whose objects are open sets in C n , and whose morphisms 
are holomorphic maps. In each case the axioms of a category are verified, because 
for instance for Hol, the composite of holomorphic maps is holomorphic, and 
similarly for the other types of maps. Thus a C°-isomorphism is a continuous 
map/: U V which has a continuous inverse g: V —> U. Note that a map may 
be a C°-isomorphism but not a C°°-isomorphism. For instance, jc i —^ jc 3 is a C°- 
automorphism of R, but its inverse is not differentiable. 

In mathematics one studies manifolds in any one of the above categories. 
The determination of the group of automorphisms in each category is one of the 
basic problems of the area of mathematics concerned with that category. In 
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complex analysis, one determines early the group of holomorphic automorphisms 
of the unit disc as the group of all maps 


with 6 real and c e C, |c| < 1. 

Next we consider the notion of operation in categories. First, observe that 
if G is a group, then the G-sets form a category, whose morphisms are the maps 
/: S —» S' such that f(xs ) = xf(s) for x e G and s e S. 

More generally, we can now define the notion of an operation of a group G 
on an object in any category. Indeed, let Ci be a category and ,4eOb(G). 
By an operation of G on A we shall mean a homomorphism of G into the group 
Aut(>l). In practice, an object A is a set with elements, and an automorphism 
in Aut(A) operates on A as a set, i.e. induces a permutation of A. Thus, if we 
have a homomorphism 

p : G —» Aut(A), 

then for each x e G we have an automorphism p(x) of A which is a permutation 
of A. 

An operation of a group G on an object A is also called a representation of 
G on A, and one then says that G is represented as a group of automorphisms 
of A. 

Examples. One meets representations in many contexts. In this book, we 
shall encounter representations of a group on finite-dimensional vector spaces, 
with the theory pushed to some depth in Chapter XVIII. We shall also deal with 
representations of a group on modules over a ring. In topology and differential 
geometry, one represents groups as acting on various topological spaces, for 
instance spheres. Thus if X is a differential manifold, or a topological manifold, 
and G is a group, one considers all possible homomorphims of G into Aut(X), 
where Aut refers to whatever category is being dealt with. Thus G may be 
represented in the group of C°-automorphims, or C°°-automorphisms, or analytic 
automorphisms. Such topological theories are not independent of the algebraic 
theories, because by functoriality, an action of G on the manifold induces an 
action on various algebraic functors (homology, A'-functor, whatever), so that 
topological or differential problems are to some extent analyzable by the functorial 
action on the associated groups, vector spaces, or modules. 

Let A, B be objects of a category Ct. Let Iso(A, B) be the set of isomorphisms 
of A with B. Then the group Aut(B) operates on Iso(A, B) by composition; 
namely, if u e Iso(A, B) and v e Aut(£), then (v,u)\-^v°u gives the operation. 
If Uq is one element of Iso(A, B ), then the orbit of u 0 is all of Iso(A, B ), so 
v \-> v ° u 0 is a bijection Aut(Z?) —» Iso(A, B). The inverse mapping is given by 
u i-» Uq u 0 l . This trivial formalism is very basic, and is applied constantly to 
each one of the classical categories mentioned above. Of course, we also have 
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a similar bijection on the other side, but the group Aut(A) operates on the right 
of Iso(A, B) by composition. Furthermore, if u: A —» B is an isomorphism, then 
Aut(A) and Aut(Z?) are isomorphic under conjugation, namely 

vv uwu _1 is an isomorphism Aut(A) Aut(Z?). 

Two such isomorphisms differ by an inner automorphism. One may visualize 
this system via the following commutative diagram. 


Let p : G —> Aut(A) and p': G —> Aut(A') be representations of a group G 
on two objects A and A' in the same category. A morphism of p into p ' is a 
morphism h: A A' such that the following diagram is commutative for all 
x e G: 


It is then clear that representations of a group G in the objects of a category fí 
themselves form a category. An isomorphism of representations is then an 
isomorphism h : A —► A' making the above diagram commutative. An isomor- 
phism of representations is often called an equivalence, but I don’t like to tamper 
with the general system of categorical terminology. Note that if h is an isomor- 
phism of representations, then instead of the above cômmutative diagram, we 
let [h\ be conjugation by h , and we may use the equivalent diagram 


r Aut(A) 


Aut(A') 

Consider next the case where Ct is the category of abelian groups, which we 
may denote by Ab. Let A be an abelian group and G a group. Given an operation 
of G on the abelian group A, i.e. a homomorphism 

p : G —> Aut(A), 

let us denote by x • a the element p x (a). Then we see that for all x, y e G, a , 
6 e A, we have: 



CATEGORIES AND FUNCTORS 57 


I, §11 

x-(y • à) = (xy) • a, x • (a + b) = x • a + x • b, 
e • a = a, x • 0 = 0. 

We observe that when a group G operates on itself by conjugation, then not 
only does G operate on itself as a set but also operates on itself as an object in the 
category of groups, i.e. the permutations induced by the operation are actually 
group-automorphisms. 

Similarly, we shall introduce later other categories (rings, modules, fields) 
and we have given a general definition of what it means for a group to operate 
on an object in any one of these categories. 

Let Ci be a category. We may take as objects of a new category G the 
morphisms of û. If and /': A' -» B' are two morphisms in G (and 

thus objects of G\ then we define a morphism/-►/' (in G) to be a pair of 
morphisms (</), ip) in Cfc making the following diagram commutative: 



In that way, it is clear that G is a category. Strictly speaking, as with maps of 
sets, we should index (</?, ij/) by / and f' (otherwise CAT 1 is not necessarily 
satisfied), but such indexing is omitted in practice. 

There are many variations on this example. For instance, we could restrict 
our attention to morphisms in Cfc which have a fixed object of departure, or those 
which have a fixed object of arrival. 

Thus let A be an object of Cfc, and let Cfc^ be the category whose objects are 
morphisms 

f:X^A 

in Cfc, having A as object of arrival. A morphism in Cfc^ from /: X -* A to 
g : Y -► A is simply a morphism 

h: X - Y 

in Cfc such that the diagram is commutative: 


X—'■Y 



Universal objects 

Let G be a category. An object P of G is called universally attracting if there 
exists a unique morphism of each object of G into P 9 and is called universally 
repelling if for every object of G there exists a unique morphism of P into this 
object. 
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When the context makes our meaning clear, we shall call objects P as above 
universal. Since a universal object P admits the identity morphism into itself, 
it is clear that if P , P' are two universal objects in C, then there exists a unique 
isomorphism between them. 

Examples. Note that the trivial group consisting only of one element is 
universal (repelling and attracting) in the category of groups. Similarly, in 
Chapter II on rings, you will see that the integers Z are universal in the category 
of rings (universally repelling). 

Next let S be a set. Let G be the category whose objects are maps f:S-+A 
of S into abelian groups, and whose morphisms are the obvious ones: If 
/: S A and /': S -> A' are two maps into abelian groups, then a morphism 
of / into /' is a (group) homomorphism g: A A' such that the usual dia- 
gram is commutative, namely g °f = /'. Then the free abelian group generated 
by S is universal in this category. This is a reformulation of the properties we 
have proved about this group. 

Let M be a commutative monoid and let y: M K(M) be the canonical 
homomorphism of M into its Grothendieck group. Then y is universal in the 
category of homomorphisms of M into abelian groups. 

Throughout this book in numerous situtaions, we define universal objects. 
Aside from products and coproducts which come immediately after these exam- 
ples, we have direct and inverse limits; the tensor product in Chapter XVI, §1; 
the alternating product in Chapter XIX, §1; Clifford algebras in Chapter XIX, 
§4; ad lib. 

We now turn to the notion of product in an arbitrary category. 

Products and coproducts 

Let û be a category and let A, B be objects of G. By a product of A, B in fí 
one means a triple (P 9 f g) consisting of an object P in Gfc and two morphisms 

P 



A B 


satisfying the following condition: Given two morphisms 
cp : C -► A and ý :C -+ B 

in G, there exists a unique morphism h : C -+ P which makes the following 
diagram commutative: 

A 



In other words, (p = /o h and ij/ = g o h. 
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More generally, given a family of objects {Ai} ieI in G, a product for this 
family consists of (P, {f} ieI ), where P is an object in G and {f} ieI is a 
family of morphisms 

f:P-+A i , 

satisfying the following condition: Given a family of morphisms 

Qi-C^ A i9 

there exists a unique morphism h: C -> P such that f ° h = g t for all i. 

Example. Let G be the category of sets, and let {Aj /e/ be a family of sets. 
Let A = rU be their cartesian product, and let p z : A A t be the projection 

/e/ 

on the /-th factor. Then ( A , {/? z }) clearly satisfies the requirements of a product 
in the category of sets. 

As a matter of notation, we shall usually write A x B for the product of two 
objects in a category, and Y[ A t for the product of an arbitrary family in a 

iel 

category, following the same notation as in the category of sets. 

Example. Let {G z } ze/ be a family of groups, and let G = Yl G z be their direct 
product. Let p t : G —» G z be the projection homomorphism. Then these constitute 
a product of the family in the category of groups. 

Indeed, if {g t :G' -* G z } íe/ is a family of homomorphisms, there is a unique 
homomorphismg : G' -* P] G z which makes the required diagram commutative. 
It is the homomorphism such that g{x)i = gfx') for x' e G' and each i e I. 

Let A, B be objects of a category û. We note that the product of A, B is 
universal in the category whose objects consist of pairs of morphisms 
/: C —» A and g : C —> B in û, and whose morphisms are described as follows. 
Let f’ : C' A and g’: C' —> B be another pair. Then a morphism from the 
first pair to the second is a morphism h : C -» C' in G, making the following 
diagram commutative: 

C 



The situation is similar for the product of a family {A z } ze/ . 

We shall also meet the dual notion: Let {/l z } ie/ be a family of objects in a 
category G. By their coproduct one means a pair (S, {/} ze/ ) consisting of an 
object S and a family of morphisms 

{fC.At-S}, 

satisfying the following property. Given a family of morphisms {g ( : A x -> C}, 
there exists a unique morphism h :S -► C such that h °f = gf z - for all i. 
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In the product and coproduct, the morphism h will be said to be the 
morphism induced by the family {g t }. 

Examples. Let S be the category of sets. Then coproducts exist. For 
instance, let 5, S' be sets. Let T be a set having the same cardinality as S' and 
disjoint from S. Let f x : S -► S be the identity, and f 2 : S' -► T be a bijection. 
Let U be the union of S and T. Then (U 9 f u f 2 ) is a coproduct for 5, S', viewing 
f l9 f 2 as maps into U. 

Let S 0 be the category of pointed sets. Its objects consist of pairs (S, x) 
where S is a set and x is an element of S. A morphism of (5, x) into (S', x') in this 
category is a map g : S -* S' such that g(x) = x'. Then the coproduct of (S, x) 
and (S', x') exists in this category , and can be constructed as follows. Let T be 
a set whose cardinality is the same as that of S', and such that T fl S = {x}. 
Let U = S U T, and let 

/ì: (S, *)-(!/, x) 

be the map which induces the identity on S. Let 

/ 2 :(S', x')^(L, x) 

be a map sending x' to x and inducing a bijection of S' — {x'} on T - {x}. 
Then the triple (( U , x) 9 f lt f 2 ) is a coproduct for (S, x) and (S', x') in the category 
of pointed sets. 

Similar constructions can be made for the coproduct of arbitrary families 
of sets or pointed sets. The category of pointed sets is especially important in 
homotopy theory. 

Coproducts are universal objects. Indeed, let û be a category, and let {Aj 
be a family of objects in d. We now defineC. We let objects of Cbe the families 
of morphisms {/•: A, —> B} ieí and given two such families, 

{f : -> B} and {f-.A^B'}, 

we define a morphism from the first into the second to be a morphism (p : B -+ B' 
in G such that (p° f = f'i for all i. Then a coproduct of {AJ is simply a universal 
object in G. 

The coproduct of {AJ will be denoted by 

u^. 

ÍG/ 

The coproduct of two objects A, B will also be denoted by A II B. 

By the general uniqueness statement, we see that it is uniquely determined, up 
to a unique isomorphism. 

Example. Let R be the category of commutative rings. Given two such 
rings A, B one may form the tensor product, and there are natural ring-homo- 
morphisms A-+ A ® B and B —» A ® B such that 

1 and b i-> 1 ® b for a e A and b e B. 

Then the tensor product is a coproduct in the category of commutative rings. 
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Fiber products and coproducts 
Pull-backs and push-outs 

Let e bea category. Let Z be an object of C. Then we have a new category, 
that of objects over Z, denoted by C z . The objects of G z are morphisms: 

f:X-+Z in C 

A morphism from / to g : Y -► Z in C z is merely a morphism h : A" -► Y in G 
which makes the following diagram commutative. 


X— h —+Y 



Z 


A product in G z is called the fiber product of / and g in G and is denoted 
by X X Z L, together with its natural morphisms on X , T over Z, which are 
sometimes not denoted by anything, but which we denote by p ì9 p 2 . 



Fibered products and coproducts exist in the category of abelian groups 

The fibered product of two homomorphisms /: X —> Z and g : Y —> Z is the 
subgroup ofXxf consisting of all pairs (jc, y ) such that 

/(*) = g(y)- 

The coproduct of two homomorphisms /: Z -► X and g : Z -► T is the 
factor group (A" © where W is the subgroup of X © Y consisting of all 
elements (/(z), —g(z)) with z e Z. 

We leave the simple verifìcation to the reader (see Exercises 50-56). 

In the fiber product diagram, one also calls p x the pull-back of g by /, and 
p 2 the pull-back of /by g. The fiber product satisfies the following universal 
mapping property: 

Given any object T in G and morphisms mahing the following diagram 
commutative: 


X 


z 
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there exists a unique morphism T —» X x z Y making the follomng diagram 
commutative: 


X x z Y 



Dually, we have the notion of coproduct in the category of morphisms f\Z-*X 
with a fixed object Z as the object of departure of the morphisms. This category 
could be denoted by G z . We reverse the arrows in the preceding discussion. 
Given two objects / and g\Z Y in this category, we have the notion of their 
coproduct. It is denoted by X U z Y , with morphisms q ì9 q 2 , as in the following 
commutative diagram: 



satisfying the dual universal property of the fiber product. We call it the fìbered 
coproduct. We call q x the push-out of g by /, and q 2 the push-out of/by g. 

Example. Let S be the category of sets. Given two maps /, g as above, 
their product is the set of all pairs (x, y) e X x Y such that f(x) = g(y). 

Functors 

Let G, (B be categories. A covariant functor F of (i into (B is a rule which 
to each object A in G associates an object F(A) in (B, and to each morphism 
f.A^B associates a morphism F(f) \ F(A) -► F(B) such that: 

FUN 1. For all A in G we have F(id A ) = id F{A) . 

FUN 2. If/: A -* B and g \ B -+ C are two morphisms of Ct then 

F(9 °f) ~ F(g) o F(f). 

Example. If to each group G we associate its set (stripped of the group 
structure) we obtain a functor from the category of groups into the category of 
sets, provided that we associate with each group-homomorphism itself, viewed 
only as a set-theoretic map. Such a functor is called a stripping functor or 

forgetful functor. 

We observe that a functor transforms isomorphisms into isomorphisms, 
because/° g = id implies F(f) ° F(g) = id also. 

We can define the notion of a contravariant functor from G into (B by using 
essentially the same definition, but reversing all arrows F(/), i.e. to each morph- 
ism /: A -► B the contravariant functor associates a morphism 
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F(f ): F(B) -+ F(A) 

(going in the opposite direction), such that, if 

f:A-+B and g:B^C 
are morphisms in (2, then 


F(g of) = F(f)oF(g). 

Sometimes a functor is denoted by writing / # instead of F(f) in the case 
of a covariant functor, and by writing /* in the case of a contravariant 
functor. 

Example. The association S i-> F ab (S) is a covariant functor from the 
category of sets to the category of abelian groups. 

Example. The association which to each group associates its completion 
with respect to the family of subgroups of finite index is a functor from the 
category of groups to the category of groups. 

Example. Let p be a prime number. Let G be the category of p-divisible 
abelian groups. The association A i-> T p (A) is a covariant functor of G into 
abelian groups (actually Z^-modules). 

Example. Exercise 49 will show you an example of the group of auto- 
morphisms of a forgetful functor. 

Example. Let Man be the category of compact manifolds. Then the homol- 
ogy is a covariant functor from Man into graded abelian groups. The cohomology 
is a contravariant functor into the category of graded algebras (over the ring of 
coefficients). The product is the cup product. If the cohomology is taken with 
coefficients in a field of characteristic 0 (for simplicity), then the cohomology 
commutes with products. Since cohomology is contravariant, this means that the 
cohomology of a product is the coproduct of the cohomology of the factors. It 
turns out that the coproduct is the tensor product, with the graded product, which 
also gives an example of the use of tensor products. See M. Greenberg and 
J. Harper, Algebraic Topology (Benjamin-Addison-Wesley), 1981, Chapter 29. 

Example. Let C be the category of pointed topological spaces (satisfying 
some mild conditions), i.e. pairs (X, jc 0 ) consisting of a space X and a point jc 0 . 
In topology one defines the connected sum of such spaces (X, jc 0 ) and (Y, y 0 ), 
glueing X , Y together at the selected point. This connected sum is a coproduct 
in the category of such pairs, where the morphisms are the continuous maps 
/: X —> Y such that f(x 0 ) = y 0 . Let tt x denote the fundamental group. Then 
(X , jc 0 ) i-> tt x (X, jc 0 ) is a covariant functor from G into the category of groups, 
commuting with coproducts. (The existence of coproducts in the category of 
groups will be proved in §12.) 
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Example. Suppose we have a morphism /: X —» Y in a category (3. By a 
section of /, one means a morphism g: Y X such that g ° f = id. Suppose 
there exists a covariant functor // from this category to groups such that 
H(Y) = {e} and H(X) {e}. Then there is no section of /. This is immediate 
from the formula H(g °f) = id, and H(f) = trivial homomorphism. In topology 
one uses the homology functor to show, for instance, that the unit circle X is 
not a retract of the closed unit disc with respect to the inclusion mapping /. 
(Topologists use the word “retract” instead of “section”.) 

Example. Let û be a category and A a fixed object in û. Then we obtain a 
covariant functor 

M a : a - S 

by letting M A (X) = Mor(A, X) for any object X of û. If (p: X -► X' is a mor- 
phism, we let 

M A ((p) : Mor (A, X) - Mor(/l, X') 
be the map given by the rule 

g^cpog 

for any g e Mor/4, X), 

A^X^X'. 

The axioms FUN 1 and FUN 2 are trivially verified. 

Similarly, for each object B of G, we have a contravariant functor 

M B .a -S 

such that M b (Y) = Mor(T, B). If i// : Y' — Y is a morphism, then 
M B (i//): Mor( Y, B) - Mor( Y', B) 
is the map given by the rule 

for any/e Mor(Y, B ), 

r y 4 b. 

The preceding two functors are called the representation functors. 

Example. Let û be the category of abelian groups. Fix an abelian group 
A. The association X i-> Hom(A, X) is a covariant functor from û into itself. 
The association X Hom(X, A) is a contravariant functor of Ct into itself. 

Example. We assume you know about the tensor product. Let A be a 
commutative ring. Let M be an A-module. The association X i-» M ® X is a 
covariant functor from the category of A-modules into itself. 

Observe that products and coproducts were defined in a way compatible with 
the representation functor into the category of sets. Indeed, given a product P 
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of two objects A and B , then for every object X the set Mor(X, P) is a product 
of the sets Mor(X, A) and Mor(X, B) in the category of sets. This is merely a 
reformulation of the defining property of products in arbitrary categories. The 
system really works. 

Let û, (B be two categories. The functors of C i into (B (say covariant, and 
in one variable) can be viewed as the objects of a category, whose morphisms 
are defined as follows. Let L, M be two such functors. A morphism H: L -> M 
(also called a natural transformation) is a rule which to each object X of Q 
associates a morphism 

H x : L(X) -+ M(X) 

such that for any morphism f: X -► Y the following diagram is commutative: 


L(X) ——> M(X) 


L(f) 


M(f) 


L(Y) - >M(Y) 

h y 


We can therefore speak of isomorphisms of functors. A functor is representable 
if it is isomorphic to a representation functor as above. 

As Grothendieck pointed out, one can use the representation functor to 
transport the notions of certain structures on sets to arbitrary categories. For 
instance, let (ì be a category and G an object of G. We say that G is a group 
object in Ct if for each object X of G we are given a group structure on the set 
Mor(A", G) in such a way that the association 

* h- Mor(X, G) 

is functorial (i.e. is a functor from G into the category of groups). One some- 
times denotes the set Mor(X, G) by G(X), and thinks of it as the set of points of 
G in X. To justify this terminology, the reader is referred to Chapter IX, §2. 

Example. Let Var be the category of projective non-singular varieties over 
the complex numbers. To each objectX in Var one can associate various groups, 
e.g. Pic(X) (the group of divisor classes for rational equivalence), which is a 
contravariant functor into the category of abelian groups. Let Pic 0 (X) be the 
subgroup of classes algebraically equivalent to 0. Then Pic 0 is representable. 

In the fifties and sixties Grothendieck was the one who emphasized the 
importance of the representation functors, and the possibility of transposing to 
any category notions from more standard categories by means of the representation 
functors. He himself proved that a number of important functors in algebraic 
geometry are representable. 
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§12. FREE GROUPS 

We now turn to the coproduct in the category of groups. First a remark. Let 
G = llG/ be a direct product of groups. 

We observe that each Gj admits an injective homomorphism into the 
product, on the y-th component, namely the map XjiGj -► f] G f such that 

i 

for x in Gj , the i-th component of Áj(x) is the unit element of G ( if i # j, and 
is equal to x itself if i = j. This embedding will be called the canonical one. 
But we still don’t have a coproduct of the family, because the factors commute 
with each other. To get a coproduct one has to work somewhat harder. 

Let G be a group and S a subset of G. We recall that G is generated by S 
if every element of G can be written as a finite product of elements of S and their 
inverses (the empty product being always taken as the unit element of G). 
Elements of S are then called generators. If there exists a finite set of generators 
for G we call G finitely generated. If S is a set and (p:S -* G is a map, we say 
that cp generates G if its image generates G. 

Let S be a set, and /: S -> F a map into a group. Let g : S -+ G be another 
map. If f(S) (or as we also say,/) generates F, then it is obvious that there exists 
at most one homomorphism ij/ of F into G which makes the following diagram 
commutative: 



G 


We now consider the category G whose objects are the maps of S into 
groups. If/: S G and f' :S -> G' are two objects in this category, we define 
a morphism from / to/' to be a homomorphism (p:G G' such that (p°f = f\ 
i.e. the diagram is commutative: 


G 



G' 


By a free group determined by S, we shall mean a universal element in this 
category. 

Proposition 12.1. Let S be a set. Then there exists a free group (F, /) 
determined by S. Furthermore, f is injective , and F is generated by the image 
off. 

Proof. (I owe this proof to J. Tits.) We begin with a lemma. 
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Lemma 12.2. There exists a set I and a family of groups {G,}^/ such that, 
if g'- S G is a map of S into a group G, and g generates G, then G is 
isomorphic to some G z . 

Proof This is a simple exercise in cardinalities, which we carry out. If S 
is finite, then G is finite or denumerable. If S is infinite, then the cardinality of G 
is ^ the cardinality of S because G consists of finite products of elements of g(S). 
Let T be a set which is infinite denumerable if S is finite, and has the same cardin- 
ality as 5 if 5 is infinite. For each non-empty subset H of T, let T H be the set of 
group structures on H . For each yeT H , let H y be the set H , together with the 
group structure y. Then the family {H y } for y e T H and H ranging over subsets 
of T is the desired family. 

We return to the proof of the proposition. For each iel we let M, be the 
set of mappings of S into G f . For each map cpeM t , we let G i (p be the set- 
theoretic product of G z and the set with one element {<p}, so that G ì (p is the 
“same” group as G t indexed by cp. We let 

f.-nn 

iel (peMi 

be the Cartesian product of the groups G i><p . We define a map 

fo-S-+F 0 

by sending S on the factor G i (p by means of cp itself. We contend that given a 
map g : S —> G of S into a group G, there exists a homomorphism i/f* : F 0 G 
making the usual diagram commutative: 





G 


That is, ^ of Q = g . To prove this, we may assume that g generates G, simply 
by restricting our attention to the subgroup of G generated by the image of g. 
By the lemma, there exists an isomorphism À: G -> G, for some i, and À ° g 
is an element ij/ of M f . We let n itìj , be the projection on the (i, i//) factor, and we 
let ij/* = À~ l ° 7 Then the map i/^ makes the following diagram com- 
mutative. 



We let F be the subgroup of F 0 generated by the image of/ 0 , and we let / 
simply be equal to / 0 , viewed as a map of S into F. We let g* be the restriction 
of ij/* to F. In this way, we see at once that the map g * is the unique one making 
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our diagram commutative, and thus that ( F,f) is the required free group. 
Furthermore, it is clear that /is injective. 

For each set S we select one free group determined by 5, and denote it 
by ( F(S),f s ) or briefly by F(S). It is generated by the image of f s . One may 
view S as contained in F(5), and the elements of 5 are called free generators 
of F(S). If g : S -► G is a map, we denote by g *: F(S) -> G the homomorphism 
realizing the universality of our free group F(S). 

If X : 5 -► 5' is a map of one set into another, we let F(X ): F(S) -► F(S') be 
the mapí/s.oA)*. 


S > F(S) 



Then we may regard Fasa functor from the category of sets to the category of 
groups (the functorial properties are trivially verified, and will be left to the 
reader). 

IfÀ is surjective , then F(À) is also surjective. 

We again leave the proof to the reader. 

If two sets S, S' have the same cardinality, then they are isomorphic in the 
category of sets (an isomorphism being in this case a bijection!), and hence 
F(S) is isomorphic to F(S'). If S has n elements, we call F(S) the free group 
on n generators. 

Let G be a group, and let S be the same set as G (i.e. G viewed as a set, without 
group structure). We have the identity map g: S -► G, and hence a surjective 
homomorphism 

g*:F(S)^G 

which will be called canonical. Thus every group is a factor group of a free 
group. 

One can also construct groups by what is called generators and relations. Let 
5 be a set, and F(S) the free group. We assume that /: 5 -► F(S) is an in- 
clusion. Let R be a set of elements of F(S). Each element of R can be written 
as a finite product 

n 

n-v 

V — 1 

where each x v is an element of S or an inverse of an element of 5. Let N be the 
smallest normal subgroup of F(S) containing R , i.e. the intersection of all normal 
subgroups of F(S) containing R. Then F(S)/N will be called the group deter- 
mined by the generators S and the relations R. 
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Example. One shows easily that the group determined by one generator 
a , and the relation { a 2 }, has order 2. 

The canonical homomorphism <p: F(S) —» F(S)/N satisfies the universal map- 
ping property for homomorphisms i/j of F(S) into groups G such that = e 
for all x e R. In view of this, one sometimes calls the group F(S)/N the group 
determined by the generators S, and the relations x = e (for all x e R). For 
instance, the group in the preceding example would be called the group determined 
by the generator a, and the relation a 2 = e. 

Let G be a group generated by a finite number of elements, and satisfying 
the relation x 2 = e for all x e G. What does G look like? It is easy to show that 
G is commutative. Then one can view G as a vector space over Z/2Z, so G is 
determined by its cardinality, up to isomorphism. 

In Exercises 34 and 35, you will prove that there exist certain groups satisfying 
certain relations and with a given order, so that the group presented with these 
generators and relations can be completely determined. A priori , it is not even 
clear if a group given by generators and relations is finite. Even if it is finite, 
one does not know its order a priori. To show that a group of certain order 
exists, one has to use various means, a common means being to represent the 
group as a group of automorphisms of some object, for instance the symmetries 
of a geometric object. This will be the method suggested for the groups in Exercises 
34 and 35, mentioned above. 

Example. Let G be a group. For x, y e G define [x, y] = xyx~ x y~ x (the 
commutator) and x y = xyx~ l (the conjugate). Then one has the cocycle relation 

[x, yz] = [x, yV[x, z]. 

Furthermore, suppose x, y, z e G and 

[x, y] = y, [y, z] = z, [z, x] = x. 

Then x = y = z = e. It is an exercise to prove these assertions, but one sees 
that certain relations imply that a group generated by x, y, z subject to those 
relations is necessarily trivial. 

Next we give a somewhat more sophisticated example. We assume that the 
reader knows the basic terminology of fields and matrices as in Chapter XIII, 
but applied only to 2 x 2 matrices. Thus SL 2 (F) denotes the group of 2 x 2 
matrices with components in a field F and determinant equal to 1. 


Example. SL 2 (F). Let F be a field. For b e F and a e. F, a ± 0, we let 


u(b) = 




and w = 
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Then it is immediately verified that: 

SL 0. s(a) = wu(a~ l )wu(a)wu(a~ l ). 

SL 1. u is an additive homomorphism. 

SL 2. s is a multiplicative homomorphism. 

SL 3. vv 2 = s(- 1). 

SL 4. s(a)u(b)s(a~ x ) = u(ba 2 ). 

Now, conversely, suppose that G is an arbitrary group with generators u(b) 
(b e F) and vv, such that if we define s(a) for a ± 0 by SL 0, then the relations 
SL 1 through SL 4 are satisfied. Then SL 3 and SL 4 show that s(— 1) is in the 
center, and vv 4 = e. In addition, one verifies that: 

SL 5. ws(a) = 5(^ _1 )w. 

Furthermore, one has the theorem: 

Let G be the free group with generators u(b), w and relations SL 1 through 
SL 4, defining s(a) as in SL 0. Then the natural homomorphism 

G SL 2 (F) 

is an isomorphism. 

Proofs of all the above statements will be found in my SL 2 (R), Springer Verlag, 
reprint of Addison-Wesley, 1975, Chapter XI, §2. It takes about a page to carry 
out the proof. 

If F = Q p is the field of p-adic numbers, then Ihara [Ih 66] proved that every 
discrete torsion free subgroup of SL 2 (Q p ) is free. Serre put this theorem in the 
context of a general theory concerning groups acting on trees [Se 80]. 

[Ih 66] Y. Ihara, On discrete subgroups of the two by two projective linear group over 
p-adic fields, J. Math. Soc. Japan 18 (1966) pp. 219-235 

[Se 80] J.-P. Serre, Trees , Springer Verlag 1980 

Further examples. For further examples of free group constructions, see 
Exercises 54 and 56. For examples of free groups occurring (possibly conjec- 
turally) in Galois theory, see Chapter VI, §2, Example 9, and the end of 
Chapter VI, §14. 

Proposition 12.3. Coproducts exist in the category of groups. 

Proof. Let {GJ ie/ be a family of groups. We let C be the category whose 
objects are families of group-homomorphisms 


{Qì'Gì -> G} ieI 
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and vvhose morphisms are the obvious ones. We must find a universal element 
in this category. For each index i, we let S t be the same set as G, if G f is infinite, 
and we let S t be denumerable if G, is finite. We let S be a set having the same 
cardinality as the set-theoretic disjoint union of the sets S ( (i.e. their coproduct 
in the category of sets). We let T be the set of group structures on 5, and for 
each y e T, we let be the set of all families of homomorphisms 

<P = {</>. : G, -+ S v }. 

Each pair (S y , cp), where (pe<S> y , is then a group, using q> merely as an index. 
We let 

Fo= n n (Sy’ <p )> 

yer (pe<!>y 

and for each i, we define a homomorphism f : G, -► F 0 by prescribing the 
component of f on each factor (S y , (p ) to be the same as that of (p { . 

Let now g = {g^: G f -> G} be a family of homomorphisms. Replacing G 
if necessary by the subgroup generated by the images of the g h we see that 
card(G) ^ card(S), because each element of G is a finite product of elements 
in these images. Embedding G as a factor in a product G x S y for some y, we 
may assume that card(G) = card(S). There exists a homomorphism <?*: F 0 -► G 
such that 


0* ° f = 9i 

for all i. Indeed, we may assume without loss of generality that G = S y for some 
y and that g = ij/ for some ij/ e O y . We let g # be the projection of F 0 on the 
factor (5 y , ỳ). 

Let F be the subgroup of F 0 generated by the union of the images of 
the maps f for all i. The restriction of g * to F is the unique homomorphism 
satisfying f ° g* = g t for all i, and we have thus constructed our universal 
object. 

Example. Let G 2 be a cyclic group of order 2 and let G 3 be a cyclic group 
of order 3. What is the coproduct? The answer is neat. It can be shown that 
G 2 U G 3 is the group generated by two elements 5, T with relations 5 2 = 1, 
{ST ) 3 = 1. The groups G 2 and G 3 are embedded in G 2 U G 3 by sending G 2 on 
the cyclic group generated by S and sending G 3 on the cyclic group generated 
by ST. This is done by representing the group as follows. Let 


G = SL 2 { Z)/±l. 
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As we have seen in an example of §5, the group G operates on the upper half- 
plane £?. Let S, T be the maps given by 

S(z) = - 1/z and T(z) = z + 1. 

Thus S and T are represented by the matrices 


S = 



and 



and satisfy the relations S 2 = 1, (ST) 3, = 1. Readers will find a proof of several 
properties of S, Tin Serre’s Course inArithmetic (Springer Verlag, 1973, Chapter 
VII, §1), including the fact that S, T generate G. It is an exercise from there to 
show that G is the coproduct of G 2 and G 3 as asserted. 

Observe that these procedures go directly from the universal definition and 
construction in the proofs of Proposition 12.1 and Proposition 12.3 to the more 
explicit representation of the free group or the coproduct as the case may be. 
One relies on the following proposition. 


Proposition 12.4. Let G be a group and {G,} /e/ a family of subgroups. 
Assume: 

(a) The family generates G. 

(b) // 

x = x ix • • • x in mth x ip E G ip , x iv =h e and i v =/= i v+ì for all v, 
then x =É e. 

Then the natural homomorphism ofthe coproduct of the family into G sending 
Gj on itself by the identity mapping is an isomorphism. In other words, simply 
put, G is the coproduct of the family of subgroups. 

Proof. The homomorphism from the coproduct into G is surjective by the 
assumption that the family generates G. Suppose an element is in the kernel. 
Then such an element has a representation 

X ix ' ' x. 

‘1 l n 

as in (b), mapping to the identity in G, so all x iv = e and the element itself is 
equal to e, whence the homomorphism from the coproduct into G is injective, 
thereby proving the proposition. 

Exercises 54 and 56 mentioned above give one illustration of the way Prop- 
osition 12.4 can be used. We now show another way, which we carry out for 
two subgroups. I am indebted to Eilenberg for the neat arrangement of the proof 
of the next proposition. 
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Proposition 12.5. Let A, B be two groups xvhose set-theoretic intersection is 
{1}. There exists a group A° B containing A y B as subgroups, such that 
A fl B = {1}, and having the following property. Every element 4 1 ofA ° B 
has a unique expression as a product 

a i ' " a n (n ^ 1, a i # 1 all í) 

with a i€ A or a i e B, and such that if a { e A then a i+l e B and if a t e B then 
a I+ i eA. 

Proof Let A° Bbe the set of sequences 

a = (ai,...,a„) (n ^ 0) 

such that either n = 0, and the sequence is empty or n ^ 1, and then elements 
in the sequence belong to A or B , are # 1, and two consecutive elements of the 
sequence do not belong both to A or both to B. If b = (b ìy , b m \ we define 
the product ab to be the sequence 

(a ìy ..., a n ,b i,..., b m ) 

if a n eA,b x eB or a n e B.b^e A, 

(a l ,...,a„b l ,...,b m ) 

ìf a n ,b x eA or a^^b^eB, and a n b x # 1, 

(a u ...,a n _ x )(b 2 , ...,b m ) by induction, 

if a n ,b { eA or a^^b^eB and a n b x = 1. 

The case when n = 0 or m = 0 is included in the first case, and the empty 
sequence is the unit element of A ° B. Clearly, 

(a l5 ..., a„)(a; \ ..., a \‘) = unit element, 

so only associativity need be proved. Let c = (c í9 ..., c r ). 

First consider the case m = 0, i.e. b is empty. Then clearly (ab)c = a(bc) 
and similarly if n = 0 or r = 0. Next consider the case m = 1. Let b = (x) 
with xeA, x # L We then verify in each possible case that (ab)c = a(bc). 
These cases are as follows: 


(fll,.. 

., a n , x,c u . 

-,C r ) 

if 

a n eB and c^eB, 

(«!>•• 

.,a n x,c u ... 

,c r ) 

if 

a n e A, a n x # 1, c x e B, 

(«1, • • 

., a „, xc i,... 

,c r ) 

if 

a n e B, Cj e A, xc { # 1, 

(«1,.. 

•>û.-lXCl,- 

..,c r ) 

if 

a n = x~ 1 and Cj eB, 
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<0(c 2 ,...,c r ) if a n eB and d=x _1 , 

(d, a n xc!, c 2 ,..., c r ) if a n , Cj e A, a n xc x # 1, 

(«!,..., a^jXc^,..., c r ) if a n , Ci e >1 and a n xcj = 1. 

If m > 1, then we proceed by induction. Write b = ò'ò" with b' and b" 
shorter. Then 

(ab)c = (a(b'b"))c = ((ab')b")c = (ab')(b"c\ 
a(bc) = a((b'b")c) = a(b'(b"c)) = (ab')(b"c) 
as was to be shown. 

We have obvious injections of A and B into AoB, and identifying A , B 
with their images in >1 o B we obtain a proof of our proposition. 

We can prove the similar result for several factors. In particular, we get the 
following corollary for the free group. 

Corollary 12.6. Let F(S) be the free group on a set S , and let x x ,..., x n be 
distinct elements of S. Let v x , ..., v r be integers 0 and let i [9 . . . , i r be 
integers , 

1 ^ i r ^ n 

such that ij ^ ij+ x for j = 1,..., r — 1. Then 

xïí •••x£# 1. 

Proof Let G Í9 ..., G n be the cyclic groups generated by x ì9 ..., x n . Let 
G = G ^ o • • • ° G n . Let 

F(S) - G 

be the homomorphism sending each x f on x h and all other elements of 5 on the 
unit element of G. Our assertion follows at once. 

Corollary 12.7. Let S be a set with n elements x h ..., x n , n ^ 1. Let G Ì9 
. •. 9 G n be the infinite cyclic groups generated by these elements. Then the map 

F(S)^G ì o...o Gn 

sending each x f on itself is an isomorphism. 

Proofi It is obviously surjective and injective. 

Corollary 12.8. Let G Ì9 ... , G n be groups with G t fl G } = {1} if i =h j. 
The homomorphism 

G 1 U-UG n ^G 1 o..oG n 

of their coproduct into G^ ° ° G„ induced by the natural inclusion 

Gi -+ Gi o... o G n is an isomorphism. 

Proof. Again, it is obviously injective and surjective. 
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EXERCISES 


1. Show that every group of order ^ 5 is abelian. 

2. Show that there are two non-isomorphic groups of order 4, namely the cyclic one, 
and the product of two cyclic groups of order 2. 

3. Let G be a group. A commutator in G is an element of the form aba~ l b~ l with a , 
b E G. Let G c be the subgroup generated by the commutators. Then G c is called the 
commutator subgroup. Show that G c is normal. Show that any homomorphism of 
G into an abelian group factors through G/G c . 

4. Let //, K be subgroups of a finite group G with K C N H . Show that 


#{HK) = 


#(H)#(K) 
#(H H K )' 


5 . Goursat’s Lemma. Let <7, G' be groups, and let H be a subgroup of G x G' such that the 
two projections p x : H -> G and p 2 : H -» G' are surjective. Let N be the kernel of p 2 
and N' be the kernel of p lm One can identify N as a normal subgroup of G, and N' as a 
normal subgroup of G'. Show that the image of H in G/N x G'/N' is the graph of an 
isomorphism 

G/N * G'/N'. 

6. Prove that the group of inner automorphisms of a group G is normal in Aut(G). 

7. Let G be a group such that Aut(G) is cyclic. Prove that G is abelian. 

8. Let G be a group and let //, H' be subgroups. By a double coset of H , H' one means 
a subset of G of the form HxH'. 

(a) Show that G is a disjoint union of double cosets. 

(b) Let {c} be a family of representatives for the double cosets. For each 
a E G denote by [a]H' the conjugate aH'a~ x of H'. For each c we have a 
decomposition into ordinary cosets 

H = U x c (H n [c]H'), 

c 

where {jc c } is a family of elements of //, depending on c. Show that the 
elements {jc c c} form a family of left coset representatives for H' in G; that 
is, 

G = U U x c cH', 

X c X c 

and the union is disjoint. (Double cosets will not emerge further until Chapter 
XVIII.) 

9. (a) Let G be a group and H a subgroup of finite index. Show that there exists a 

normal subgroup N of G contained in H and also of finite index. [Hint : If 
(G : H) = n, find a homomorphism of G into S n whose kernel is contained in 
H.] 

(b) Let G be a group and let H x , H 2 be subgroups of finite index. Prove that 
//, fl H 2 has finite index. 

10. Let G be a group and let H be a subgroup of finite index. Prove that there is only a 
finite number of right cosets of //, and that the number of right cosets is equal to the 
number of left cosets. 
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11. Let G be a group, and A a normal abelian subgroup. Show that G/A operates on A 
by conjugation, and in this manner get a homomorphism of G/A into Aut(A). 

Semidirect product 

12. Let G be a group and let //, N be subgroups with N normal. Let y x be conjugation 
by an element x E G. 

(a) Show that x i-> y x induces a homomorphism/: H i-> Aut(A). 

(b) If H fl N = { e }, show that the map H x N -> HN given by (x, y) i-> xy is 
a bijection, and that this map is an isomorphism if and only if/is trivial, 
i.e. /(;c) = idyy for all x E H. 

We define G to be the semidirect product of H and N if G = NH and H fl N = {e}. 

(c) Conversely, let N, H be groups, and let if/ : H —> Aut(A) be a given homo- 
morphism. Construct a semidirect product as follows. Let G be the set of 
pairs (jc, h) with x E N and h E H. Define the composition law 

(ï^i)feW = (fii*** 2 , h x h 2 ). 

Show that this is a group law, and yields a semidirect product of N and //, 
identifying N with the set of elements (jc, 1) and H with the set of elements 
(1 ,h). 

13. (a) Let Z/, N be normal subgroups of a finite group G. Assume that the orders of //, 

N are relatively prime. Prove that ;cy = yjc for all x E H and y E N, and that 
H X N ~ HN. 

(b) Let //,,..., // r be normal subgroups of G such that the order of //, is relatively 
prime to the order of Hj for i # j. Prove that 

//, x ... x H r « H { • // r . 

Example. If the Sylow subgroups of a finite group are normal, then G is the 
direct product of its Sylow subgroups. 

14. Let G be a finite group and let N be a normal subgroup such that N and G/A have 
relatively prime orders. 

(a) Let H be a subgroup of G having the same order as G/N. Prove that 
G = HN. 

(b) Let g be an automorphism of G. Prove that g(N) = N. 

Some operations 

15. Let G be a finite group operating on a finite set S with #(S) ^ 2. Assume that there 
is only one orbit. Prove that there exists an element jc 6 G which has no fixed point, 
i.e. JC5 # 5 for all s E S. 

16. Let H be a proper subgroup of a finite group G. Show that G is not the union of all 
the conjugates of H. (But see Exercise 23 of Chapter XIII.) 

17. Let X, Y be finite sets and let C be a subset of X x Y. For jc E X let (p(x) = number 
of elements y E Y such that (jc, y) E C. Verify that 

#(C) = X <p(.x). 

. jreX 
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Remarh. A subset C as in the above exercise is often called a correspondence, and 
cp(x) is the number of elements in Y which correspond to a given element x E X. 

18. Let S, T be finite sets. Show that #Map(S, T) = (#r) #(5) . 

19. Let G be a finite group operating on a finite set S. 

(a) For each s E S show that 

v 1 =, 

ieGs #(Gí) 

(b) For each x E G define f(x) = number of elements s E S such that xs = s. 
Prove that the number of orbits of G in S is equal to 

sïrï ^ f(x). 

#(L) X £G 


Throughout, p is a prime number. 

20. Let P be a p-group. Let Abea normal subgroup of order p. Prove that A is contained 
in the center of P. 

21. Let G be a finite group and H a subgroup. Let Ph be a p-Sylow subgroup of H. Prove 
that there exists a p-Sylow subgroup P of G such that P H = P 0 H. 

22. Let H be a normal subgroup of a finite group G and assume that #(//) = p. Prove 
that H is contained in every p-Sylow subgroup of G. 

23. Let P, P' be p- Sylow subgroups of a finite group G. 

(a) If P' C N(P) (normalizer of P ), then P' = P. 

(b) If N(P') = N(P ), then P' = P. 

(c) We have N(N(P)) = A(P). 


Explicit determination of groups 

24. Let p be a prime number. Show that a group of order p 2 is abelian, and that there are 
only two such groups up to isomorphism. 

25. Let G be a group of order p 3 , where p ìs prime, and G is not abelian. Let Z be ìts center. 
Let C be a cyclic group of order p. 

(a) Show that Z % C and G/Z % C x C. 

(b) Every subgroup of G of order p 2 contains Z and is normal. 

(c) Suppose x p = 1 for all xeG. Show that G contains a normal subgroup 
H x C x C. 

26. (a) Let G be a group of order pq , where /?, <? are primes and p < q. Assume that 

g ^ 1 mod p. Prove that G is cyclic. 

(b) Show that every group of order 15 is cyclic. 

27. Show that every group of order < 60 is solvable. 

28. Let p, q be distinct primes. Prove that a group of order p 2 q is solvable, and that one 
of its Sylow subgroups is normal. 

29. Let p, q be odd primes. Prove that a group of order 2 pq is solvable. 
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30. (a) Prove that one of the Sylow subgroups of a group of order 40 is normal. 

(b) Prove that one of the Sylow subgroups of a group of order 12 is normal. 

31. Determine all groups of order ^ 10 up to isomorphism. In particular, show that a 
non-abelian group of order 6 is isomorphic to S 3 . 

32. Let S n be the permutation group on n elements. Determine the p-Sylow subgroups of 
S 3 , S 4 , S 5 for p = 2 and p - 3. 

33. Let a be a permutation of a finite set / having n elements. Define e(a) to be (— l) m 
where 

m = n — number of orbits of o. 

If /j,..., / r are the orbits of o , then m is also equal to the sum 

m = Z [card(/ v ) - 1]. 

V — 1 

If t is a transposition, show that e(ar) = ~e(cr) be considering the two cases when 
/, j lie in the same orbit of a, or lie in different orbits. In the first case, or has one 
more orbit and in the second case one less orbit than a. In particular, the sign of a 
transposition is — 1. Prove that e(a) = e(a) is the sign of the permutation. 

34. (a) Let n be an even positive integer. Show that there exists a group of order 2 n, 

generated by two elements a, r such that a n = e = t 2 , and ar = ra n ~ x . (Draw 
a picture of a regular Ai-gon, number the vertices, and use the picture as an 
inspiration to get a, r.) This group is called the dihedral group 
(b) Let n be an odd positive integer. Let D 4n be the group generated by the matrices 



where f is a primitive n -th root of unity. Show that D 4n has order 4 n, and give 
the commutation relations between the above generators. 

35. Show that there are exactly two non-isomorphic non-abelian groups of order 8. (One 
of them is given by generators o, x with the relations 

a 4 =1, i 2 = 1, TOX = (J 3 . 

The other is the quaternion group.) 

36. Let a = [123 • • • n] in S n . Show that the conjugacy class of crhas (n - 1)! elements. 
Show that the centralizer of a is the cyclic group generated by a. 

37. (a) Let a = [i, • • • / m ] be a cycle. Let y E S n . Show that yay~ x is the cycle 

[y(h) ' * * 7 (*«)]• 

(b) Suppose that a permutation o in S n can be written as a product of r disjoint 
cycles, and let d { ,...,d r be the number of elements in each cycle, in increasing 
order. Let x be another permutation which can be written as a product of 
disjoint cycles, whose cardinalities are d\,..., d' s in increasing order. Prove 
that o is conjugate to x in S n if and only if r = s and d t = d\ for all i = 1,..., r. 

38. (a) Show that S n is generated by the transpositions [12], [13],... , [1 n\. 

(b) Show that S n is generated by the transpositions [12], [23], [34],. .. , [n - 1, n]. 
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(c) Show that S n is generated by the cycles [12] and [123 ... n\. 

(d) Assume that n is prime. Let a = [123 ... n\ and let t = [rs\ be any transposition. 
Show that cr, r generate S n . 

Let G be a finite group operating on a set S. Then G operates in a natural way on 
the Cartesian product S (n) for each positive integer n. We define the operation on 5 
to be n-transitive if given n distinct elements (jj,..., s n ) and n distinct elements 
(s[,..., s' n ) of S, there exists a E G such that as^ = s'i for all i = 1,. . . , n. 

39. Show that the action of the alternating group A n on {1,... , n} is (n - 2)-transitive. 

40. Let A n be the alternating group of even permutations of {1,..., n}. For j = 1,. . . , n 
let Hj be the subgroup of A n fixing j, so Hj ~ A„_,, and (A n : Hj) = n for n ^ 3. 
Let n^3 and let H be a subgroup of index n in A n . 

(a) Show that the action of A n on cosets of H by left translation gives an iso- 
morphism A n with the alternating group of permutations of AjH. 

(b) Show that there exists an automorphism of A n mapping //, on H, and that 
such an automorphism is induced by an inner automorphism of S n if and only 
if H = H t for some i. 

41. Let H be a simple group of order 60. 

(a) Show that the action of H by conjugation on the set of its Sylow subgroups 
gives an imbedding H ^ A 6 . 

(b) Using the preceding exercise, show that H « A 5 . 

(c) Show that A 6 has an automorphism which is not induced by an inner auto- 
morphism of S 6 . 

Abelian groups 

42. Viewing Z, Q as additive groups, show that Q/Z is a torsion group, which has one and 
only one subgroup of order n for each integer n ^ 1, and that this subgroup is cyclic. 

43. Let H be a subgroup of a finite abelian group G. Show that G has a subgroup that is 
isomorphic to G/H. 

44. Let be a homomorphism of abelian groups. Let B be a subgroup of A. 

Denote by A f and A f the image and kernel of/ in A respectively, and similarly for B f 
and B f . Show that (A : B) = (A f : B f )(A f : B f ), in the sense that if two of these three 
indices are finite, so is the third, and the stated equality holds. 

45. Let G be a finite cyclic group of order n, generated by an element o. Assume that G 
operates on an abelian group A, and let/, g: A -> A be the endomorphisms of A given by 

f(x) = ox - x and g(x) = x + ox + • • • + o n ~ l x. 

Define the Herbrand quotient by the expression q(A) = (A f : A 9 )/(A g : A f ), provided 
both indices are finite. Assume now that B is a subgroup of A such that GB c= B. 

(a) Define in a natural way an operation of G on A/B. 

(b) Prove that 

q(A) = q(B)q(A/B) 

in the sense that if two of these quotients are finite, so is the third, and the stated 
equality holds. 

(c) If A is finite, show that q(A) = 1. 
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(This exercise is a special case of the general theory of Euler characteristics discussed 
in Chapter XX, Theorem 3.1. After reading this, the present exercise becomes trivial. 
Why?) 


Primitive groups 

46. Let G operate on a set S. Let S = (J S, be a partition of S into disjoint subsets. We say 
that the partition is stable under G if G maps each S, onto Sj for some j, and hence G 
induces a permutation of the sets of the partition among themselves. There are two 
partitions of S which are obviously stable: the partition consisting of S itself, and the 
partition consisting ofthe subsets with one element. Assumethat G operates transitively, 
and that S has more than one element. Prove that the following two conditions are 
equivalent: 

PRIM 1. The only partitions of S which are stable are the two partitions mentioned 
above. 

PRIM 2. If H is the isotropy group of an element of S, then H is a maximal subgroup 
ofG. 

These two conditions define what is known as a primitive group, or more accurately, a 
primitive operation of G on S. 

Instead of saying that the operation of a group G is 2-transitive, one also says that it is 
doubly transitive. 

47. Let a finite group G operate transitively and faithfully on a set S with at least 2 
elements and let H be the isotropy group of some element s of S. (All the other 
isotropy groups are conjugates of H.) Prove the following: 

(a) G is doubly transitive if and only if H acts transitively on the complement 
of s in S. 

(b) G is doubly transitive if and only if G = HTH , where T is a subgroup of G 
of order 2 not contained in H. 

(c) If G is doubly transitive, and (G : H) = n, then 

#(G) = d(n - 1 )/i, 

where d is the order of the subgroup fixing two elements. Furthermore, H 
is a maximal subgroup of G, i.e. G is primitive. 

48. Let G be a group acting transitively on a set S with at least 2 elements. For each 
x E G let f(x) - number of elements of S fixed by x. Prove: 

(a) 2 m = #(G). 

jceG 

(b) G is doubly transitive if and only if 

2 m 2 = 2 #(G). 

jceG 

49. A group as an automorphism group. Let G be a group and let Set(G) be the category 
of G-sets (i.e. sets with a G-operation). Let F: Set(G) —> Set be the forgetful functor, 
which to each G-set assigns the set itself. Show that Aut(F) is naturally isomorphic 
to G. 
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Fiber products and coproducts 
Pull-backs and push-outs 

50. (a) Show that fiber products exist in the category of abelian groups. In fact, if X , Y 

are abeiian groups with homomorphisms f: X -+ Z and g: Y -> Z show that 
X x z y is the set of all pairs (x, y) with xe X and ye Y such that f(x) = g(y\ 
The maps /?,, p 2 are the projections on the first and second factor respectively. 

(b) Show that the pull-back of a surjective homomorphism is surjective. 

51. (a) Show that fiber products exist in the category of sets. 

(b) In any category G , consider the category G 7 of objects over Z. Let h :T~* Z 
be a fixed object in this category. Let F be the functor such that 

F(X) = Mor z (T, X\ 

where X is an object over Z, and Mor z denotes morphisms over Z. Show that 
F transforms fiber products over Z into fiber products in the category of sets. 
(Actually, once you have understood the definitions, this is tautological.) 

52. (a) Show that push-outs (i.e. fiber coproducts) exist in the category of abelian groups. 

In this case the fiber coproduct of two homomorphisms /, g as above is denoted 
by X ® z Y. Show that it is the factor group 

XQ Z Y = (X® Y)/W , 

where W is the subgroup consisting of all elements (/(z), -g(z)) with zeZ. 

(b) Show that the push-out of an injective homomorphism is injective. 

Remark. After you have read about modules over rings, you should note that the 
above two exercises apply to modules as well as to abelian groups. 

53. Let H, G, G' be groups, and let 

/://-> G, g :H-> G' 

be two homomorphisms. Define the notion of coproduct of these two homomor- 
phisms over H, and show that it exists. 

54. (Tits). Let G be a group and let {G,} ie/ be a family of subgroups generating G. 
Suppose G operates on a set S. For each i e /, suppose given a subset 5, of 5, and 
let s be a point of S - U S ( . Assume that for each g e G t — {e}, we have 

i 

gSj C 5, for all j ï /, and g(s) e 5, for all i. 

Prove that G is the coproduct of the family {G t } ie/ . (Hint: Suppose a product 
9\ ' * * 9 m = id on S. Apply this product to s , and use Proposition 12.4.) 

55. Let M e GL 2 ( C) (2x2 complex matrices with non-zero determinant). We let 

( a b\ az + b 

), and for z e C we let M(z) = -:. 

c d) cz + d 

If z = -d/c (c ï 0) then we put M(z) = «. Then you can verify (and you should 
have seen something like this in a course in complex analysis) that GL 2 ( C) thus 
operates on C U {°°}. Let À, À' be the eigenvalues of M viewed as a linear map on 
C 2 . Let W , W' be the corresponding eigenvectors, 

W = '(vv,, w 2 ) and W' = r (wi, w' 2 ). 
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By a fixed point of M on C we mean a complex number z such that M(z) = z. Assume 
that M has two distinct fixed points 00 . 

(a) Show that there cannot be more than two fixed points and that these fixed 
points are vv = w x /w 2 and vv' = w\/w' 2 . In fact one may take 

W = ‘(w, 1), W' = \w\ 1). 

(b) Assume that |A| < |A'|. Given z # vv, show that 

lim M k (z) = vv'. 

OO 

[Hint: Let S = (W, W') and consider S~ l M k S(z) = a*z where a = A/A'.] 

56. (Tits) Let M lt ..., M r E GL 2 (C) be a finite number of matrices. Let A t , A' be the 
eigenvalues of M ( . Assume that each M ( has two distinct complex fixed points, and 
that | A f | < | A'|. Also assume that the fixed points for M x ,..., M r are all distinct 
from each other. Prove that there exists a positive integer k such that M k ,...,M k 
are the free generators of a free subgroup of GL 2 ( C). [Hint: Let vv ( , vvj be the fixed 
points of M ( . Let í/ ( be a small disc centered at vv ( and U\ a small disc centered at 
vvj. Let 5 ( = í/ ( U U\. Let s be a complex number which does not lie in any S r Let 
G ( = (Mf). Show that the conditions of Exercise 54 are satisfied for k sufficiently 
large.]. 



57. Let G be a group acting on a set X. Let Y be a subset of X. Let G Y be the subset of 
G consisting of those elements g such that gY fì Y is not empty. Let G Y be the 
subgroup of G generated by G y . Then G Y Y and (G - G Y )Y are disjoint. [Hint: 
Suppose that there exist g { E G y and g 2 E G but g 2 $ G Y , and elements y,, y 2 , E Y 
such that g^y x = gyy 2 . Then g 2 x g x y x = y 2 , so g 2 x g x E G Y whence g 2 E G y , contrary 
to assumption.] 

Application. Suppose that X = GY, but that X cannot be expressed as a disjoint 
union as above unless one of the two sets is empty. Then we conclude that G — G Y 
is empty, and therefore G Y generates G. 

Example 1. Suppose X is a connected topological space, Y is open, and G acts 
continuously. Then all translates of Y are open, so G is generated by G y . 

Example 2. Suppose G is a discrete group acting continuously and discretely 
on X. Again suppose X connected and Y closed. Then any union of translates of Y 
by elements of G is closed, so again G — G y is empty, and G y generates G. 
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§1. RINGS AND HOMOMORPHISMS 

A ring A is a set, together with two laws of composition called multiplica- 
tion and addition respectively, and written as a product and as a sum respec- 
tively, satisfying the following conditions: 

RI 1. With respect to addition, A is a commutative group. 

RI 2. The multiplication is associative, and has a unit element. 

RI 3. For all x, y, z e A we have 

(x -f y)z = xz -f yz and z(x -f y) = zx -f zy. 

(This is called distributivity.) 

As usual, we denote the unit element for addition by 0, and the unit 
element for multiplication by 1. We do not assume that 1 # 0. We observe 
that Ox = 0 for all x e A. Proof : We have Ox -f x = (0 -f l)x = lx = x. 
Hence Ox = 0. In particular, if 1 = 0, then A consists of 0 alone. 

For any x, y e A we have ( — x)y = — (xy). Proof : We have 

xy + (-x)y = (x + (-x))y = Oy = 0, 

so ( —x)y is the additive inverse of xy. 

Other standard laws relating addition and multiplication are easily proved, 
for instance ( — x)( — y) = xy. We leave these as exercises. 

Let A be a ring, and let U be the set of elements of A which have both a 
right and left inverse. Then U is a multiplicative group. Indeed, if a has a 
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right inverse ò, so that ab = 1, and a left inverse c, so that cû = 1, then 
cab = ò, whence c = b, and we see that c (or 6) is a two-sided inverse, and 
that c itself has a two-sided inverse, namely a. Therefore U satisfies all the 
axioms of a multiplicative group, and is called the group of units of A. It is 
sometimes denoted by A *, and is also called the group of invertible elements 
of A. A ring A such that 1 # 0, and such that every non-zero element is 
invertible is called a division ring. 

Note. The elements of a ring which are left invertible do not necessarily 
form a group. 

Example. (The Shift Operator). Let E be the set of all sequences 

a = (a ^, a 2i a 3 ,...) 

of integers. One can define addition componentwise. Let R be the set of all 
mappings f :E-+E of E into itself such that f(a + b) = f(a) + f(b). The law 
of composition is defined to be composition of mappings. Then R is a ring. 
(Proof?) Let 

T(a x , a 2 , a 3 ,...) = (0, a l9 a 2 , a 3 ,...). 

Verify that T is left invertible but not right invertible. 

A ring A is said to be commutative if xy = yx for all x, y e A. A commu- 
tative division ring is called a field. We observe that by definition, a field 
contains at least two elements, namely 0 and 1. 

A subset B of a ring A is called a subring if it is an additive subgroup, if 
it contains the multiplicative unit, and if x, y e B implies xy e B. If that is 
the case, then B itself is a ring, the laws of operation in B being the same as 
the laws of operation in A. 

For example, the center of a ring A is the subset of A consisting of all 
elements ae A such that ax = xa for all xe A. One sees immediately that 
the center of A is a subring. 

Just as we proved general associativity from the associativity for three 
factors, one can prove general distributivity. If x, y í9 ..., y n are elements of a 
ring A, then by induction one sees that 

*0'i + ••• + yj = xy 1 + ••• + xy n . 

If Xj (i = 1,..., n) and y } (j = 1,..., m) are elements of A , then it is also easily 
proved that 



Furthermore, distributivity holds for subtraction, e.g. 

x(yi - yi) = xy 1 - xy 2 . 


We leave all the proofs to the reader. 
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Examples. Let S be a set and A a ring. Let Map(S, A) be the set of map- 
pings of S into A. Then Map(5, Á) is a ring if for f g e Map(S, A) we define 

(fg)(x) = f(x)g(x) and (f + g)(x) = f(x) + g(x) 

for all x e S. The multiplicative unit is the constant map whose value is the 
multiplicative unit of A. The additive unit is the constant map whose value 
is the additive unit of A , namely 0. The verification that Map(S, A) is a ring 
under the above laws of composition is trivial and left to the reader. 

Let M be an additive abelian group, and let A be the set End(M) of 
group-homomorphisms of M into itself. We define addition in A to be the 
addition of mappings, and we define multiplication to be composition of 
mappings. Then it is trivially verified that A is a ring. Its unit element is of 
course the identity mapping. In general, A is not commutative. 

Readers have no doubt met polynomials over a field previously. These pro- 
vide a basic example of a ring, and will be defined oíficially for this book in §3. 

Let K be a field. The set of n x n matrices with components in K is a 
ring. Its units consist of those matrices which are invertible, or equivalently 
have a non-zero determinant. 

Let S be a set and R the set of real-valued functions on S. Then R is a 
commutative ring. Its units consist of those functions which are nowhere 0. 
This is a special case of the ring Map(S, A) considered above. 

The convolution product. We shall now give examples of rings whose 
product is given by what is called convolution. Let G be a group and let K 
be a field. Denote by K[G] the set of all formal linear combinations 
a = Y, a x x w dh X ^G and a x e K, such that all but a finite number of a x are 
equal to 0. (See §3, and also Chapter III, §4.) If /? = e K[G], then one 
can define the product 

= Z Z a x b y xy= X ( Z a x b y ) z - 

xeG yeG zeG \xy=z / 

With this product, the group ring K[G] is a ring, which will be studied 
extensively in Chapter XVIII when G is a finite group. Note that X[G] is 
commutative if and only if G is commutative. The second sum on the right 
above defines what is called a convolution product. If f g are two functions 
on a group G, we define their convolution f *g by 

(f*g)(z) = E f(x)g(y)- 

xy=z 

Of course this must make sense. If G is infinite, one may restrict this 
definition to functions which are 0 except at a finite number of elements. 
Exercise 12 will give an example (actually on a monoid) when another type 
of restriction allows for a finite sum on the right. 

Example from analysis. In analysis one considers a situation as follows. 
Let L 1 = L l ( R) be the space of functions which are absolutely integrable. 
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Given functions f g e L 1 , one defines their convolution product f *g by 
(/ * 0)M = | /(*- y)^(y) dy. 

Then this product satisfìes all the axioms of a ring, except that there is no 
unit element. In the case of the group ring or the convolution of Exercise 12, 
there is a unit element. (What is it?) Note that the convolution product in 
the case of L^(R) is commutative, basically because R is a commutative 
additive group. More generally, let G be a locally compact group with a 
Haar measure fi. Then the convolution product is defined by the similar 
formula 

(f*g)(x) = J f(xy~ l )g(y) dn(y). 

After these examples, we return to the general theory of rings. 

A left ideal a in a ring A is a subset of A which is a subgroup of the 
additive group of A , such that Aa c= a (and hence Aa = a since A contains 
1). To define a right ideal, we require aA = a, and a two-sided ideal is a 
subset which is both a left and a right ideal. A two-sided ideal is called 
simply an ideal in this section. Note that (0) and A itself are ideals. 

If A is a ring and a e A, then Aa is a left ideal, called principal. We say 
that a is a generator of a (over A). Similarly, AaA is a principal two-sided 
ideal if we define AaA to be the set of all sums Y, x i a yi Cf. 

below the definition of the product of ideals. More generally, let a ì9 ..., a n 
be elements of A. We denote by (a ì9 ... 9 a n ) the set of elements of A which 
can be written in the form 

x i a i +"- + x n a n with x t e A. 

Then this set of elements is immediately verified to be a left ideal, and 
a ì9 ..., a n are called generators of the left ideal. 

If {di}i € i is a family of ideals, then their intersection 

<*i 

is also an ideal. Similarly for left ideals. Readers will easily verify that if 
a = (a ì9 ..., a n \ then a is the intersection of all left ideals containing the 
elements a í9 ..., a n . 

A ring A is said to be commutative if xy = yx for all x, y e A. In that 
case, every left or right ideal is two-sided. 

A commutative ring such that every ideal is principal and such that 1 # 0 
is called a principal ring. 

Examples. The integers Z form a ring, which is commutative. Let û be 
an ideal # Z and ^ 0. If n e a, then —nea. Let d be the smallest integer 
> 0 lying in a. If ne a then there exist integers q , r with 0 < d such that 

n = dq + r. 


n 

ie/ 
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Since a is an ideal, it follovvs that r lies in a, hence r = 0. Hence û consists of 
all multiples qd of d, with q e Z, and Z is a principal ring. 

A similar example is the ring of polynomials in one variable over a field, 
as will be proved in Chapter IV, also using the Euclidean algorithm. 

Let R be the ring of algebraic integers in a number field K. (For 
definitions, see Chapter VII.) Then R is not necessarily principal, but let p 
be a prime ideal, and let R p be the ring of all elements a/b with a, b e R and 
b $ p. Then in algebraic number theory, it is shown that R p is principal, with 
one prime ideal m p consisting of all elements a/b as above but with ae p. 
See Exercises 15, 16, and 17. 

An example from analysis. Let A be the set of entire functions on the 
complex plane. Then A is a commutative ring, and every finitely generated 
ideal is principal. Given a discrete set of complex numbers {zj and integers 
m, ^ 0, there exists an entire function / having zeros at z f of multiplicity 
and no other zeros. Every principal ideal is of the form Af for some such /. 
The group of units A* in A consists of the functions which have no zeros. It 
is a nice exercise in analysis to prove the above statements (using the 
Weierstrass factorization theorem). 

We now return to general notions. Let a, b be ideals of A. We define ab 
to be the set of all sums 

with x, e a and y t e b. Then one verifies immediately that ab is an ideal, and 
that the set of ideals forms a multiplicative monoid, the unit element being 
the ring itself. This unit element is called the unit ideal, and is often written (1). 
If a, b are left ideals, we define their product ab as above. It is also a left ideal, 
and if a, b, c are left ideals, then we again have associativity: (ab)c = a(bc). 

If a, b are left ideals of A , then a + b (the sum being taken as additive 
subgroup of A) is obviously a left ideal. Similarly for right and two-sided 
ideals. Thus ideals also form a monoid under addition. We also have 
distributivity: If a l9 ..., a n , b are ideals of A, then clearly 

b(a x -f- • • • -f* a n ) = ba^ H- • • • + ba„, 

and similarly on the other side. (However, the set of ideals does not form a 
ring!) 

Let abea left ideal. Define aA to be the set of all sums a^x^ -f ••• + a n x n 
with a t e a and x t e A. Then aA is an ideal (two-sided). 

Suppose that A is commutative. Let a, b be ideals. Then trivially 

abcanb, 

but equality does not necessarily hold. However, as an exercise, prove that if 
û + b = A then ab = a n b. 

As should be known to the reader, the integers Z satisfy another property 
besides every ideal being principal, namely unique factorization into primes. 
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We shall discuss the general phenomenon in §4. Be it noted here only that if 
a ring A has the property of unique factorization into prime elements, and p 
is a prime element, then the ideal (p) is prime, and the ring R {p) (defined as 
above) is principal. See Exercise 6. Thus principal rings may be obtained in 
a natural way from rings which are not principal. 

As Dedekind found out, some form of unique factorization can be re- 
covered in some cases, replacing unique factorization into prime elements by 
unique factorization of (non-zero) ideals into prime ideals. 

Example. There are cases when the non-zero ideals give rise to a group. 
Let o be a subring of a field K such that every element of K is a quotient of 
elements of o; that is, of the form a/b with a, b e o and b # 0. By a fractional 
ideal û we mean a non-zero additive subgroup of K such that oûcû (and 
therefore o a = a since o contains the unit element); and such that there exists 
an element c e o, c ^ 0, such that ca c: o. We might say that a fractional 
ideal has bounded denominator. A Dedekind ring is a ring o as above such 
that the fractional ideals form a group under multiplication. As proved in 
books on algebraic number theory, the ring of algebraic integers in a number 
field is a Dedekind ring. Do Exercise 14 to get the property of unique 
factorization into prime ideals. See Exercise 7 of Chapter VII for a sketch of 
this proof. 

If a e K, a ^ 0, then o a is a fractional ideal, and such ideals are called 
principal The principal fractional ideals form a subgroup. The factor group 
is called the ideal class group, or Picard group of o, and is denoted by Pic(o). 
See Exercises 13-19 for some elementary facts about Dedekind rings. It is 
a basic problem to determine Pic(o) for various Dedekind rings arising in 
algebraic number theory and function theory. See my book Algebraic Num- 
ber Theory for the beginnings of the theory in number fields. In the case of 
function theory, one is led to questions in algebraic geometry, notably the 
study of groups of divisor classes on algebraic varieties and all that this 
entails. The property that the fractional ideals form a group is essentially 
associated with the ring having “dimension 1” (which we do not define 
here). In general one is led into the study of modules under various equiva- 
lence relations; see for instance the comments at the end of Chapter III, §4. 

We return to the general theory of rings. 

By a ring-homomorphism one means a mapping f.A^B where A , B are 
rings, and such that / is a monoid-homomorphism for the multiplicative 
structures on A and £, and also a monoid-homomorphism for the additive 
structure. In other words, / must satisfy: 

f(a + a') = f(a) -f /(*'), f(aa') = f(a)f(a'\ 

/0) = 1, /(0) = 0, 

for all a , a ' e A. Its kernel is defined to be the kernel of / viewed as additive 
homomorphism. 
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The kernel of a ring-homomorphism f:A^>B is an ideal of A, as one 
verifies at once. 

Conversely, let û be an ideal of the ring A. We can construct the factor 
ring A/a as follows. Viewing A and a as additive groups, let Aja be the 
factor group. We define a multiplicative law of composition on A/a: If 
x 4 - a and y + û are two cosets of a, we define (x -f a)(y -f a) to be the coset 
(xy -f a). This coset is well defined, for if x ì9 y { are in the same coset as x, y 
respectively, then one verifies at once that x^ is in the same coset as xy. 
Our multiplicative law of composition is then obviously associative, has a 
unit element, namely the coset 1 + a, and the distributive law is satisfied 
since it is satisfied for coset representatives. We have therefore defined a ring 
structure on A/a, and the canonical map 

f:A^ A/a 

is then clearly a ring-homomorphism. 

If g: A ~+ A ' is a ring-homomorphism whose hernel contains a, then there 
exists a unique ring-homomorphism gA/a^ A' making the following dia- 
gram commutative: 


A —2—> A’ 



A/a 


Indeed, viewing f g as group-homomorphisms (for the additive struc- 
tures), there is a unique group-homomorphism g * making our diagram 
commutative. We contend that g + is in fact a ring-homomorphism. We 
could leave the trivial proof to the reader, but we carry it out in full. If 
x g A, then #(x) = g*f(x). Hence for x, y e A, 

9*(f(x)f(y)) = g*(f(xy)) = g(xý) = g(x)g(ý) 

= 9*f(x)g*f(y)- 

Given £, rj e A/a, there exist x, y e A such that Ç = f(x) and rj = f(y). Since 
/(1) = 1, we get 0*/(l) = g(ì) = 1 , and hence the two conditions that g * be a 
multiplicative monoid-homomorphism are satisfied, as was to be shown. 

The statement we have just proved is equivalent to saying that the 
canonical map f:A-+ A/a is universal in the category of homomorphisms 
whose kernel contains a. 

Let A be a ring, and denote its unit element by e for the moment. The 
map 

kZ-+A 

such that À(n) — ne is a ring-homomorphism (obvious), and its kernel is an 
ideal (n), generated by an integer n ^ 0. We have a canonical injective homo- 
morphism Z/nZ -> A, which is a (ring) isomorphism between Z/nZ and a 
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subring of A. If nZ is a prime ideal, then n = 0 or n = p for some prime number 
p. In the first case, A contains as a subring a ring which is isomorphic to Z, and 
which is often identified with Z. In that case, we say that A has characteristic 
0. If on the other hand n = p, then we say that A has characteristic p 9 and A 
contains (an isomorphic image of) Z/pZ as a subring. We abbreviate Z/pZ by 

If K is a field, then K has characteristic 0 or p > 0. In the first case, K 
contains as a subfield an isomorphic image of the rational numbers, and in 
the second case, it contains an isomorphic image of F p . In either case, this 
subfield will be called the prime field (contained in K). Since this prime field 
is the smallest subfield of K containing 1 and has no automorphism except 
the identity, it is customary to identify it with Q or F p as the case may be. 
By the prime ring (in K) we shall mean either the integers Z if K has 
characteristic 0, or F p if K has characteristic p. 

Let A be a subring of a ring B. Let 5 be a subset of B commuting with 
A; in other words we have as = sa for all a e A and se S. We denote by 
A [S] the set of all elements 

X a i x -i n S l * * ’ s n ”> 

the sum ranging over a finite number of n-tuples (i í9 ..., i n ) of integers ^ 0, 
and a ií ... in eA, s í9 ..., s n e S. If B = /4[S], we say that S is a set of 
generators (or more precisely, ring generators) for B over A, or that B is 
generated by S over A. If S is finite, we say that B is fìnitely generated 
as a ring over A. One might say that /1[S] consists of all not-necessarily- 
commutative polynomials in elements of S with coefficients in A. Note that 
elements of S may not commute with each other. 

Example. The ring of matrices over a field is finitely generated over that 
field, but matrices don’t necessarily commute. 

As with groups, we observe that a homomorphism is uniquely determined 
by its effect on generators. In other words, let f:A-*A' be a ring- 
homomorphism, and let B = A [5] as above. Then there exists at most one 
extension of / to a ring-homomorphism of B having prescribed values on S. 
Let A be a ring, a an ideal, and S a subset of A. We write 

5 = 0 (mod a) 

if 5 c= a. If x 9 y e A 9 we write 

x = y (mod a) 

if x — y e a. If a is principal, equal to ( a) 9 then we also write 

x == y (mod a). 

If /: A -► A/a is the canonical homomorphism, then x = y (mod a) means 
that f(x) = f(y). The congruence notation is sometimes convenient when we 
want to avoid writing explicitly the canonical map /. 
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The factor ring A/a is also called a residue class ring. Cosets of a in A 
are called residue classes modulo û, and if x e A, then the coset x + û is 
called the residue class of x modulo a. 

We have defined the notion of an isomorphism in any category, and so a 
ring-isomorphism is a ring-homomorphism which has a two-sided inverse. 
As usual we have the criterion: 

A ring-homomorphism f:A^B which is bijective is an isomorphism. 

Indeed, there exists a set-theoretic inverse g: B -► A, and it is trivial to verify 
that g is a ring-homomorphism. 

Instead of saying “ring-homomorphism” we sometimes say simply 
“homomorphism” if the reference to rings is clear. We note that rings form 
a category (the morphisms being the homomorphisms). 

Let f:A-+B be a ring-homomorphism. Then the image f(A) of f is a 

subring of B. Proof obvious. 

It is clear that an injective ring-homomorphism f:A-^B establishes a 
ring-isomorphism between A and its image. Such a homomorphism will be 
called an embedding (of rings). 

Let f:A^A' be a ring-homomorphism, and let a' be an ideal of A'. 
Then f~ l (a') is an ideal a in A , and we have an induced injective homo- 
morphism 

A/a -► A'/a'. 

The trivial proof is left to the reader. 

Proposition 1.1. Products exist in the category of rings. 

In fact, let {^4 £ } íe/ be a family of rings, and let A = their product 

as additive abelian groups. We define a multiplication in A in the obvious 
way: If (x f ) ie/ and (y,) ie/ are two elements of A , we define their product to 
be (xiyfiçj, i.e. we define multiplication componentwise, just as we did for 
addition. The multiplicative unit is simply the element of the product whose 
i-th component is the unit element of A { . It is then clear that we obtain a 
ring structure on A, and that the projection on the i-th factor is a ring- 
homomorphism. Furthermore, A together with these projections clearly 
satisfies the required universal property. 

Note that the usual inclusion of A t on the i-th factor is not a ring- 
homomorphism because it does not map the unit element e x of A { on the unit 
element of A. Indeed, it maps e t on the element of A having as i-th 
component, and 0 (= 0 f ) as all other components. 

Let A be a ring. Elements x, y of A are said to be zero divisors if x # 0, 
y # 0, and xy = 0. Most of the rings without zero divisors which we con- 
sider will be commutative. In view of this, we define a ring A to be entire if 
1 # 0, if A is commutative, and if there are no zero divisors in the ring. 
(Entire rings are also called integral domains. However, linguistically, I feel 
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the need for an adjective. “Integral” would do, except that in English, 
“integral” has been used for “integral over a ring” as in Chapter VII, §1. In 
French, as in English, two words exist with similar roots: “integral” and 
“entire”. The French have used both words. Why not do the same in 
English? There is a slight psychological impediment, in that it would have 
been better if the use of “integral” and “entire” were reversed to fit the 
long-standing French use. I don’t know what to do about this.) 

Examples. The ring of integers Z is without zero divisors, and is there- 
fore entire. If S is a set with more than 2 elements, and A is a ring with 
1 # 0, then the ring of mappings Map(S, A) has zero divisors. (Proof?) 

Let m be a positive integer # 1. The ring Z/mZ has zero divisors if and 
only if m is not a prime number. (Proof left as an exercise.) The ring of 
n x n matrices over a field has zero divisors if n ^ 2. (Proof?) 

The next criterion is used very frequently. 

Let A be an entire ring , and let a , b be non-zero elements of A. Then a , b 

generate the same ideal if and only if there exists a unit u of A such that 

b = au. 

Proof If such a unit exists we have Ab = Aua = Aa. Conversely, 
assume Aa = Ab. Then we can write a = bc and b = ad with some elements 
c, d e A. Hence a = adc , whence a( 1 — dc) = 0, and therefore dc = 1. Hence 
c is a unit. 


§2. COMMUTATIVE RINGS 

Throughout this section , vvc let A denote a commutative ring. 

A prime ideal in A is an ideal p # A such that A/p is entire. Equiva- 
lently, we could say that it is an ideal p # A such that, whenever x, y e A 
and xy e p, then x e p or y e p. A prime ideal is often called simply a prime. 

Let m be an ideal. We say that m is a maximal ideal if m # A and if 
there is no ideal a # A containing m and # m. 

Every maximal ideal is prime. 

Proof. Let m be maximal and let x, y e A be such that xy e m. Suppose 
x £ m. Then m + Ax is an ideal properly containing m, hence equal to A. 
Hence we can write 


1 = u + ax 

with u e m and a e A. Multiplying by y we find 
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y = yu + axy, 

whence y e m and m is therefore prime. 

Let û be an ideal # A. Then a is contained in some maximal ideal m. 

Proof. The set of ideals containing û and # A is inductively ordered by 
ascending inclusion. Indeed, if {b,} is a totally ordered set of such ideals, 
then 1 ^ b f for any i, and hence 1 does not lie in the ideal b = (J b f , which 
dominates all b f . If m is a maximal element in our set, then m # A and m is 
a maximal ideal, as desired. 

The ideal {0} is a prime ideal of A if and only if A is entire. 

(Proof obvious.) 

We defìned a field K to be a commutative ring such that 1 + 0, and such 
that the multiplicative monoid of non-zero elements of K is a group (i.e. such 
that whenever x e K and x + 0 then there exists an inverse for x). We note that 
the only ideals of a field K are K and the zero ideal. 

If m is a maximal ideal of A, then A/m is a field. 

Proof If x e A, we denote by x its residue class mod m. Since m # A 
we note that A/m has a unit element # 0. Any non-zero element of A/m can 
be written as x for some x e A, x <£ m. To find its inverse, note that m + Ax 
is an ideal of A ^ m and hence equal to A. Hence we can write 

1 = u + yx 

with uem and y e A. This means that ŷx = 1 (i.e. = I) and hence that x has 
an inverse, as desired. 

Conversely, we leave it as an exercise to the reader to prove that: 

If m is an ideal of A such that A/m is a field, then m is maximal. 

Let f'.A^A'bea homomorphism of commutative rings. Let p' be a prime 

ideal of A', and let p = / _1 (p'). Then p is prime. 

To prove this, let x, y e A, and xy e p. Suppose x £ p. Then f(x) $ p'. 
But f(x)f(y) = f(xy) e p'. Hence f(y) e p', as desired. 

As an exercise, prove that if / is surjective, and if m' is maximal in A', 
then / _1 (m') is maximal in A. 

Example. Let Z be the ring of integers. Since an ideal is also an additive 
subgroup of Z, every ideal + {0} is principal, of the form nZ for some integer 
n > 0 (uniquely determined by the ideal). Let p be a prime ideal + {0}, 
p = nZ. Then n must be a prime number, as follows essentially directly from 
the definition of a prime ideal. Conversely, if p is a prime number, then pZ is 
a prime ideal (trivial exercise). Furthermore, pZ is a maximal ideal. Indeed, 
suppose pZ contained in some ideal nZ . Then p = nm for some integer m , whence 
n = p or n = 1, thereby proving pZ maximal. 
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If n is an integer, the factor ring Z/nZ is called the ring of integers 
modulo n. We also denote 

Z/nZ = Z(n). 

If n is a prime number p, then the ring of integers modulo p is in fact a field, 
denoted by F p . In particular, the multiplicative group of F p is called the 
group of non-zero integers modulo p. From the elementary properties of 
groups, we get a standard fact of elementary number theory: If x is an 
integer # 0 (mod p), then x p ~ l = 1 (mod p). (For simplicity, it is customary 
to write mod p instead of mod pZ, and similarly to write mod n instead of 
mod nZ for any integer n.) Similarly, given an integer n > 1, the units in the 
ring Z/nZ consist of those residue classes mod nZ which are represented by 
integers m i 0 and prime to n. The order of the group of units in Z/nZ is 
called by definition <p(n ) (where cp is known as the Euler phi-function). 
Consequently, if x is an integer prime to n, then x 9(n) = 1 (mod n). 


Theorem 2.1. (Chinese Remainder Theorem). Let a l9 ..., a n be ideals of 
A such that a t -h ùj = A for all i i j. Given elements x í9 ..., x n e A 9 there 
exists x e A such that x = x f (mod a f ) for all i. 

Proof If n = 2, we have an expression 

1 = + a 2 

for some elements a x e a i9 and we let x = x 2 a y + x t a 2 . 

For each i ^ 2 we can find elements a t e a x and b t e a t such that 

a t + b t = 1, i ^ 2. 

n 

The product f] (a f + b { ) is equal to 1, and lies in 

i=2 

n 

°i + n a >’ 

i= 2 

i.e. in a^ + a 2 * * * a n . Hence 

n 

<»! + n a .= a ■ 

i = 2 

By the theorem for n = 2, we can find an element y { e A such that 


yi = 1 (modaj, 



We find similarly elements y 2 > •••> y n such that 

yj = 1 (mod aj) and y ỳ = 0 (mod a f ) for i i j. 
Then x = x^y^ -l-+ x n y n satisfies our requirements. 
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In the same vein as above, we observe that if a í9 
ring A such that 


ûj + ••• + a n = A, 

and if vj,..., v n are positive integers, then 

û;» + - + cC- = a 

The proof is trivial, and is left as an exercise. 


..., a n are ideals of a 


Corollary 2.2. Let a l9 ..., a n be ideals of A. Assume that a f + ùj = A for 
i # j. Let 

/: A -» fl A/a ( = {A/aJ X ■ ■■ x {A/a n ) 

í = 1 

òc í/ze map of A into the product induced by the canonical map of A onto 

n 

A/a t for each factor. Then the kernel of f is Q a i9 and f is surjective , 
thus giving an isomorphism 1=1 

a/ç\ Oi % n A / a i- 


Proof. That the kernel of / is what we said it is, is obvious. The 
surjectivity follows from the theorem. 

The theorem and its corollary are frequently applied to the ring of 
integers Z and to distinct prime ideals (pj, ..., ( p n ). These satisfy the 
hypothesis of the theorem since they are maximal. Similarly, one could take 
integers m ì9 ..., m n which are relatively prime in pairs, and apply the theorem 
to the principal ideals (m^) = mfZ, ..., (m n ) = m n Z. This is the ultraclassical 
case of the Chinese remainder theorem. 

In particular, let m be an integer > 1, and let 


m = n P? 

i 

be a factorization of m into primes, with exponents r,1. Then we have a 
ring-isomorphism: 

Z/mZ % II z /Pi iZ • 

i 


If A is a ring, we denote as usual by A* the multiplicative group of invertible 
elements of A. We leave the following assertions as exercises: 

The preceding ring-isomorphism of Z/mZ onto the product induces a group- 
isomorphism 

(Z/mZ)* * n (Z/P?Z)*. 

i 

In view of our isomorphism, we have 

<p(m) = n <p(p. ri )- 

i 
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If p is a prime number and r an integer ^ 1, then 

<P(p r ) = (P~ 1 )P r_1 - 

One proves this last formula by induction. If r = 1, then Z/pZ is a field, and 
the multiplicative group of that field has order p — 1. Let r be ^ 1, and 
consider the canonical ring-homomorphism 

Z/p r+í Z-+Z/p r Z , 

arising from the inclusion of ideals (p r+1 ) <= ( p r ). We get an induced group- 
homomorphism 

A: (Z/p r+í Z)* -► (Z/p r Z)*, 

which is surjective because any integer a which represents an element of 
Z/p r Z and is prime to p will represent an element of (Z/p r+1 Z)*. Let a be an 
integer representing an element of (Z/p r+1 Z)*, such that À(a) = 1. Then 

a = 1 (mod p r Z), 

and hence we can write 

a = 1 + xp r (mod p r+1 Z) 

for some xeZ. Letting x = 0, 1, ..., p — 1 gives rise to p distinct elements of 
(Z/p r+1 Z)*, all of which are in the kernel of Â. Furthermore, the element x 
above can be selected to be one of these p integers because every integer is 
congruent to one of these p integers modulo (p). Hence the kernel of k has 
order p, and our formula is proved. 

Note that the kernel of Â is isomorphic to Z/pZ. (Proof?) 

Application: The ring of endomorphisms of a cyclic group. One of the 

first examples of a ring is the ring of endomorphisms of an abelian group. In 
the case of a cyclic group, we have the following complete determination. 

Theorem 2.3. Let A be a cyclic group of order n. For each k e Z let 
f k \A->A be the endomorphism x i-> kx (writing A additively). Then k i—» f k 
induces a ring isomorphism Z/nZ ~ End(A), and a group isomorphism 
(Z/nZ)* « Aut(A). 

Proof. Recall that the additive group structure on End(A) is simply 
addition of mappings, and the multiplication is composition of mappings. 
The fact that /c i—► /* is a ring-homomorphism is then a restatement of the 
formulas 


1 a = a, (k + k')a = ka + k!a , and (kk')a = k(k f a) 

for /c, k! eZ and ae A. If a is a generator of A, then ka = 0 if and only if 
k = 0 mod n, so Z/nZ is embedded in End(A). On the other hand, let 
/: A -► A be an endomorphism. Again for a generator a , we have f(a) = ka 



POLYNOMIALS AND GROUP RINGS 97 


II. §3 

for some /c, whence / = f k since every xe A is of the form ma for some 
m e Z, and 

f(x) = f(ma) = mf(a) = mka = kma = kx. 

This proves the isomorphism Z/nZ % End(,4). Furthermore, if ke(Z/nZ)* 
then there exists k' such that kk ' = 1 mod n, so f k has the inverse f k . and / fc is 
an automorphism. Conversely, given any automorphism / with inverse g, we 
know from the first part of the proof that / = /*, g = g k > for some k, /c', and 
/o g = id means that kk' = 1 mod «, so k, k' e (Z/nZ)*. This proves the 
isomorphism (ZlnZ)* ~ Aut(>t). 

Note that if /1 is written as a multiplicative group C, then the map / k is 
given by x (—►**. For instance, let \i n be the group of n- th roots of unity in C. 
Then all automorphisms of are given by 

with k e (Z/nZ)*. 


§3. POLYNOMIALS AND GROUP RINGS 

Although all readers will have met polynomial functions, this section lays 
the ground work for polynomials in general. One needs polynomials over 
arbitrary rings in many contexts. For one thing, there are polynomials over 
a finite field which cannot be identified with polynomial functions in that 
field. One needs polynomials with integer coeíficients, and one needs to 
reduce these polynomials mod p for primes p. One needs polynomials over 
arbitrary commutative rings, both in algebraic geometry and in analysis, for 
instance the ring of polynomial differential operators. We also have seen the 
example of a ring B = A[S] generated by a set of elements over a ring A. 
We now give a systematic account of the basic definitions of polynomials 
over a commutative ring A. 

We want to give a meaning to an expression such as 

a o + a^ X + • • • + AA 

where a { e A and X is a “variable”. There are several devices for doing so, 
and we pick one of them. (I picked another in my Undergraduate Algebra.) 
Consider an infinite cyclic group generated by an element X. We let 5 be the 
subset consisting of powers X r with r ^ 0. Then S is a monoid. We define 
the set of polynomials A [X] to be the set of functions 5 -► A which are equal 
to 0 except for a finite number of elements of S. For each element a e A we 
denote by aX n the function which has the value a on X n and the value 0 for 
all other elements of 5. Then it is immediate that a polynomial can be 
written uniquely as a finite sum 
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a 0 X° + --- + a n X n 

for some integer ne N and a t e A. Such a polynomial is denoted by f(X). 
The elements a^e A are called the coefficients of /. We define the product 
according to the convolution rule. Thus, given polynomials 

f(X) = X a^X' and g(X) = £ 

i= 0 j=0 

we define the product to be 

m + n / \ 

f(X)g(X)= £ I afy )X k . 
k=0 \i+j=k J 

It is immediately verified that this product is associative and distributive. 
We shall give the details of associativity in the more general context of a 
monoid ring below. Observe that there is a unit element, namely 1A"°. 
There is also an embedding 

A^> A[XI given by a^aX °. 

One usually does not distinguish a from its image in A [X\, and one writes a 
instead of aX°. Note that for c e A we have then cf(x) = 

Observe that by our definition, we have an equality of polynomials 

X afX 1 = X b t X l 

if and only if a { = for all i. 

Let A be a subring of a commutative ring B. Let xeB. If / e A [X\ is a 
polynomial, we may then define the associated polynomial function 

Íb'.B^B 

by letting 

/bW = f(x) = a 0 + a Y x + • • • + a n x n . 


Given an element b e £, directly from the definition of multiplication of 
polynomials, we find: 

The association 

ev b :/H->/(fe) 


is a ring homomorphism of A [X] into B. 

This homomorphism is called the evaluation homomorphism, and is also said 
to be obtained by substituting b for X in the polynomial. (Cf. Proposition 
3.1 below.) 

Let x e B. We now see that the subring A [x] of B generated by x over A 
is the ring of all polynomial values /(x), for / e A[X~\. If the evaluation map 
/ f(x) gives an isomorphism of A[X\ with A\x\ then we say that x is 
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transcendental over A , or that x is a variable over A. In particular, X is a 
variable over A. 

Example. Let a = y/l. Then the set of all real numbers of the form 
a + ba, with a, b e Z, is a subring of the real numbers, generated by Jl. 
Note that a is not transcendental over Z, because the polynomial X 2 — 2 lies 
in the kernel of the evaluation map / 1 —► /(^/2). On the other hand, it can be 
shown that e = 2.718... and n are transcendental over Q. See Appendix 1. 

Example. Let p be a prime number and let K = Z/pZ. Then K is a 
field. Let f(X) = X p — X e K[X~\. Then / is not the zero polynomial. But 
f K is the zero function. Indeed, f K ( 0) = 0. If x e K, x # 0, then since the 
multiplicative group of K has order p — 1, it follows that x p_1 = 1, whence 
x p = x, so /(x) = 0. Thus a non-zero polynomial gives rise to the zero 
function on K. 

There is another homomorphism of the polynomial ring having to do 
with the coefficients. Let 

\ A -> B 

be a homomorphism of commutative rings. Then there is an associated 
homomorphism of the polynomial rings A[X\ -► B[X\, such that 

f(X) = X a.X^Y. VMX 1 = ( <pf)(X )• 

The verification that this mapping is a homomorphism is immediate, and 
further details will be given below in Proposition 3.2, in a more general 
context. We call /(—►</)/ the reduction map. 

Examples. In some applications the map cp may be an isomorphism. 
For instance, if f(X) has complex coefficients, then its complex conju- 
gate f(X) = YâiX' is obtained by applying complex conjugation to its 
coefficients. 

Let p be a prime ideal of A. Let (p\A-^A' be the canonical homo- 
morphism of A onto A/p. If f(X) is a polynomial in A[X\, then (pf will 
sometimes be called the reduction of / modulo p. 

For example, taking A = Z and p = (p) where p is a prime number, we 
can speak of the polynomial 3X 4 — X + 2 as a polynomial mod 5, viewing 
the coefficients 3, — 1, 2 as integers mod 5, i.e. elements of Z/5Z. 

We may now combine the evaluation map and the reduction map to 
generalize the evaluation map. 

Let 9 : A —» B be a homomorphism of commutative rings. 

Let x e B. There is a unique homomorphism extending 9 

A [X\ -* B such that X\—>x, 

and for this homomorphism , ►^<p(« i )x I . 
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The homomorphism of the above statement may be viewed as the composite 

A[X\ - 

where the first map applies (p to the coeíficients of a polynomial, and the 
second map is the evaluation at x previously discussed. 

Example. In Chapter IX, §2 and §3, we shall discuss such a situation in 
several variables, when ((pf) (x) = 0, in which case x is called a zero of the 
polynomial /. 

n 

When writing a polynomial f(X) = £ 0 ;X l , if a n ^0 then we define n 

i =1 

to be the degree of /. Thus the degree of / is the smallest integer n such 
that a r = 0 for r > n. If / = 0 (i.e. / is the zero polynomial), then by con- 
vention we define the degree of / to be — oo. We agree to the convention 
that 

—oo H- oo = —oo, —oo + n = — oo, —oo < n, 

for all n e Z, and no other operation with — oo is defined. A polynomial of 
degree 1 is also called a linear polynomial. If / # 0 and deg/ = n, then we 
call a n the leading coefficient of /. We call a 0 its constant term. 

Let 

g(X) = b 0 + --- + b m X m 

be a polynomial in A[X\ of degree m , and assume g # 0. Then 
f(X)g(X) = a 0 b 0 + --- + a n b m X m+n . 

Therefore: 

If we assume that at least one of the leading coefficients a n or b m is not a 
divisor of 0 in A , then 

deg (fg) = deg / + deg g 

and the leading coefficient of fg is a n b m . This holds in particular when a n or 
b m is a unit in A, or when A is entire. Consequently , when A is entire , 
A \_X\ is also entire. 

If / or g = 0, then we still have 

deg(/óf) = deg / + deg g 

if we agree that — oo + m = — oo for any integer m. 

One verifies trivially that for any polynomial f g e A[X~\ we have 

deg(/ + 0 ) g max(deg /, deg g\ 

again agreeing that — oo < m for every integer m. 
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Polynomials in several variables 

We now go to polynomials in several variables. Let A be a subring of 
a commutative ring B. Let x ì9 x n e B. For each n-tuple of integers 
(vj,..., v„) = (v) e N", we use vector notation, letting (x) = (x t ,..., x„), and 

A^ (v) (x) = xî 1 ••• x„ n . 

The set of such elements forms a monoid under multiplication. Let 
A[x] = A[x i,...,x n ] be the subring of B generated by x ì9 ... 9 x n over A. 
Then every element of A [x] can be written as a finite sum 

Z «(v)AÍ(v)W with a (v) e A. 

Using the construction of polynomials in one variable repeatedly, we may 
form the ring 

A[X í9 ... 9 XJ = A[X í ][X 2 -]-[X n l 

selecting X n to be a variable over A[X Í9 ..., X w _j]. Then every element / of 
A[X í9 ..., X n ~\ = A[X\ has a unique expression as a finite sum 

/=lFi .with f j eA[X l9 ... 9 X n _ í l 

j =o 

Therefore by induction we can write / uniquely as a sum 

/= I ( X 

v„ = 0 \ Vl ,...,v„-i / 

= X«(v,M (v) W = Xa (v) ^' •••*;” 

with elements a {v) e A, which are called the coefficients of /. The products 

M (v) (x) = xr ■••*;” 

will be called primitive monomials. Elements of A[X\ are called polynomials 
(in n variables). We call a (v) its coefficients. 

Just as in the one-variable case, we have an evaluation map. Given (x) = 
(x x ,..., x„) and / as above, we define 

f(x) = X «(v)M(v,(x) = X • • • x„ v ”. 

Then the evaluation map 

ev (x) : A[X\ -► B such that / 1 —► /(x) 

is a ring-homomorphism. It may be viewed as the composite of the suc- 
cessive evaluation maps in one variable X t i—>x ( for i = n,..., 1, because 
A[X\^B[X\. 

Just as for one variable, if f(X)eA[X\ is a polynomial in n variables, 
then we obtain a function 
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Íb'* B n -> B by (x) i ► f{x). 

We say that f(x) is obtained by substituting (x) for (X) in /, or by specializing 
(X) to (x). As for one variable, if K is a finite field, and / e K\_X~] one may 
have / # 0 but f K = 0. Cf. Chapter IV, Theorem 1.4 and its corollaries. 

Next let (p: A -> B be a homomorphism of commutative rings. Then we 
have the reduction map (generalized in Proposition 3.2 below) 

f(X) = X a (v) M (v) (2Q^X cp(a iv) )M {v) (X) = (<Pf)(X)- 

We can also compose the evaluation and reduction. An element (x) g B n is 
called a zero of / if (cpf)(x) = 0. Such zeros will be studied in Chapter IX. 

Go back to A as a subring of B. Elements x ì9 ... 9 x n e B are called 
algebraically independent over A if the evaluation map 

is injective. Equivalently, we could say that if / e A[X] is a polynomial and 
f(x) = 0, then / = 0; in other words, there are no non-trivial polynomial 
relations among x ìf ... t x n over A. 

Example. It is not known if e and n are algebraically independent over 
the rationals. It is not even known if e -f n is rational. 

We now come to the notion of degree for several variables. By the degree 
of a primitive monomial 

M M (X) = xr •••*;- 

we shall mean the integer |v| = v 1 -t-*** + v ll (which is ^ 0). 

A polynomial 

aXf ••• X n n (a g A) 

will be called a monomial (not necessarily primitive). 

If f(X) is a polynomial in A[X] written as 

/(x) = x« (v) x 1 Vi ---x;”, 

then either / = 0, in which case we say that its degree is — oo, or / # 0, and 
then we define the degree of / to be the maximum of the degrees of the 
monomials M {V) (X) such that a {v) # 0. (Such monomials are said to occur in 
the polynomial.) We note that the degree of / is 0 if and only if 

f(X) = a 0 X? ■■■ X? 

for some a 0 g A 9 a 0 # 0. We also write this polynomial simply f(X) = a 09 i.e. 
writing 1 instead of 

in other words, we identify the polynomial with the constant a 0 . 
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Note that a polynomial f(X ì9 ..., X„) in n variables can be viewed as a 
polynomial in X n with coefficients in A[X i9 ..., X n _j] (if n ^ 2). Indeed, we 
can write 

nx)= d £fj(x i,...,z n _iW, 

j=o 

where f is an element of A[X Í9 ..., X n .f\. By the degree of/in X n we shall 
mean its degree when viewed as a polynomial in X n with coefficients in 
A[X U ..., X n _f\. One sees easily that if this degree is d, then d is the largest 
integer occurring as an exponent of X n in a monomial 

0^x1'-x;* 

with a {v) # 0. Similarly, we define the degree of / in each variable X^ 
(i = 1,..., n). 

The degree of / in each variable is of course usually different from its 
degree (which is sometimes called the total degree if there is need to prevent 
ambiguity). For instance, 

X\X 2 + Xl 

has total degree 4, and has degree 3 in X x and 2 in X 2 . 

As a matter of notation, we shall often abbreviate “degree” by “deg.” 

For each integer d ^ 0, given a polynomial f let f (d) be the sum of all 
monomials occurring in / and having degree d. Then 

/ = Z/ ( ‘ ,, • 

d 

Suppose / ^ 0. We say that / is homogeneous of degree d if / = / (<,) ; thus / 
can be written in the form 

f(X) = Z ûívi^r* • • • x n" with v t + • • • + v„ = d if a (v) # 0. 

We shall leave it as an exercise to prove that a non-zero polynomial f in n 
variables over A is homogeneous of degree d if and only if for every set of 
n + 1 algebraically independent elements u, t n over A we have 

f(ut u ...,ut n ) = u 4 f(t u ...,t n ). 

We note that if f g are homogeneous of degree d, e respectively, and 
fg # 0, then fg is homogeneous of degree d + e. If d = e and / + g # 0, then 
/ + g is homogeneous of degree d. 

Remark. In view of the isomorphism 

A[X i,..., X n \ ~ A[t í9 ..., t n \ 

between the polynomial ring in n variables and a ring generated over A by n 



104 RINGS 


II. §3 

algebraically independent elements, we can apply all the terminology we have 
defined for polynomials, to elements of A[t í9 ..., t n ]. Thus we can speak of 
the degree of an element in 4[í], and the rules for the degree of a product or 
sum hold. In fact, we shall also call elements of A[t\ polynomials in (í). 
Algebraically independent elements will also be called variables (or indepen- 
dent variables), and any distinction which we make between A\_X\ and A[t] 
is more psychological than mathematical. 

Suppose next that A is entire. By what we know of polynomials in one 
variable and induction, it follows that A[X Í9 ..., X n \ is entire. In particular, 
suppose / has degree d and g has degree e. Write 

/ = + terms of lower degree, 

g = g {e) + terms of lower degree. 

Then fg = f id) g {e) -f terms of lower degree, and if fg # 0 then f id) g ie) # 0. 
Thus we find: 


deg (fg) = deg / + deg g 9 
deg(/ + g)£ max(deg / deg g). 

We are now finished with the basic terminology of polynomials. We end 
this section by indicating how the construction of polynomials is actually a 
special case of another construction which is used in other contexts. Inter- 
ested readers can skip immediately to Chapter IV, giving further important 
properties of polynomials. See also Exercise 33 of Chapter XIII for har- 
monic polynomials. 


The group ring or monoid ring 

Let A be a commutative ring. Let G be a monoid, written multiplica- 
tively. 

Let A[G~\ be the set of all maps a: G -* A such that a(x) = 0 for almost 
all xeG. We define addition in A[G\ to be the ordinary addition of 
mappings into an abelian (additive) group. If a, P e A[G\ we define their 
product aj? by the rule 


(a/S)(z)= X a (x)p(y). 

xy=z 

The sum is taken over all pairs (x, y) with x, y e G such that xy = z. This 
sum is actually finite, because there is only a finite number of pairs of 
elements (x, y) e G x G such that a(x)/?(y) # 0. We also see that (a/?)(í) = 0 
for almost all í, and thus belongs to our set A[G\. 

The axioms for a ring are trivially verified. We shall carry out the proof 
of associativity as an example. Let a, /?, y e A[G\. Then 
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((aj?)y)(z)= Z (<*P)( x )y(y) 

xy=z 

= z r z a (“)j ? ( i ')iy(>') 

xy=z L ut?=JC J 

= z I" Z *(«)^p)y(y)l 

xy=z L uy=JC J 

= Z «(“)£('’)?(>')> 

( u ,t?,y) 

uvy=z 

this last sum being taken over all triples (u v, .y) whose product is z. This 
last sum is now symmetric, and if we had computed ( a(py))(z ), we would 
have found this sum also. This proves associativity. 

The unit element of A [G] is the function S such that ô(e) = 1 and 
<5(x) = 0 for all x e G, x # e. It is trivial to verify that a = Scn = ccò for all 
a e A[G]. 

We shall now adopt a notation which will make the structure of A[G] 
clearer. Let ae A and xeG. We denote by a • x (and sometimes also by ax) 
the function whose value at x is a , and whose value at y is 0 if y # x. Then 
an element a e A [G] can be written as a sum 

« = Z «(*)•*• 

xe G 

Indeed, if {a x } xeG is a set of elements of A almost all of which are 0, and we 
set 

P= Z a x 

X eG 

then for any y e G we have /?(y) = (directly from the definitions). This also 
shows that a given element a admits a unique expression as a sum £ • x. 

With our present notation, multiplication can be written 



and addition can be written 

Z z b x 'X= Z ( a x + b x ) x, 

xeG xeG xeG 

which looks the way we want it to look. Note that the unit element of A[G] 
is simply 1 • e. 

We shall now see that we can embed both A and G in a natural way in 
A[G]. 

Let (p 0 : G -► A[G] be the map given by (p 0 (x) = l*x. It is immediately 
verified that (p 0 is a multiplicative monoid-homomorphism, and is in fact 
injective, i.e. an embedding. 

Let / 0 : A -► A[G] be the map given by 

fo(à) = a-e. 
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It is immediately verified that / 0 is a ring-homomorphism, and is also an 
embedding. Thus we view A as a subring of A[G]. One calls A[G ] the 
monoid ring or monoid algebra of G over A , or the group algebra if G is a 
group. 

Examples. When G is a finite group and /1 = k is a field, then the group 
ring /c[G] will be studied in Chapter XVIII. 

Polynomial rings are special cases of the above construction. In n vari- 
ables, consider a multiplicative free abelian group of rank n. Let X í9 ..., X n 
be generators. Let G be the multiplicative subset consisting of elements 
X\ l • • • X* n with v, ^ 0 for all i. Then G is a monoid, and the reader can 
verify at once that A[G] is just A[X U ..., X n ]. 

As a matter of notation we usually omit the dot in writing an element of 
the ring A[G\ so we write simply £ a x x for such an element. 

More generally, let / = {i} be an infinite family of indices, and let S be 
the free abelian group with free generators X h written multiplicatively. Then we 
can form the polynomial ring A [X ] by taking the monoid to consist of products 

m { v ) ( x )=n x ?’ 

iel 

where of course all but a finite number of exponents v f are equal to 0. If A is 
a subring of the commutative ring £, and S is a subset of B , then we shall 
also use the following notation. Let v: S -+ N be a mapping which is 0 except 
for a finite number of elements of S. We write 

m,v,(s)= n * v<x) - 

xeS 

Thus we get polynomials in infinitely many variables. One interesting exam- 
ple of the use of such polynomials will occur in Artin’s proof of the existence 
of the algebraic closure of a field, cf. Chapter V, Theorem 2.5. 

We now consider the evaluation and reduction homomorphisms in the 
present context of monoids. 

Proposition 3.1. Let cp: G -+ G' be a homomorphism of monoids. Then 
there exists a unique homomorphism h: A[G~\ -* A[G'] such that h(x) = 
<p(x) for allx e G and h(a) = afor all a e A. 

Proof. In fact, let a = Y a x x G >4[G]. Define 

h( a) = £ <*x<P(x)- 

Then h is immediately verified to be a homomorphism of abelian groups, and 
h(x) = cp(x). Let P = Y b y y- Then 

H*P) = Y Í ( Z <p(z). 

z \xy=z J 

We get = h(oc)h(P) immediately from the hypothesis that (p(xy) = 
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cp(x)(p(y). If e is the unit element of G, then by definition (p(e) = e\ so 
Proposition 3.1 follows. 


Proposition 3.2. Let G be a monoid and let f:A-+Bbea homomorphism 
of commutative rings. Then there is a unique homomorphism 


h:A[_G-]-+B[G] 

such that 

h( £ = E /(«*)*• 

\xeG J xeG 

Proof. Since every element of A[G] has a unique expression as a sum 
Y, a x x > the formula giving h gives a well-defined map from A\_G] into B\_G]. 
This map is obviously a homomorphism of abelian groups. As for multipli- 
cation, let 

z = 'Z a x x and P = X b y y. 

Then 

W) = Z /( E 

zeG \xy=z / 

= Z I f(a x )f(by)z 

zeG xy=z 

= Mf(p). 

We have trivially /i(l) = 1, so /i is a ring-homomorphism, as was to be 
shown. 


Observe that viewing A as a subring of A[G], the restriction of h to A is 
the homomorphism / itself. In other words, if e is the unit element of G, 
then 


h(ae) = f(a)e. 


§4. LOCALIZATION 

We continue to let A be a commutative ring. 

By a multiplicative subset of A we shall mean a submonoid of A (viewed 
as a multiplicative monoid according to RI 2). In other words, it is a subset 
S containing 1, and such that, if x, y e S, then xy e S. 

We shall now construct the quotient ring of A by S, also known as the 
ring of fractions of A by S. 

We consider pairs ( a , s) with a e A and s e S. We define a relation 

(ta , s) ~ (a\ s') 

between such pairs, by the condition that there exists an element e S such 
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that 


Sj(s'a — sa') = 0. 

It is then trivially verified that this is an equivalence relation, and the 
equivalence class containing a pair ( a , s) is denoted by a/s. The set of 
equivalence classes is denoted by S _1 A. 

Note that if 0 e S, then S~'A has precisely one element, namely 0/1. 

We define a multiplication in S~ l A by the rule 

(a/s)(a'/s') = aa'/ss'. 

It is trivially verified that this is well defined. This multiplication has a unit 
element, namely 1/1, and is clearly associative. 

We define an addition in S~ X A by the rule 

a a' s'a -f sa' 

—I—- =--—. 

s s ss 

It is trivially verified that this is well defined. As an example, we give the 
proof in detail. Let a^/s^ = a/s, and let a\/s\ = a'/s'. We must show that 

(siaj -f s^aiysjsi = (s'a -f sa')/ss'. 

There exist s 2 , s 3 e S such that 

s 2 (sai — s { a) = 0, 
s 3 (s'ai - sia') = 0. 


We multiply the first equation by s 3 s'si and the second by s^ss^. We then 
add, and obtain 

s 2 s 3 [s'si(sai — Sjíz) -f ssi(s'ai — s' x a')] = 0. 

By definition, this amounts to what we want to show, namely that there 
exists an element of S (e.g. s 2 s 3 ) which when multiplied with 


ss'(siai -f Siai) — Sisi(s'a -f sa') 


yields 0. 

We observe that given a e A and s, s' e S we have 


a/s = s'a/s's. 

Thus this aspect of the elementary properties of fractions still remains true in 
our present general context. 

Finally, it is also trivially verified that our two laws of composition on 
S~ X A define a ring structure. 

We let 

(Ps'. A —► S 1 A 

be the map such that <p s (a) = a/ì. Then one sees at once that <p s is a 
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ring-homomorphism. Furthermore, every element of (p s (S ) is invertible in 
(the inverse of s/1 is 1/s). 

Let C be the category whose objects are ring-homomorphisms 

such that for every s e 5, the element /(s) is invertible in B. If /: A -► B and 
f'.A^B' are two objects of G, a morphism g of / into f' is a homo- 
morphism 

g-B^B' 

making the diagram commutative: 



B 


We contend that <p s is a universal object in this category G. 

Proof. Suppose that a/s = a'/s', or in other words that the pairs (a, s) 
and (« a' , s') are equivalent. There exists s x e S such that 

sfs'a — sa') = 0. 

Let /: A -► B be an object of G. Then 

/(^i)[/(5')/W-/(s)/(^)]=0. 

Multiplying by /(sj) -1 , and then by /(s') _1 and /(s)" 1 , we obtain 

Mf(s)- 1 =f(a')f(s')-K 
Consequently, we can define a map 

h:S~'A^B 

such that h(a/s) = f(a)f(s)~ l , for all a/seS~ l A. It is trivially verified that h 
is a homomorphism, and makes the usual diagram commutative. It is also 
trivially verified that such a map h is unique, and hence that cp s is the 
required universal object. 

Let A be an entire ring, and let S be a multiplicative subset which does not 
contain 0. Then 

cp s : A -* S~ X A 

is injective. 

Indeed, by definition, if a/ 1 = 0 then there exists seS such that sa = 0, 
and hence a = 0. 

The most important cases of a multiplicative set S are the following: 

1. Let A be a commutative ring, and let S be the set of invertible 
elements of A (i.e. the set of units). Then S is obviously multiplicative, and is 
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denoted frequently by A*. If A is a field, then A* is the multiplicative group 
of non-zero elements of A. In that case, is simply A itself. 

2. Let A be an entire ring, and let S be the set of non-zero elements of A. 
Then 5 is a multiplicative set, and S _1 A is then a field, called the quotient 
field or the field of fractions, of A. It is then customary to identify A as a 
subset of S -1 i4, and we can write 

a/s = s~ l a 

for a e A and se S. 

We have seen in §3 that when A is an entire ring, then A[X í9 ..., X n ~\ is 
also entire. If K is the quotient field of A, the quotient field of A[X U ...,X n ~\ 
is denoted by K(X Ì9 ..., X n ). An element of K(X Ì9 ..., X n ) is called a rational 
function. A rational function can be written as a quotient f(X)/g(X) where 
/, g are polynomials. If (b ì9 ..., b n ) is in K {n \ and a rational function admits 
an expression as a quotient f/g such that g(b) # 0, then we say that the 
rational function is defìned at (b). From general localization properties, we 
see that when this is the case, we can substitute (b) in the rational function to 
get a value f(b)/g(b). 

3. A ring A is called a local ring if it is commutative and has a unique 
maximal ideal. If A is a local ring and m is its maximal ideal, and x e A, 
x $ m, then x is a unit (othenvise x generates a proper ideal, not contained in m, 
which is impossible). Let A be a ring and p a prime ideal. Let S be the com- 
plement of p in A. Then S is a multiplicative subset of A , and S~ l A is denoted 
by A p . It is a local ring (cf. Exercise 3) and is called the local ring of A at p. Cf. 
the examples of principal rings, and Exercises 15, 16. 

Let S be a multiplicative subset of A. Denote by J(A) the set of ideals of 
A. Then we can define a map 

il/ s :J(A)-+J(S- í A); 

namely we let ij/ s (á) = S~ l a be the subset of consisting of all fractions 
a/s with aea and seS. The reader will easily verify that S _1 û is an 
5 _1 /l-ideal, and that \j/ s is a homomorphism for both the additive and 
multiplicative monoid structures on the set of ideals J(A). Furthermore, i p s 
also preserves intersections and inclusions; in other words, for ideals a, b of 
A we have: 


5 -1 (û + b) = 5 _1 a + 5 -1 b, S -1 (ab) = (5 -1 a)(5 -1 b), 

5 -1 (û n b) = S -1 û n 5 -1 b. 

As an example, we prove this last relation. Let x e a n b. Then x/s is in 
S -1 a and also in S -1 b, so the inclusion is trivial. Conversely, suppose we 
have an element of 5 -1 /l which can be written as a/s = b/s' with ae a, b eb, 
and s, s' e S. Then there exists s x e S such that 

s^s'a = s^sb, 
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and this element lies in both a and b. Hence 

a/s = s^s'a/s^s's 

lies in S _1 (a n b), as was to be shown. 


§5. PRINCIPAL AND FACTORIAL RINGS 

Let A be an entire ring. An element a # 0 is called irreducible if it is not a 
unit, and if whenever one can write a = bc with b e A and c e A then b or c 
is a unit. 

Let a 0 be an element of A and assume that the principal ideal (a) is 
prime. Then a is irreducible. Indeed, if we write a = bc , then b or c lies in 
(a), say b. Then we can write b = ad with some d e A, and hence a = acd. 
Since A is entire, it follows that cd = 1, in other words, that c is a unit. 

The converse of the preceding assertion is not always true. We shall 
discuss under which conditions it is true. An element ae A, a ^ 0, is said to 
have a unique factorization into irreducible elements if there exists a unit u 
and there exist irreducible elements p t (i = 1,..., r) in A such that 

r 

a — u Yl Pí’ 

i =1 

and if given two factorizations into irreducible elements, 

r s 

a =“ n Pi = m ' n % 

i=i j =i 

we have r = s, and after a permutation of the indices i, we have p f = utfi for 
some unit u { in A, i = 1, ..., r. 

We note that if p is irreducible and u is a unit, then up is also irreducible, 
so we must allow multiplication by units in a factorization. In the ring 
of integers Z, the ordering allows us to select a representative irreducible 
element (a prime number) out of two possible ones differing by a unit, 
namely ±p, by selecting the positive one. This is, of course, impossible in 
more general rings. 

Taking r = 0 above, we adopt the convention that a unit of A has a 
factorization into irreducible elements. 

A ring is called factorial (or a unique factorization ring) if it is entire and if 
every element ^ 0 has a unique factorization into irreducible elements. We 
shall prove below that a principal entire ring is factorial. 

Let A be an entire ring and a, b e A, ab # 0. We say that a divides b and 
write a\b if there exists c e A such that ac = b. We say that d e A, d # 0, is a 
greatest common divisor (g.c.d.) of a and b if d\a , d\b , and if any element e 
of A, e =£ 0, which divides both a and b also divides d. 
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Proposition 5.1. Let A be a principal entire ring and a, b e A, a 9 b ¥ 0. 
Let ( a, b) = (c). Then c is a greatest common divisor of a and b. 

Proof. Since b lies in the ideal (c), we can write b = xc for some xe A, 
so that c|ò. Similarly, c\a. Let d divide both a and ò, and write a = dy, 
b = dz with y, z e A. Since c lies in (a, ò) we can write 

c = wa + tb 

with some vv, t e A. Then c = w dy + t dz = d(wy + tz\ whence d\c 9 and our 
proposition is proved. 


Theorem 5.2. Let A be a principal entire ring. Then A is factorial. 

Proof. We first prove that every non-zero element of A has a factoriza- 
tion into irreducible elements. Let S be the set of principal ideals ¥ 0 whose 
generators do not have a factorization into irreducible elements, and suppose 
S is not empty. Let (aj be in S. Consider an ascending chain 

(«i)p(a2)p-- - p(û n )p'“ 

of ideals in S. We contend that such a chain cannot be infinite. Indeed, the 
union of such a chain is an ideal of A, which is principal, say equal to (a). 
The generator a must already lie in some element of the chain, say ( a n ), and 
then we see that (a n ) <= ( a) <= ( a n ), whence the chain stops at (a n ). Hence S is 
inductively ordered, and has a maximal element (a). Therefore any ideal of A 
containing (a) and ¥ (a) has a generator admitting a factorization. 

We note that a n cannot be irreducible (otherwise it has a factorization), 
and hence we can write a = bc with neither b nor c equal to a unit. But then 
(b) ¥ (a) and (c) ¥ (a) and hence both b , c admit factorizations into irreducible 
elements. The product of these factorizations is a factorization for a , contra- 
dicting the assumption that S is not empty. 

To prove uniqueness, we first remark that if p is an irreducible element of 
A and a, b e A, p\ab , then p\a or p\b. Proof: If p\a, then the g.c.d. of p, a 
is 1 and hence we can write 

1 = xp + ya 

with some x, y e A. Then b = bxp + yab, and since p\ab we conclude that 
p\b. 

Suppose that a has two factorizations 

a = Pi • ** P r = <h **' q s 

into irreducible elements. Since p x divides the product farthest to the right, 
Pj divides one of the factors, which we may assume to be q x after renum- 
bering these factors. Then there exists a unit such that q x = u^p^. We 
can now cancel p x from both factorizations and get 
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Pl'" Pr = “i<h '" <L- 
The argument is completed by induction. 

We could call two elements a, b e A equivalent if there exists a unit u 
such that a = bu. Let us select one irreducible element p out of each 
equivalence class belonging to such an irreducible element, and let us denote 
by P the set of such representatives. Let ae A, a # 0. Then there exists a 
unit u and integers v(p) ^ 0, equal to 0 for almost all p e P such that 

a = u Y\ P v{p) • 

peP 

Furthermore, the unit u and the integers v(p) are uniquely determined by a. 
We call v(p) the order of a at p, also written ord p a. 

If A is a factorial ring, then an irreducible element p generates a prime 
ideal (p). Thus in a factorial ring, an irreducible element will also be called a 
prime element, or simply a prime. 

We observe that one can define the notion of least common multiple 
(l.c.m.) of a finite number of non-zero elements of A in the usual manner: If 

a u ..., a n eA 

are such elements, we define a l.c.m. for these elements to be any ce A such 
that for all primes p of A we have 

ord p c = max ord p a t . 

i 

This element c is well defined up to a unit. 

If a, b e A are non-zero elements, we say that a, b are relaively prime if 
the g.c.d. of a and b is a unit. 

Example. The ring of integers Z is factorial. Its group of units consists 
of 1 and —1. It is natural to take as representative prime element the 
positive prime element (what is called a prime number) p from the two 
possible choices p and —p. Similarly, we shall show later that the ring of 
polynomials in one variable over a field is factorial, and one selects represen- 
tatives for the prime elements to be the irreducible polynomials with leading 
coefficient 1. 

Examples. It will be proved in Chapter IV that if R is a factorial ring, 
then the polynomial ring R [À\,..., X n ~\ in n variables is factorial. In partic- 
ular, if fc is a field, then the polynomial ring k[X í9 ..., X n \ is factorial. Note 
that is a principal ring, but for n ^ 2, the ring k[X l9 ..., X n \ is not 

principal. 

In Exercise 5 you will prove that the localization of a factorial ring is 
factorial. 

In Chapter IV, §9 we shall prove that the power series ring 
fc[ [X í9 ..., A" n ]] is factorial. This result is a special case of the more general 
statement that a regular local ring is factorial, but we do not define regular 
local rings in this book. You can look them up in books on commutative 
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algebra. I recommend: 

H. Matsumura, Commutative Algebra , second edition, Benjamin-Cummings, New 
York, 1980 

H. Matsumura, Commutative Rings, Cambridge University Press, Cambridge, 
UK, 1986 

Examples from algebraic and complex geometry. Roughly speaking, reg- 
ular local rings arise in the following context of algebraic or complex geom- 
etry. Consider the ring of regular functions in the neighborhood of some 
point on a complex or algebraic manifold. This ring is regular. A typical 
example is the ring of convergent power series in a neighborhood of 0 in C". 
In Chapter IV, we shall prove some results on power series which give some 
algebraic background for those analytic theories, and which are used in 
proving the factoriality of rings of power series, convergent or not. 

Conversely to the above examples, singularities in geometric theories may 
give rise to examples of non-factoriality. We give examples using notions 
which are sufficiently basic so that readers should have encountered them in 
more elementary courses. 

Examples of non-factorial rings. Let k be a field, and let x be a variable 
over k. Let R = k[x 2 , x 3 ]. Then R is not factorial (proof?). The ring R may 
be viewed as the ring of regular functions on the curve y 2 = x 3 , which has a 
singularity at the origin, as you can see by drawing its real graph. 

Let R be the set of all numbers of the form a + bf^ 5, where a, be Z. 
Then the only units of R are ±1, and the elements 3, 2 + yj—5, 2 — yf—S 
are irreducible elements, giving rise to a non-Unique factorization 

3 2 = (2 + y 3 5)(2 - y=5). 

(Do Exercise 10.) Here the non-factoriality is not due to singularities but 
due to a non-trivial ideal class group of R, which is a Dedekind ring. For a 
definition see the exercises of Chapter III, or go straight to my book 
Algebraic Number Theory , for instance. 

As Trotter once pointed out (Math. Monthly , April 1988), the relation 

sin 2 x = (1 + cos x)(l — cos x) 

may be viewed as a non-unique factorization in the ring of trigonometric 
polynomials R[sin x, cos x], generated over R by the functions sin x and 
cos x. This ring is a subring of the ring of all functions, or of all differenti- 
able functions. See Exercise 11. 


EXERC1SES 

We let A denote a commutative ring. 

1. Suppose that 1 # 0 in A. Let S be a multiplicative subset of A not containing 0. 
Let p be a maximal element in the set of ideals of + whose intersection with S is 
empty. Show that p is prime. 
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2. Let /: A -> Á be a surjective homomorphism of rings, and assume that A is local, 
Á # 0. Show that A' is local. 

3. Let p be a prime ideal of A. Show that A p has a unique maximal ideal, consisting 
of all elements a/s with aep and s $ p. 

4. Let A be a principal ring and S a multiplicative subset with 0 $ S. Show that S~ X A is 
principal. 

5. Let A be a factorial ring and S a multiplicative subset with 0 $ S. Show that S _1 + is 
factorial, and that the prime elements of S~ X A are those primes p of A such that 

(р) n S is empty. 

6. Let A be a factorial ring and p a prime element. Show that the local ring A (p) is 
principal. 

7. Let A be a principal ring and a l9 ...,a H non-zero elements of A. Let 
(a u ..., a n ) = (d). Show that d is a greatest common divisor for the a ( 

(i = 1 

8. Let p be a prime number, and let A be the ring Z/p r Z (r = integer ^ 1). Let G be 
the group of units in A , i.e. the group of integers prime to p, modulo p r . Show 
that G is cyclic, except in the case when 

p = 2, r ^ 3, 

in which case it is of type (2, 2 r ~ 2 ). [ Hint: In the general case, show that G is 
the product of a cyclic group generated by 1 + p, and a cyclic group of order 
p — 1. In the exceptional case, show that G is the product of the group {±1} 
with the cyclic group generated by the residue class of 5 mod 2 r .] 

9. Let i be the complex number y/—ï- Show that the ring Z[i] is principal, and 
hence factorial. What are the units? 

10. Let D be an integer ^ 1, and let R be the set of all element a + byf—D with 
a, b eZ. 

(a) Show that R is a ring. 

(b) Using the fact that complex conjugation is an automorphism of C, show 
that complex conjugation induces an automorphism of R. 

(с) Show that if D ^ 2 then the only units in R are +1. 

(d) Show that 3, 2 + 5, 2 — y/^-5 are irreducible elements in Z^^/^5]. 

11. Let R be the ring of trigonometric polynomials as defined in the text. Show that 
R consists of all functions / on R which have an expression of the form 

n 

f(x) = a 0 + Yj ( a m cos mx + b m sin mx), 

m =1 

where a 0 , a m , b m are real numbers. Defìne the trigonometric degree deg tr (/) to be 
the maximum of the integers r, s such that a r , b 5 # 0. Prove that 

deg tr (fg) = deg tr (/) + dzg tr (g). 

Deduce from this that R has no divisors of 0, and also deduce that the functions 
sin x and 1 — cos x are irreducible elements in that ring. 
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12. Let P be the set of positive integers and R the set of functions defined on P with 
values in a commutative ring K. Define the sum in R to be the ordinary addition 
of functions, and define the convolution product by the formula 

(f*g)(m) = £ f(x)g(y), 

xy=m 

where the sum is taken over all pairs (x, y) of positive integers such that xy = m. 

(a) Show that R is a commutative ring, whose unit element is the function S such 
that <5(1)=1 and S(x) = Oifx^ 1. 

(b) A function / is said to be multiplicative if f(mn) = f(m)f(n) whenever m, n are 
relatively prime. If f g are multiplicative, show that f *g is multiplicative. 

(c) Let g be the Mòbius function such that //(1) = 1, g(p t • * * p r ) = (— l) r if Pu ..., p r 
are distinct primes, and p(m) = 0 if m is divisible by p 2 for some prime p. 
Show that p * (p x = S, where cp^ denotes the constant function having value 
1. [Hint: Show first that p is multiplicative, and then prove the assertion 
for prime powers.] The Môbius inversion formula of elementary number 
theory is then nothing else but the relation p*q>\*f = f 


Dedekind rings 

Prove the following statements about a Dedekind ring o. To simplify terminology, 
by an ideal we shall mean non-zero ideal unless otherwise specified. We let K 
denote the quotient field of o. 

13. Every ideal is finitely generated. [Hint: Given an ideal a, let b be the fractional 
ideal such that ab = o. Write 1 =£û,bi with a { e a and e b. Show that 
o = («!,...,«„).] 

14. Every ideal has a factorization as a product of prime ideals, uniquely determined 
up to permutation. 

15. Suppose o has only one prime ideal p. Let tep and t 4 P 2 - Then p = (t) is 
principal. 

16. Let o be any Dedekind ring. Let p be a prime ideal. Let o p be the local ring at 
p. Then o p is Dedekind and has only one prime ideal. 

17. As for the integers, we say that û|b (û divides b) if ihere exists an ideal c such that 
b = ûc. Prove: 

(a) a|b if and only if b c= a. 

(b) Let û, b be ideals. Then û + b is their greatest common divisor. In particular, 
û, b are relatively prime if and only if û + b = o. 

18. Every prime ideal p is maximal. (Remember, p # 0 by convention.) In particular, 
if p 1? ..., p„ are distinct primes, then the Chinese remainder theorem applies to 
their powers p r /, ..., p r ". Use this to prove: 

19. Let û, b be ideals. Show that there exists an element c e K (the quotient field of 
o) such that ca is an ideal relatively prime to b. In particular, every ideal class in 
Pic(o) contains representative ideals prime to a given ideal. 

For a continuation, see Exercise 7 of Chapter VII. 



CHAPTER III 

Modules 


Although this chapter is logically self-contained and prepares for future topics, 
in practice readers will have had some acquaintance with vector spaces over a 
field. We generalize this notion here to modules over rings. It is a standard fact 
(to be reproved) that a vector space has a basis, but for modules this is not always 
the case. Sometimes they do; most often they do not. We shall look into cases 
where they do. 

For examples of modules and their relations to those which have a basis, the 
reader should look at the comments made at the end of §4. 


§1. BASIC DEFINITIONS 

Let A be a ring. A left module over A , or a left /4-module M is an abelian 
group, usually written additively, together with an operation of A on M (viewing 
A as a multiplicative monoid by RI 2), such that, for all a, be A and x, ye M 
we have 


(a 4- b)x = ax + bx and a(x -F y) = ax + ay. 

We leave it as an exercise to prove that a( — x) = -(ax) and that Ox = 0. By 
definition of an operation, we have lx = x. 

In a similar way, one defines a right /t-module. We shall deal only with left 
^-modules, unless otherwise specified, and hence call these simply /I-modules, 
or even modules if the reference is clear. 
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Let M be an A- module. By a submodule N of M we mean an additive sub- 
group such that AN a N. Then N is a module (with the operation induced by 
that of A on M). 

Examples 

We note that A is a module over itself. 

Any commutative group is a Z-module. 

An additive group consisting of 0 alone is a module over any ring. 

Any left ideal of A is a module over A. 

Let J be a two-sided ideal of A. Then the factor ring A/J is actually a module 
over A. If a e A and a + J is a coset of J in A, then one defines the operation 
to be a(x + J) = ax + J. The reader can verify at once that this defines a module 
structure on A/J. More general, if M is a module and N a submodule, we shall 
define the factor module below. Thus if L is a left ideal of A, then A/L is also 
a module. For more examples in this vein, see §4. 

A module over a field is called a vector space. Even starting with vector 
spaces, one is led to consider modules over rings. Indeed, let V be a vector space 
over the field K. The reader no doubt already knows about linear maps (which 
will be recalled below systematically). Let R be the ring of all linear maps of V 
into itself. Then V is a module over R. Similarly, if V = K n denotes the vector 
space of (vertical) w-tuples of elements of K , and R is the ring of n X n matrices 
with components in K , then V is a module over R. For more comments along 
these lines, see the examples at the end of §2. 

Let S be a non-empty set and M an A-module. Then the set of maps 
Map(S, M) is an A-module. We have already noted previously that it is a com- 
mutative group, and for / e Map(S, M), a e A we define af to be the map 
such that (af)(s) = af(s). The axioms for a module are then trivially verified. 

For further examples, see the end of this section. 

For the rest of this section, we deal with a fixed ring A, and hence may omit 
the prefix A-. 

Let A be an entire ring and let M be an A-module. We define the torsion 
submodule M tor to be the subset of elements x e M such that there exists 
such that ax = 0. It is immediately verified that M tor is a submodule. 
Its structure in an important case will be determined in §7. 

Let a be a left ideal, and M a module. We define ûM to be the set of all 
elements 


ai x i + * • • + a n x n 

with a x e a and x, e M. It is obviously a submodule of M. If a, b are left ideals, 
then we have associativity, namely 


a(bM) = (ab)M. 
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We also have some obvious distributivities, like (a + b)M = ûM + bM. If 
N, N' are submodules of M, then a(N -f N') = aN + aN'. 

Let M be an /l-module, and N a submodule. We shall define a module 
structure on the factor group M/N (for the additive group structure). Let 
x -f N be a coset of N in M, and let ae A. We define a(x -f N) to be the 
coset ax -f N. It is trivial to verify that this is well defined (i.e. if y is in the 
same coset as x, then ay is in the same coset as ax ), and that this is an opera- 
tion of A on M/N satisfying the required condition, making M/N into a 
module, called the factor module of M by N. 

By a module-homomorphism one means a map 

/:M-M' 

of one module into another (over the same ring A\ which is an additive group- 
homomorphism, and such that 


f(ax) = af(x) 

for all a e A and xe M. It is then clear that the collection of ,4-modules is a 
category, whose morphisms are the module-homomorphisms usually also 
called homomorphisms for simplicity, if no confusion is possible. If we wish 
to refer to the ring A, we also say that / is an J-homomorphism, or also that 
it is an A -linear map. 

If M is a module, then the identity map is a homomorphism. For any 
module M', the map Ç: M -> M' such that £(x) = 0 for all x e M is a homo- 
morphism, called zero. 

In the next section, we shall discuss the homomorphisms of a module into 
itself, and as a result we shall give further examples of modules which arise in 
practice. Here we continue to tabulate the translation of basic properties of groups 
to modules. 

Let M be a module and N a submodule. We have the canonical additive 
group-homomorphism 


/:M -+M/N 

and one verifies trivially that it is a module-homomorphism. 

Equally trivially, one verifies that / is universal in the category of homo- 
morphisms of M whose kernel contains N. 

If f :M -► M' is a module-homomorphism , then its hernel and image are 
submodules of M and M' respectively (trivial verification). 

Let/: M —> M' be a homomorphism. By the cokernel of/we mean the factor 
module M'/Im/ = M'//(M). One may also mean the canonical homomorphism 



120 MODULES 


III, §1 


M' M'/f(M) rather than the module itself. The context should make clear 
which is meant. Thus the cokernel is a factor module of M'. 

Canonical homomorphisms discussed in Chapter I, §3 apply to modules 
mutatis mutandis. For the convenience of the reader, we summarise these 
homomorphisms: 

Let N , N' be two submodules of a module M. Then N + N' is also a sub- 
module , and we have an isomorphism 

N/(N n N') « (N + N')/N'. 

IfMzDM'^D M" are modules , then 

(M/M")/(M'/M") % M/M'. 

If f :M -► M' is a module-homomorphism , and N' /s a submodule of M\ then 
f ~ *(N') is a submodule of M and we have a canonical injective homomorphism 

f:M/f-\N')^ M'/N'. 

Iff is surjective , then f is a module-isomorphism. 

The proofs are obtained by verifying that all homomorphisms which ap- 
peared when dealing with abelian groups are now /1-homomorphisms of 
modules. We leave the verification to the reader. 

As with groups, we observe that a module-homomorphism which is bijective 
is a module-isomorphism. Here again, the proof is the same as for groups, 
adding only the observation that the inverse map, which we know is a group- 
isomorphism, actually is a module-isomorphism. Again, we leave the verifica- 
tion to the reader. 

As with abelian groups, we define a sequence of module-homomorphisms 

M' 4 M 4 M" 

to be exact if Im/ = Ker g. We have an exact sequence associated with a 
submodule N of a module M, namely 

0 - N - M - M/N - 0, 

the map of N into M being the inclusion, and the subsequent map being the 
canonical map. The notion of exactness is due to Eilenberg-Steenrod. 

If a homomorphism u : N -> M is such that 

0-^AÍÌM 

is exact, then we also say that u is a monomorphism or an embedding. Dually, 


is exact, we say that u is an epimorphism. 



III, §1 


BASIC DEFINITIONS 121 


Algebras 

There are some things in mathematics which satisfy all the axioms of a ring 
except for the existence of a unit element. We gave the example of L ! (R) in 
Chapter II, §1. There are also some things which do not satisfy associativity, 
but satisfy distributivity. For instance let R be a ring, and for x, y e R define 

the bracket product 

[x, y] = xy - yx. 

Then this bracket product is not associative in most cases when R is not com- 
mutative, but it satisfies the distributive law. 

Examples. A typical example is the ring of differential operators with C 00 
coefficients, operating on the ring of C 00 functions on an open set in R*. The 
bracket product 

[D \, D 2 ] = D x 0 D 2 — D 2 ° D x 

of two differential operators is again a differential operator. In the theory of Lie 
groups, the tangent space at the origin also has such a bracket product. 

Such considerations lead us to define a more general notion than a ring. Let 
A be a commutative ring. Let E, F be modules. By a bilinear map 

g: E x E-+ F 

we mean a map such that given x e £, the map y i-> g(x, y) is A-linear, and 
given y e E, the map jc i-> g(x, y) is A-linear. By an A-algebra we mean a 
module together with a bilinear map g: E x E —> E. We view such a map as a 
law of composition on E. But in this book, unless otherwise specified, we shall 
assume that our algebras are associative and have a unit element. 

Aside from the examples already mentioned, we note that the group ring 
A[G] (or monoid ring when G is a monoid) is an A-algebra, also called the group 
(or monoid) algebra. Actually the group algebra can be viewed as a special 
case of the following situation. 

Let /: A —» B be a ring-homomorphism such that f(A) is contained in the 
center of B, i.e.,f(a) commutes with every element of B for every a e A. Then 
we may view B as an A-module, defining the operation of A on B by the map 

(a, b) i -+f(a)b 

for all a e A and b e B. The axioms for a module are trivially satisfied, and the 
multiplicative law of composition B x B —> B is clearly bilinear (i.e., A-bilinear). 
In this book, unless otherwise specified, by an algebra over A, we shall always 
mean a ring-homomorphism as above. We say that the algebra is finitely gen- 
erated if B is finitely generated as a ring over f(A). 

Several examples of modules over a polynomial algebra or a group algebra 
will be given in the next section, where we also establish the language of 
representations. 
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§2. THE GROUP OF HOMOMORPHISMS 

Let A be a ring, and let X , X' be /4-modules. We denote by Hom A (X', X) 
the set of /l-homomorphisms of X' into X. Then Hom^A"', X) is an abelian 
group, the law of addition being that of addition for mappings into an abelian 
group. 

If A is commutative then we can make Wom A {X\ X) into an 4-module, by 
defining af for ae A and/e Horn^À", X) to be the map such that 

(af)(x) = af(x). 

The verification that the axioms for an /1-module are satisfied is trivial. However, 
if A is not commutative, then we view Hom^A"', X) simply as an abelian group: 

We also view Hom^ as a functor. It is actually a functor of two variables, 
contravariant in the first and covariant in the second. Indeed, let Y be an 
/l-module, and let 

be an /1-homomorphism. Then we get an induced homomorphism 
Hom A (f Y):Hom A (X, Y) - Uom A (X\ Y) 

(reversing the arrow!) given by 

9'-*9°f 

This is illustrated by the following sequence of maps: 

X' 4 x 4 Y. 

The fact that Hom A (f Y) is a homomorphism is simply a rephrasing of the 
property (g t + g 2 ) ° f = °f + g 2 °f which is trivially verified. If / = id, 
then composition with/acts as an identity mapping on g 9 i.e. g © id = g. 

If we have a sequence of +-homomorphisms 

X' - X X\ 

then we get an induced sequence 

Hom A (X\ Y) 4 - Hom A (X, Y) 4 - Hom^A"', Y). 

Proposition 2.1. A sequence 

is exact if and only if the sequence 

Hom A (X\ Y) 4 - Hom A (X 9 Y) 4 - Hom A (X", Y) <- 0 
is exact for all Y. 
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Proof. This is an important fact, whose proof is easy. For instance, 
suppose the first sequence is exact. If g : X" Y is an /1-homomorphism, its 
image in Hom i4 (À’, Y) is obtained by composing g with the surjective map of 
X on X". If this composition is 0, it follows that g = 0 because X -► X” is 
surjective. As another example, consider a homomorphism g: X -► Y such 
that the composition 

X' 4 X 4 Y 

is 0. Then g vanishes on the image of X. Hence we can factor g through the 
factor module, 

X/lmX 


X g ► Y 

Since X -► X" is surjective, we have an isomorphism 

X/lm X <-► X". 

Hence we can factor g through X" , thereby showing that the kernel of 
Hom A (X\ Y) «- Hom A (X 9 Y) 
is contained in the image of 

Horn^A", Y) <- tìom A (X", Y). 

The other conditions needed to verify exactness are left to the reader. So is the 
converse. 

We have a similar situation with respect to the second variable, but then 
the functor is covariant. Thus if X is fixed, and we have a sequence of A- 
homomorphisms 

y' - y - y", 

then we get an induced sequence 

Hom^í^, Y') - Hom A (X 9 Y) -► tìom A (X, Y"). 

Proposition 2.2. A sequence 

0 -> Y' -► Y -+ Y", 

is exact if and only if 

0 tìom A (X, Y') - Hom^í^, Y) - Hom^*, Y") 
is exact for all X. 
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The verification will be left to the reader. It follows at once from the defini- 
tions. 

We note that to say that 


0 Y' Y 

is exact means that Y' is embedded in Y, i.e. is isomorphic to a submodule of 
Y. A homomorphism into Y' can be viewed as a homomorphism into Y if we 
have Y' a Y. This corresponds to the injection 

0 Hom^*, Y') -> Hom^*, Y). 

Let Mod(A) and Mod (B) be the categories of modules over rings A and B , 
and let F: Mod(A) -► Mod(5) be a functor. One says that F is exact if F 
transforms exact sequences into exact sequences. We see that the Hom 
functor in either variable need not be exact if the other variable is kept fixed. 
In a later section, we define conditions under which exactness is preserved. 

Endomorphisms. Let M be an A-module. From the relations 

ÌQi + g 2 )°f = 9i°f + Qi °f 
and its analogue on the right, namely 

Q ° (/i + fi) = Q °fi + Q °fi> 

and the fact that there is an identity for composition, namely id M , we conclude 
that Hom A (M 9 M) is a ring, the multiplication being defined as composition 
of mappings. If n is an integer ^ 1, we can write f n to mean the iteration 
of / with itself n times, and define /° to be id. According to the general 
definition of endomorphisms in a category, we also write End A (M) instead of 
Hom a (M 9 M), and we call End A (M) the ring of endomorphisms. 

Since an A-module M is an abelian group, we see that Hom z (M, M)( = set 
of group-homomorphisms of M into itself) is a ring, and that we could have 
defined an operation of A on M to be a ring-homomorphism A -> Hom Z (M, M). 

Let A be commutative. Then M is a module over End A (M). If R is a subring 
of End^(M) then M is a fortiori a module over R. More generally, let R be a 
ring and let p: R —» En á A (M) be a ring homomorphism. Then p is called a 
representation of R on M. This occurs especially if A = K is a field. The linear 
algebra of representations of a ring will be discussed in Part III, in several 
contexts, mostly finite-dimensional. Infinite-dimensional examples occur in anal- 
ysis, but then the representation theory mixes algebra with analysis, and thus 
goes beyond the level of this course. 

Example. Let K be a field and let V be a vector space over K. Let 
D: V —> V be an endomorphism (A'-linear map). For every polynomial 
P(X) e K[X ], P(X) = 2 a t X l with a t e AT, we can define 
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P(D) = 2 V-* V 

as an endomorphism of V. The association P(X) \—> P(D) gives a representation 

p : K[X] -* End*(V), 

which makes V into a A^XJ-module. It will be shown in Chapter IV that K[X] 
is a principal ring. In §7 we shall give a general structure theorem for modules 
over principal rings, which will be applied to the above example in the context 
of linear algebra for finite-dimensional vector spaces in Chapter XIV, §3. Readers 
acquainted with basic linear algebra from an undergraduate course may wish to 
read Chapter XIV already at this point. 

Examples for infinite-dimensional vector spaces occur in analysis. For 
instance, let V be the vector space of complex-valued C 00 functions on R. Let 
D = d/dt be the derivative (if t is the variable). Then D: VV is a linear map, 
and C[X] has the representation p: C[X] —» End c (V) given by P i-» P(D). A 
similar situation exists in several variables, when we let V be the vector space 
of C 00 functions in n variables on an open set of R". Then we let D, = d/dti be 
the partial derivative with respect to the i-th variable (i = 1,..., n). We obtain 
a representation 

P : C[Xj,. . ., X n ] —> End c (V) 

such that p(Xi) = D t . 

Example. Let H be a Hilbert space and let A be a bounded hermitian oper- 
ator on A. Then one considers the homomorphism R[X] —> R[A] C End(//), 
from the polynomial ring into the algebra of endomorphisms of H , and one 
extends this homomorphism to the algebra of continuous functions on the spec- 
trum of A. Cf. my Real and Functional Analysis , Springer Verlag, 1993. 

Representations form a category as follows. We define a morphism of a 
representation p: R —> End A (M) into a representation p': R —> End A (M') 9 or in 
other words a homomorphism of one representation of R to another, to be 
an A-module homomorphism h: M —> M' such that the following diagram is 
commutative for every a e R: 

M ——► M’ 

p(a) p'(ot) 

M —M' 

In the case when h is an isomorphism, then we may replace the above diagram 
by the commutative diagram 


End^CAÍ) 
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where the symbol [h] denotes conjugation by h, i.e. for/e End^fA/ ) we have 
[h]f= h °/° h~ l . 

Representations: from a monoid to the monoid algebra. Let G be a 

monoid. By a representation of G on an Á-module M, we mean a homomor- 
phism p: G —* End^A/) of G into the multiplicative monoid of End^fA/). Then 
we may extend p to a homomorphism of the monoid algebra 

A[G] End 4 (A/), 

by letting 

p( 2 a x x) = 2 ^pM- 

\JteC / JteG 

It is immediately verified that this extension of ptoA[G] is a ring homomorphism, 
coinciding with the given p on elements of G. 

Examples: modules over a group ring. The next examples will follow a 
certain pattern associated with groups of automorphisms. Quite generally, sup- 
pose we have some category of objects, and to each object K there is associated 
an abelian group F(K ), functorially with respect to isomorphisms. This means 
that if a: K —» K ' is an isomorphism, then there is an associated isomorphism 
F(a): F(K') —» F(K f ) such that F(id) = id and F(ctt) = F(cr) ° F(t). Then the 
group of automorphisms Aut(/f) of an object operates on F(K)\ that is, we have 
a natural homomorphism 

Aut(A) AutíFíA')) given by a h-> F(a). 

Let G = Aut(A'). Then F(K) (written additively) can be made into a module 
over the group ring Z [G] as above. Given an element a = 2 0 (T (T E Z[G], with 
a a e Z, and an element jc e /TA), we define 

ojc = 2 cio-F(a)x. 

The conditions defining a module are trivially satisfied. We list several concrete 
cases from mathematics at large, so there are no holds barred on the terminology. 

Let K be a number field (i.e. a finite extension of the rational numbers). Let 
G be its group of automorphisms. Associated with K we have the following 
objects: 

the ring of algebraic integers o^; 
the group of units o^; 
the group of ideal classes C(K)\ 
the group of roots of unity p,(A). 

Then G operates on each of those objects, and one problem is to determine the 
structure of these objects as Z[G]-modules. Already for cyclotomic fields this 
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determination gives rise to substantial theories and to a number of unsolved 
problems. 

Suppose that K is a Galois extension of k with Galois group G (see Chapter 
VI). Then we may view K itself as a module over the group ring k[G]. In Chapter 
VI, §13 we shall prove that K is isomorphic to k[G] as module over k[G] itself. 

In topology, one considers a spaceX 0 and a finite coveringX. Then Aut(X/X 0 ) 
operates on the homology of X, so this homology is a module over the group 
ring. 

With more structure, suppose thatX is a projective non-singular variety, say 
over the complex numbers. Then to X we can associate: 

the group of divisor classes (Picard group) Pic(X); 

in a given dimension, the group of cycle classes or Chow group CH^(X); 

the ordinary homology of X\ 

the sheaf cohomology in general. 

If X is defined over a field K finitely generated over the rationals, we can 
associate a fancier cohomology defined algebraically by Grothendieck, and func- 
torial with respect to the operation of Galois groups. 

Then again all these objects can be viewed as modules over the group ring 
of automorphism groups, and major problems of mathematics consist in deter- 
mining their structure. I direct the reader here to two surveys, which contain 
extensive bibliographies. 

[CCFT 91] P. Cassou-Nogues, T. Chinburg, A. Fróhlich, M. J. Taylor, 
L-functions and Galois modules, in L-functions and Arithmetic J. Coates 
and M. J. Taylor (eds.), Proceedings of the Durham Symposium July 1989, 
London Math, Soc. Lecture Note Series 153, Cambridge University Press 
(1991), pp. 75-139 

[La 82] S. Lang, Units and class groups in number theory and algebraic geometry, 
Bull. AMS Vol. 6 No. 3 (1982), pp. 253-316 


§3. DIRECT PRODUCTS AND 
SUMS OF MODULES 

Let A be a ring. Let {Mj /e/ be a family of modules. We defined their direct 
product as abelian groups in Chapter I, §9. Given an element (xf)^ of the direct 
product, and a e A, we define a{x t ) = (axf). In other words, we multiply by an 
element a componentwise. Then the direct product FIm, is an A-module. The 
reader will verify at once that it is also a direct product in the category of 
A-modules. 
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Similarly, let 

M = © M, 

ie/ 

be their direct sum as abelian groups. We define on M a structure of /1-module: 
If (Xi) içI is an element of M, i.e. a family of elements x t e such that x, = 0 
for almost all /, and if a e A, then we define 

<^(^i)iel (^i)ie/» 

that is we define multiplication by a componentwise. It is trivially verified that 
this is an operation of A on M which makes M into an ,4-module. If one refers 
back to the proof given for the existence of direct sums in the category of abelian 
groups, one sees immediately that this proof now extends in the same way to 
show that M is a direct sum of the family {M f } ie/ as /l-modules. (For instance, 
the map 

Xj: Mj M 

such that Xj(x) has j- th component equal to x and i-th component equal to 0 
for i # j is now seen to be an ,4-homomorphism.) 

This direct sum is a coproduct in the category of A-modules. Indeed, 
the reader can verify at once that given a family of A-homomorphisms 
{fj \ M t —> N}, the map / defined as in the proof for abelian groups is also an A- 
isomorphism and has the required properties. See Proposition 7.1 of Chapter I. 

When I is a finite set, there is a useful criterion for a module to be a direct 
product. 

Proposition 3.1. Let M be an A-module and n an integer ^ 1. For each 
i = 1,..., n let cpi'. M -+ M be an A-homomorphism such that 

n 

X <Pi = ' d and <Pi °<pj = 0 if i / j- 

i = 1 

Then (pf = (Pifor all i. Let M, = (pfM ), and let q>\ M -► f] be such that 
<P(x) = ((Pi(x), ...,(p n (x)). 

Then (p is an A-isomorphism of M onto the direct product f] M f . 

Proof. For each j, we have 

n 

<Pj = <Pj ° id = <Pj ° YVi^ <Pj ° <Pj = VÌ- 

i = 1 

thereby proving the first assertion. It is clear that (p is an A-homomorphism. 
Let x be in its kernel. Since 


n 


X = id(x) = X <Pi ( x ) 
í= 1 
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we conclude that x = 0, so (p is injective. Given elements y t e M t for each 
i = 1, ..., n , let x = y x + • • • -f y n . We obviously have <p/y,) = 0 if i # j. 
Hence 


<?/*) = 


for each; = 1,..., n. This proves that (p is surjective, and concludes the proof 
of our proposition. 

We observe that when / is a finite set, the direct sum and the direct product 
are equal. 

Just as with abelian groups, we use the symbol © to denote direct sum. 

Let M be a module over a ring A and let S be a subset of M. By a linear 
combination of elements of S (with coefficients in A) one means a sum 

Z a * x 

xeS 

where {a x } is a set of elements of A, almost all of which are equal to 0. These 
elements a x are called the coefficients of the linear combination. Let N be 
the set of all linear combinations of elements of S. Then N is a submodule of 
M, for if 


Z a x x and Z b * x 

xeS x e S 

are two linear combinations, then their sum is equal to 

Z ( a x + b x )x, 

xeS 

and if ce A, then 

c( Z fl x x ) = £ca x x, 

\xeS j xeS 

and these elements are again linear combinations of elements of S. We shall call 
N the submodule generated by S, and we call 5 a set of generators for N. We 
sometimeswrite N = A(S ). If5consistsofoneelementx,themodulegenerated 
by x is also written Ax , or simply (x), and sometimes we say that (x) is aprincipal 
module. 

A module M is said to be finitely generated, or of finite type, or finite over 
A, if it has a finite number of generators. 

A subset S of a module M is said to be linearly independent (over A) if when- 
ever we have a linear combination 


Z a * x 

xeS 
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which is equal to 0, then a x = 0 for all xeS. If S is linearly independent and if 
two linear combinations 


Y a * x and X b * x 

are equal, then = b x for all xe5. Indeed, subtracting one from the other 
yields £ (a x — b x )x = 0, whence a x - b x = 0 for all x. If S is linearly indepen- 
dent we shall also say that its elements are linearly independent. Similarly, a 
family {x,} ie/ of elements of M is said to be linearly independent if whenever we 
have a linear combination 


X a i x i = 

ie / 

then a t = 0 for all i. A subset S (resp. a family {x,}) is called linearly dependent 
if it is not linearly independent, i.e. if there exists a relation 

X a x x = 0 resp. £ a i x i = 0 

x e S iel 

with not all a x (resp. a^ = 0. Warning. Let x be a single element of M which 

is linearly independent. Then the family {Xi} i=ì .„ such that x, = x for all i 

is linearly dependent if n > 1, but the set consisting of x itself is linearly inde- 
pendent. 

Let M be an ,4-module, and let {Mi} i€l be a family of submodules. Since 
we have inclusion-homomorphisms 

A,-! Mi —► M 

we have an induced homomorphism 

which is such that for any family of elements (Xi) ie/ , all but a finite number of 
which are 0, we have 


>i*(( x .)) = Z x ì - 

ÎG/ 

If A* is an isomorphism, then we say that the family {Mj} íe/ is a direct sum 
decomposition of M. This is obviously equivalent to saying that every element 
of M has a unique expression as a sum 

I*. 

with x f e Mj, and almost all x f = 0. By abuse of notation, we also write 

M = © M t 


in this case. 
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If the family {MJ is such that every element of M has some expression as a 
sum Y, x i ( not necessarily unique), then we write M = £ Mj. In any case, if 
{MJ is an arbitrary family of submodules, the image of the homomorphism A* 
above is a submodule of M, which will be denoted by M, . 

If M is a module and N , N' are two submodules such that N N' = M 
and N n N' = 0, then we have a module-isomorphism 

M « N 0 N', 

just as with abelian groups, and similarly with a finite number of submodules. 

We note, of course, that our discussion of abelian groups is a special case 
of our discussion of modules, simply by viewing abelian groups as modules 
over Z. However, it seems usually desirable (albeit inefficient) to develop first 
some statements for abelian groups, and then point out that they are valid 
(obviously) for modules in general. 

Let M, M', N be modules. Then we have an isomorphism of abelian groups 


Hom a (M ® M ', N) A Hom A (M, N) x Hom^M', N\ 


and similarly 


Hom Á (N, M x M') A Hom A (N 9 M) x Hom Á (N, M'). 


The first one is obtained as follows. If/: M © M' -► N is a homomorphism, 
then/induces a homomorphism f x : M -► N and a homomorphism f 2 : M' -► N 
by composing / with the injections of M and M' into their direct sum re- 
spectively: 

M-»M®{0)cM®M'ÌN, 

M’ -► {0} © M' c M © M' 4 /V. 

We leave it to the reader to verify that the association 

/^(/.,/2) 

gives an isomorphism as in the first box. The isomorphism in the second box 
is obtained in a similar way. Given homomorphisms 

/i: N M 


and 


f 2 :N^M' 
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we have a homomorphism /: Af -► M x M' defined by 

/(*) = (/i(x),/ 2 (x)). 

It is trivial to verify that the association 

(fufi)^f 

gives an isomorphism as in the second box. 

Of course, the direct sum and direct product of two modules are isomorphic, 
but we distinguished them in the notation for the sake of functoriality, and to 
fit the infinite case, see Exercise 22. 

Proposition 3.2. Let 0 -> M' M M" 0 be an exact sequence of 
modules. The folloxving conditions are equivalent: 

1. There exists a homomorphism cp: M" -* M such that g ° cp = id. 

2. There exists a homomorphism \j/ : M -> M' such that \j/ °f = id. 

Ifthese conditions are satisfied , then we have isomorphisms: 

M = Im/® Ker \j/, M = Ker g © Im cp, 

M * M' 0 M". 

Proof Let us write the homomorphisms on the right: 

Mè-M" -* 0. 

<P 

Let xe M. Then 


X - íp(g(x)) 

is in the kernel of g , and hence M = Ker g + Im cp. 
This sum is direct, for if 


x = y + z 

with yeKer g and zelm cp, z = cp(w) with vveM", and applying g yields 
g(x) = w. Thus vv is uniquely determined by x, and therefore z is uniquely 
determined by x. Hence so is y, thereby proving the sum is direct. 

The arguments concerning the other side of the sequence are similar and 
will be left as exercises, as well as the equivalence between our conditions. When 
these conditions are satisfied, the exact sequence of Proposition 3.2 is said to 
split. One also says that \j/ splits / and <p splits g. 
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Abelian categories 

Much in the theory of modules over a ring is arrow-theoretic. In fact, one 
needs only the notion of kernel and cokernel (factor modules). One can axi- 
omatize the special notion of a category in which many of the arguments are 
valid, especially the arguments used in this chapter. Thus we give this axi- 
omatization now, although for concreteness, at the beginning of the chapter, 
we continue to use the language of modules. Readers should strike their own 
balance when they want to slide into the more general framework. 

Consider first a category (2 such that Mor(£, F ) is an abelian group for 
each pair of objects £, F of û, satisfying the following two conditions: 

AB 1. The law of composition of morphisms is bilinear, and there exists 
a zero object 0, i.e. such that Mor(0, E) and Mor(£, 0) have precisely 
one element for each object E. 

AB 2. Finite products and finite coproducts exist in the category. 

Then we say that (2 is an additive category. 

Given a morphism E -4 F in C2, we define a kernel of/ to be a morphism 
E' -+ E such that for all objects X in the category, the following sequence is 
exact: 

0 - Mor(*, E') -► Mor(X, E) - Mor(*, F). 

We define a cokernel for/to be a morphism F -> F" such that for all objects X 
in the category, the following sequence is exact: 

0 Mor(F", X) Mor(F, X) Mor(F, X). 

It is immediately verified that kernels and cokernels are universal in a suitable 
category, and hence uniquely determined up to a unique isomorphism if they 
exist. 

AB 3. Kernels and cokernels exist. 

AB 4. If j : E -► F is a morphism whose kernel is 0, then j is the kernel 
of its cokernel. If /: E -> F is a morphism whose cokernel is 0, 
then / is the cokernel of its kernel. A morphism whose kernel 
and cokernel are 0 is an isomorphism. 

A category (2 satisfying the above four axioms is called an abelian category. 
In an abelian caegory, the group of morphisms is usually denoted by Hom, 
so for two objects F, F we write 

Mor(£, F) = Hom(F, F). 

The morphisms are usually called homomorphisms Given an exact sequence 


0 -► M' -> M, 
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we say that M' is a subobject of M, or that the homomorphism of M' into M is a 
monomorphism . Dually, in an exact sequence 

M - M" - 0, 

we say that M" is a quotient object of M, or that the homomorphism of M to 
M" is an epimorphism, instead of saying that it is surjective as in the category of 
modules. Although it is convenient to think of modules and abelian groups to 
construct proofs, usually such proofs will involve only arrow-theoretic argu- 
ments, and will therefore apply to any abelian category. However, all the abelian 
categories we shall meet in this book will have elements, and the kernels and 
cokernels will be defined in a natural fashion, close to those for modules, so 
readers may restrict their attention to these concrete cases. 

Examples of abelian categories. Of course, modules over a ring form an 
abelian category, the most common one. Finitely generated modules over a 
Noetherian ring form an abelian category, to be studied in Chapter X. 

Let k be a field. We consider pairs ( V , A) consisting of a finite-dimensional 
vector space V over k , and an endomorphism A: V —» V. By a homomorphism 
(morphism) of such pairs /: (V, A) —» ( W , B) we mean a £-homomorphism 
f: V W such that the following diagram is commutative: 



It is routinely verified that such pairs and the above defined morphisms form an 
abelian category. Its elements will be studied in Chapter XIV. 

Let k be a field and let G be a group. Let Mod*(G) be the category of finite- 
dimensional vector spaces V over k , with an operation of G on V, i.e. a homo- 
morphism G —» Aut*(V). A homomorphism (morphism) in that category is a k- 
homomorphism f: V W such that f(ax) = af(x) for all x e V and AEG.lt 
is immediate that Mod*(G) is an abelian category. This category will be studied 
especially in Chapter XVIII. 

In Chapter XX, §1 we shall consider the category of complexes of modules 
over a ring. This category of complexes is an abelian category. 

In topology and differential geometry, the category of vector bundles over 
a topological space is an abelian category. 

Sheaves of abelian groups over a topological space form an abelian category, 
which will be defined in Chapter XX, §6. 




III, §4 


FREE MODULES 135 


§4. FREE MODULES 

Let M be a module over a ring A and let S be a subset of M. We shall say that 
5 is a basis of M if S is not empty, if S generates M, and if S is linearly independent. 
If S is a basis of M, then in particular M # {0} if A # {0} and every element of 
M has a unique expression as a linear combination of elements of S. Similarly, 
let {x,} le/ be a non-empty family of elements of M. We say that it is a basis of 
M if it is linearly independent and generates M. 

If A is a ring, then as a module over itself, A admits a basis, consisting of the 
unit element 1. 

Let / be a non-empty set, and for each i ê I, let A { — A, viewed as an A- 
module. Let 

F=®A,. 

Ì€l 

Then F admits a basis, which consists of the elements e { of F whose i-th com- 
ponent is the unit element of A x , and having all other components equal to 0. 

By afree module we shall mean a module which admits a basis, or the zero 
module. 

Theorem 4.1. Let Abe a ring and M a module over A. Let I be a non-empty 
set, and let {xJ IG/ be a basis of M. Let N be an A-module, and let {yi} ieI 
be a family of elements of N. Then there exists a unique homomorphism 
f:M-*N such that /(x,) = y x for all i. 

Proof. Let x be an element of M. There exists a unique family {a f } ie/ of 
elements of A such that 

X = la iXi . 
i € I 


We define 

f(x) = 'Z a t yt. 

It is then clear that /is a homomorphism satisfying our requirements, and 
that it is the unique such, because we must have 

/(x) = Y a i f ( x i)- 

Corollary 4.2. Let the notation be as in the theorem, and assume that {yi} ieI 
is a basis of N. Then the homomorphism f is an isomorphism, i.e. a module- 
isomorphism. 

Proof. By symmetry, there exists a unique homomorphism 


g : N -> M 
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such that g(yi ) = x f for all i, and f°g and g °f are the respective identity map- 
pings. 

Corollary 4.3. Two modules having bases whose cardinalities are equaì are 
isomorphic. 

Proof Clear. 

We shall leave the proofs of the following statements as exercises. 

Let M be a free module over A , with basis {x,} ie/ , so that 

M = @ Axì. 

i e / 

Let û be a two sided ideal of A. Then aM is a submodule of M. Each ox/ is a 
submodule of Ax/. /zave an isomorphism (of A-modules) 


M/aM « @ Axjaxi. 

i e I 


Furthermore, each /txi/ax/ is isomorphic to A/a, as /1-module. 

Suppose in addition that A is commutative. Then A/a is a ring. Furthermore 
M/aM is afree module over A/a , and each AxJaXi is free over A/a. Ifx t is the 
image of x t under the canonical homomorphism 


Axì -► AxJaXi , 

then the single element x, is a basis of Axjax^ over A/a. 

All of these statements should be easily verified by the reader. Now let A be 
an arbitrary commutative ring. A module M is called principal if there exists 
an element x e M such that M = Ax. The map 

a i-> ax (for a e A) 

is an A-module homomorphism of A onto M, whose kernel is a left ideal a, and 
inducing an isomorphism of A-modules 

A/a ® M. 

Let M be a finitely generated module, with generators {v x ,. . . , v n }. Let F 
be a free module with basis {e Xì . . ., e n }. Then there is a unique surjective 
homomorphism/: F M such that/(e z ) = v t . The kernel of/is a submodule 
Mj. Under certain conditions, M x is finitely generated ( cf. Chapter X, §1 on 
Noetherian rings), and the process can be continued. The systematic study of 
this process will be carried out in the chapters on resolutions of modules and 
homology. 
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Of course, even if M is not fìnitely generated, one can carry out a similar 
construction, by using an arbitrary indexing set. Indeed, let {v t } (/ e /) be a family 
of generators. For each /, let F t be free with basis consisting of a single element 
e i9 so Fi ~ A. Let F be the direct sum of the modules F t (i e /), as in Proposi- 
tion 3.1. Then we obtain a surjective homomorphism/: F A/ such that 
/(£/) = v t . Thus every module is a factor module of a free module. 

Just as we did for abelian groups in Chapter I, §7, we can also defìne the 
free module over a ring A generated by a non-empty set S. We let A(S) be the 
set of functions <p : S —> A such that <p(x) = 0 for almost all x e S. If a e A and 

x e 5, we denote by ax the map cp such that <p(x) = a and <p(y) = 0 for y =£ x. 
Then as for abelian groups, given <p e A(S) there exist elements a t e A and 

xi e S such that 

<p = a x x x + • • • + a n x n . 

It is immediately verified that the family of functions {&J (x e S) such that 
8*(jc) = 1 and â/y) = 0 fory x form a basis for A(S). In other words, the ex- 
pression of <p as 2 tf/*, above is unique. This construction can be applied 
when S is a group or a monoid G, and gives rise to the group algebra as in 
Chapter II, §5. 


Projective modules 

There exists another important type of module closely related to free modules, 
which we now discuss. 

Let A be a ring and P a module. The following properties are equivalent, 
and define what it means for P to be a projective module. 

P 1. Given a homomorphism /: P -► M" and surjective homomorphism 
g : M M", there exists a homomorphism h : P -+ M making the 
following diagram commutative. 



P 2. Every exact sequence 0 -+ M' M" -* P -+ 0 splits. 

P 3. There exists a module M such that P © M is free, or in words, P is a 
direct summand of a free module. 

P 4. The functor M i—► Hom^P, M) is exact. 

We prove the equivalence of the four conditions. 
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Assume P 1. Given the exact sequence of P 2, we consider the map / = id 
in the diagram 


P 

id 

M"- >P ->0 

Then h gives the desired splitting of the sequence. 

Assume P 2. Then represent P as a quotient of a free module (cf. Exercise 1) 
F -► P -► 0, and apply P 2 to this sequence to get the desired splitting, which 
represents F as a direct sum of P and some module. 

Assume P3. Since Horn^X © T, M) = Hom A (X, M) © Horn^Y, M), 
and since M i-> Hom^F, M) is an exact functor if F is free, it follows that 
Hom^(P, M) is exact when P is a direct summand of a free module, which proves 
P4. 

Assume P 4. The proof of P 1 will be left as an exercise. 

Examples. It will be proved in the next section that a vector space over a 
field is always free, i.e. has a basis. Under certain circumstances, it is a theorem 
that projective modules are free. In §7 we shall prove that a finitely generated 
projective module over a principal ring is free. In Chapter X, Theorem 4.4 we 
shall prove that such a module over a local ring is free; in Chapter XVI, Theo- 
rem 3.8 we shall prove that a finite flat module over a local ring is free; and in 
Chapter XXI, Theorem 3.7, we shall prove the Quillen-Suslin theorem that 
if A = k[X Ì9 . . . , X n ] is the polynomial ring over a field k 9 then every finite pro- 
jective module over A is free. 

Projective modules give rise to the Grothendieck group. Let A be a ring. 
Isomorphism classes of finite projective modules form a monoid. Indeed, if P 
is finite projective, let [P] denote its isomorphism class. We define 

[P] + [Q] = [P®QY 

This sum is independent of the choice of representatives P 9 Q in their class. The 
conditions defining a monoid are immediately verified. The corresponding Groth- 
endieck group is denoted by K(A). 

We can impose a further equivalence relation that P is equivalent to P' if 
there exist finite free modules F and F' such that P © F is isomorphic to 
P' © F'. Under this equivalence relation we obtain another group denoted by 
K 0 (A). If A is a Dedekind ring (Chapter II, §1 and Exercises 13-19) it can be 
shown that this group is isomorphic in a natural way with the group of ideal 
classes Pic(A) (defined in Chapter II, §1). See Exercises 11, 12, 13. It is also a 




III, §5 


VECTOR SPACES 139 


problem to determine AT 0 (A) for as many rings as possible, as explicitly as pos- 
sible. Algebraic number theory is concemed with Af 0 (A) when A is the ring of 
algebraic integers of a number field. The Quillen-Suslin theorem shows if A is 
the polynomial ring as above, then Â^A) is trivial. 

Of course one can carry out a similar constmction with all finite modules. 
Let [M] denote the isomorphism class of a finite module M. We define the sum 
to be the direct sum. Then the isomorphism classes of modules over the ring 
form a monoid, and we can associate to this monoid its Grothendieck group. 
This construction is applied especially when the ring is commutative. There are 
many variations on this theme. See for instance the book by Bass, Algebraic 
K-theory , Benjamin, 1968. 

There is a variation of the definition of Grothendieck group as follows. Let 
F be the free abelian group generated by isomorphism classes of finite modules 
over a ring R , or of modules of bounded cardinality so that we deal with sets. 
In this free abelian group we let T be the subgroup generated by all elements 

[M] - [M'] - [M"] 

for which there exists an exact sequence 0 —> M' —> M —> M" —> 0. The factor 
group F/T is called the Grothendieck group K(R). We shall meet this group 
again in §8, andinChapterXX, §3. Note that we may form a similarGrothendieck 
group with any family of modules such that M is in the family if and only if M' 
and M" are in the family. Taking for the family finite projective modules, one 
sees easily that the two possible definitions of the Grothendieck group coincide 
in that case. 


§5. VECTOR SPACES 

A module over a field is called a vector space. 

TheoremS.l. Let V be a vector space over a field K , and assume that 
V # {0}. Let T be a set of generators of V over K and let Sbe a subset ofT 
which is linearly independent. Then there exists a basis (R of V such that 

s c ffl c r. 

Proof. Let X be the set whose elements are subsets T of T which contain S 
and are linearly independent. Then ï is not empty (it contains 5), and we 
contend that X is inductively ordered. Indeed, if {7^} is a totally ordered subset 
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of Z (by ascending inclusion), then (J 7] is again linearly independent and con- 
tains S. By Zorn’s lemma, let ® be a maximal element of X. Then (B is linearly 
independent. Let W be the subspace of V generated by ®. If W # V, there 
exists some element xeT such that x $ W. Then ® u {x} is linearly inde- 
pendent, for given a linear combination 

X a y y + bx = 0, a y9 be K, 

yei b 

we must have b = 0, otherwise we get 

x = - ^b~ l a y yeW. 

ye ® 

By construction, we now see that a y = 0 for all ye®, thereby proving that 
® u {x} is linearly independent, and contradicting the maximality of ®. It 
follows that W = V, and furthermore that ® is not empty since V + {0}. This 
proves our theorem. 

If V is a vector space # {0}, then in particular, we see that every set of 
linearly independent elements of V can be extended to a basis, and that a basis 
may be selected from a given set of generators. 

Theorem 5.2. Let V be a vector space over a field K. Then two bases of V 
over K have the same cardinality. 

Proof. Let us first assume that there exists a basis of V with a finite 
number of elements, say {v ì9 ..., v m }, m ^ 1. We shall prove that any other 
basis must also have m elements. For this it will suffice to prove: If w l5 ..., w n 
are elements of V which are linearly independent over K, then n ^ m (for 
we can then use symmetry). We proceed by induction. There exist elements 
Cj,..., c m of K such that 

(1) VV, = C(U j + ••• + c m v m , 

and some c i9 say c ì9 is not equal to 0. Then v t lies in the space generated 
by w l5 v 2 , . •., v m over K, and this space must therefore be equal to V itself. 
Furthermore, w í9 v 2 ,-.. 9 v m are linearly independent, for suppose b ì9 ... ,b m 
are elements of K such that 


bjWj + b 2 v 2 + ••• + b m v m = 0. 

If b x #0, divide by b x and express Wj as a linear combination of v l9 ..., v m . 
Subtracting from (1) would yield a relation of linear dependence among the 
v t , which is impossible. Hence b x = 0, and again we must have all b t = 0 
because the v ( are linearly independent. 
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Suppose inductively that after a suitable renumbering of the v i9 we have 
found w í9 ... ,w r (r < n ) such that 

(Wi, • • •, w r , v r + 1 , •.., v m } 

is a basis of V. We express w r+ì as a linear combination 

(2) w r+1 = £\Wi -f ••• + c r w r + c r+1 i; r+1 + ... + c m v m 

with Ci e K. The coeíïìcients of the v t in this relation cannot all be 0; otherwise 
there would be a linear dependence among the Wj. Say c r+1 # 0. Using an 
argument similar to that used above, we can replace r r+ iby w r+1 and still have 
a basis of V. This means that we can repeat the procedure until r = n, and 
therefore that n ^ m, thereby proving our theorem. 

We shall leave the general case of an infinite basis as an exercise to the 
reader. [Hint : Use the fact that a finite number of elements in one basis is 
contained in the space generated by a finite number of elements in another basis.] 

If a vector space V admits one basis with a finite number of elements, say m , 
then we shall say that V is finite dimensional and that m is its dimension. In 
view of Theorem 5.2, we see that m is the number of elements in any basis 
of V. If V = {0}, then we define its dimension to be 0, and say that V is 
0-dimensional. We abbreviate “dimension” by “dim” or “dim x ” if the 
reference to K is needed for clarity. 

When dealing with vector spaces over a field, we use the words subspace 
and factor space instead of submodule and factor module. 

Theorem 5.3. Let V bea vector space over a field K , and let W be a subspace. 

Then 


dim* V = dim* W + dim K V/W. 

If f: V -► U is a homomorphism of vector spaces over K , then 
dim V = dim Ker / + dim Im/ 

Proof. The first statement is a special case of the second, taking for / the 
canonical map. Let {Ui} ieI be a basis of Im /, and let {Wj} jeJ be a basis of 
Ker /. Let {^} ie / be a family of elements of V such that /( v ( ) = u ( for each 
i e I. We contend that 


{Vi,Wj}u, Je j 


is a basis for V. This will obviously prove our assertion. 
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Let x be an element of V. Then there exist elements {ai} ieI of K almost 
all of which are 0 such that 

/(*) = !«!«<• 
ie / 

Hence/(x - Y a,”,) = f(x) - £ a,/(i>,) = 0. Thus 

X - 


is in the kernel of/, and there exist elements {bj} j€j of K almost all of which are 
0 such that 

X - Z a > v i = Z b i w i- 

From this we see that x = Y, a > v > + Y b j w j> ant ^ that {íi,, vv/ generates V. 
It remains to be shown that the family {v i9 Wj} is linearly independent. Suppose 
that there exist elements c i9 dj such that 

0 = 1 CiVi + Y d i w j■ 

Applying / yields 

0 = 1 Cif(vi) = Y CiU h 

whence all = 0. From this we conclude at once that all dj = 0, and hence that 
our family {v i9 Wj} is a basis for V over K , as was to be shown. 

Corollary 5.4. Let V be a vector space and W a subspace. Then 

dim W ^ dim V. 

If V is finite dimensional and dim W = dim V then W = V. 

Proof. Clear. 


§6. THE DUAL SPACE AND DUAL MODULE 

Let E be a free module over a commutative ring A. We view A as a free 
module of rank 1 over itself. By the dual module E y of E we shall mean the 
module HomíF, A). Its elements will be called functionals. Thus a functional 
on E is an A-linear map f:E-*A. If x e E and/ e £ v , we sometimes denote 
f(x) by (x,f). Keeping x fixed, we see that the symbol (x 9 f) as a function of 
/e £ v is A-linear in its second argument, and hence that x induces a linear map 
on £ v , which is 0 if and only if x = 0. Hence we get an injection E —> £ vv 
which is not always a surjection. 
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Let {jc z } /e/ be a basis of E. For each i e / let/• be the unique functional such 
that = Sij (in other words, 1 if i = j and 0 if i =£ j). Such a linear map 
exists by general properties of bases (Theorem 4.1). 

Theorem 6.1. Let E be a finite free module over the commutative ring A , 
offinite dimension n. Then £ v is also free , and dim £ v = n. If {x ìt . .., x„f 
is a basis for E , and f is the functional such that f(xf) = 8^, then 
/j a basis for £ v . 

Proof. Let/e £ v and let a z = /(x z ) (/ = 1,..., n). We have 

f(c\x i + ' ' ' + = cj(x x ) + * * • + c n f(x n ). 

Hence/ = a x f x + • • • + a n f n , and we see that the f generate E y . Furthermore, 
they are linearly independent, for if 

bxfx + ' • • + bj n = 0 

with bi e AT, then evaluating the left-hand side on x z yields 

bif(Xi) = 0, 

whence b t = 0 for all i. This proves our theorem. 

Given a basis {jc z } (i = 1,..., n) as in the theorem, we call the basis {/•} 
the dual basis. In terms of these bases, we can express an element A of E with 
coordinates (a ìt . . ., a n ), and an element B of E v with coordinates (b ì9 .. . , b n ), 
such that 


A = a^Xi + • • • + a n x„, B = b ,/, + •••+ £>„/„■ 

Then in terms of these coordinates, we see that 

(A, B) = a x b x + • • • + a n b n = A • B 
is the usual dot product of n-tuples. 

Corollary 6.2. When E is free finite dimensional, then the map E —> £ vv 
which to each x e V associates the functional /h-> (jc,/) on £ v ij an isomorphism 
of E onto E vy . 

Proof. Note that since {f lt , f n ) is a basis for E y , it follows from the 
definitions that {x x ,. . . , Jt„} is the dual basis in E, so E = £ vv . 

Theorem 6.3. Let U t V t W be finite free modules over the commutative ring 
A t and let 

0 -*■ W-^ V -4 u^> 0 

be an exact sequence of A-homomorphisms. Then the induced sequence 
0 -> Hom \ A (U, A) -> Hom^CV, A) -* Hom^ílV, A) —> 0 



144 MODULES 


III. §6 


i.e. 


0 


f/v —> V v -> W v -> 0 


is also exact. 

Proof. This is a consequence of P2, because a free module is projective. 


We now consider properties which have specifically to do with vector spaces, 
because we are going to take factor spaces. So we assume that we deal with 
vector spaces over a field K. 

Let V, V' be two vector spaces, and suppose given a mapping 

V x V' -* K 


denoted by 

(x, x') i—► <x, x') 

for xe V and x' e V'. We call the mapping bilinear if for each x e V the function 
x' I— ► <x, x') is linear, and similarly for each x' e V' the function x i—► <x, x') is 
linear. An element x e V is said to be orthogonal (or perpendicular) to a subset 
S' of V' if <x, x') = 0 for all x'eS'. We make a similar definition in the 
opposite direction. It is clear that the set of x e V orthogonal to S' is a sub- 
space of V. 

We define the kernel of the bilinear map on the left to be the subspace of V 
which is orthogonal to V', and similarly for the kernel on the right. 

Given a bilinear map as above, 

V x V' -► K, 

let W' be its kernel on the right and let W be its kernel on the left. Let x' be 
an element of V'. Then x' gives rise to a functional on V, by the rule x i—► <x, x'), 
and this functional obviously depends only on the coset of x' modulo W'\ in 
other words, if x^ = x' 2 (mod W'\ then the functionals xi—► <x, x^) and 
x <x, x' 2 > are equal. Hence we get a homomorphism 

V' -> V v 

whose kernel is precisely W' by definition, whence an injective homomorphism 

0-* V'/W' V v . 

Since all the functionals arising from elements of V' vanish on W 9 we can view 
them as functionals on V/W , i.e. as elements of (V/W) v . So we actually get an 
injective homomorphism 

0 V'/W' -» (V/W) v . 

One could give a name to the homomorphism 


9 : V' -* V v 
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such that 


<x, x') = <x, g(x')} 

for all x g V and x' e V'. However, it will usually be possible to describe it by an 
arrow and call it the induced map, or the natural map. Giving a name to it 
would tend to make the terminology heavier than necessary. 

Theorem 6.4. Let V x V' —» K be a bilinear map, let W, W' be its kernels 
on the left and right respectively, and assume that V'/W' isfinite dimensional. 
Then the induced homomorphism V'/W' —> (V/Wf is an isomorphism. 

Proof. By symmetry, we have an induced homomorphism 

V/W (V'/W') v 

which is injective. Since 

dim (V'/W'f = dim V'/W' 


it follows that V/W is finite dimensional. From the above injective homomor- 
phism and the other, namely 

0 V'/W' (V/W) y , 


we get the inequalities 

dim V/W ^ dim V'/W' 


and 


dim V'/W' ^ dim V/W, 

whence an equality of dimensions. Hence our homomorphisms are surjective 
and inverse to each other, thereby proving the theorem. 

Remark 1. Theorem 6.4 is the analogue for vector spaces of the duality 
Theorem 9.2 of Chapter I. 

Remark 2. Let A be a commutative ring and let E be an A-module. Then 
we may form two types of dual: 

E A = Hom(£, Q/Z), viewing E as an abelian group; 

E y = Hom^Zs, A), viewing E as an A-module. 

Both are called dual, and they usually are applied in different contexts. For 
instance, E y will be considered in Chapter XIII, while E A will be considered in 
the theory of injective modules, Chapter XX, §4. For an example of dual module 
E y see Exercise 11. If by any chance the two duals arise together and there is 
need to distinguish between them, then we may call E A the Pontrjagin dual. 
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Indeed, in the theory of topological groups G, the group of continuous homo- 
morphisms of G into R/Z is the classical Pontrjagin dual, and is classically 
denoted by G A , so I find the preservation of that terminology appropriate. 

Instead of R/Z one may take other natural groups isomorphic to R/Z. The 
most common such group is the group of complex numbers of absolute value 1, 
which we denote by S 1 . The isomorphism with R/Z is given by the map 

x ^ e 2nix . 

Remark 3. A bilinear map V x V-+ K for which V' = V is called a bilinear 
form. We say that the form is non-singular if the corresponding maps 

V' —> V v and V-*(V’Ỳ 

are isomorphisms. Bilinear maps and bilinear forms will be studied at greater 
length in Chapter XV. See also Exercise 33 of Chapter XIII for a nice example. 


§7. MODULES OVER PRINCIPAL RINGS 

Throughout this section , we assume that R is a principal entire ring. All modules 
are over R, and homomorphisms are R-homomorphisms, unless otherwise specified. 

The theorems will generalize those proved in Chapter I for abelian groups. 
We shall also point out how the proofs of Chapter I can be adjusted with sub- 
stitutions of terminology so as to yield proofs in the present case. 

Let F be a free module over R, with a basis {x,} le/ . Then the cardinality of 
/ is uniquely determined, and is called the dimension of F. We recall that this 
is proved, say by taking a prime element p in R, and observing that F/pF is a 
vector space over the field R/pR, whose dimension is precisely the cardinality 
of /. We may therefore speak of the dimension of a free module over R. 

Theorem 7.1. Let F be a free module, and M a submodule. Then M is free, 

and its dimension is less than or equal to the dimension of F. 

Proof. For simplicity, we give the proof when F has a finite basis {xj, 
i = 1 , , n. Let M r be the intersection of M with (x^,..., x r ), the module 
generated by x í9 ..., x r . Then M x = M n (xj) is a submodule of (xj, and is 
therefore of type (a^x^) with some a x e R. Hence M { is either 0 or free, of di- 
mension 1. Assume inductively that M r is free of dimension ^ r. Let a be 
the set consisting of all elements ae R such that there exists an element xeM 
which can be written 


x = ftjXi -I- • • • -f b r x r + ax r+í 
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with bi e R. Then û is obviously an ideal, and is principal, generated say by an 
element a r + j. If a r+1 =0, then M r+Í = M r and we are done with the inductive 
step. If a r+ì ^ 0, let w e M r+ t be such that the coefficient of vv with respect 
to x r+1 is a r+í . If xeM r+ì then the coefficient of x with respect to x r+1 is 
divisible by a r+l , and hence there exists ceR such that x — cvv lies in M r . 
Hence 

M r+Ì = M r + (vv). 

On the other hand, it is clear that M r n (vv) is 0, and hence that this sum is direct, 
thereby proving our theorem. (For the infinite case, see Appendix 2, §2.) 

Corollary 7.2. Let E be a finiteìy generated module and E' a submodule. 

Then E' is finitely generated. 

Proof. We can represent E as a factor module of a free module F with a 
finite number of generators: If v ì9 ..., v n are generators of E, we take a free 
module F with basis {x^,..., x„} and map x, on v t . The inverse image of E' in F 
is a submodule, which is free, and finitely generated, by the theorem. Hence 
E' is finitely generated. The assertion also follows using simple properties of 
Noetherian rings and modules. 

If one wants to translate the proofs of Chapter I, then one makes the 
following definitions. A free 1-dimensional module over R is called infìnite 
cyciic. An infinite cyclic module is isomorphic to R, viewed as module over 
itself. Thus every non-zero submodule of an infinite cyclic module is infinite 
cyclic. The proof given in Chapter I for the analogue of Theorem 7.1 applies 
without further change. 

Let E be a module. We say that E is a torsion module if given x e E, there 
exists a e R, a # 0, such that ax = 0. The generalization of finite abelian group 
is fìnitely generated torsion module. An element x of E is called a torsion element 
if there exists ae R, a ^ 0, such that ax = 0. 

Let E be a module. We denote by £ tor the submodule consisting of all torsion 
elements of E, and call it the torsion submodule of E. If £ tor = 0, we say that 
E is torsion free. 

Theorem 7.3. Let E be finitely generated. Then E/E ior is free. There exists 

afree submodule F of E such that E is a direct sum 

E = E ior 0 F. 

The dimension of such a submodule F is uniquely determined. 

Proof. We first prove that E/E ior is torsion free. If x e E, let x denote its 
residue class mod £ tor . Let b e R, b + 0 be such that bx = 0. Then bx e £ tor , 
and hence there exists c e. R, c ± 0, such that cbx = 0. Hence x e £ tor and 
x = 0, thereby proving that E/E ior is torsion free. It is also finitely generated. 
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Assume now that M is a torsion free module which is finitely generated. Let 
{v x ,.. ., v n } be a maximal set of elements of M among a given finite set of 
generators {y l9 . . . , y m } such that {v u ..., v n } is linearly independent. If y is 
one of the generators, there exist elements a, b Ì9 ... , b n e R not all 0, such that 

ay b^Vi -+■ • • • + b n v n = 0. 

Then a ^ 0 (otherwise we contradict the linear independence of v ì9 ..., t; n ). 
Hence «y lies in (uj,..., v n ). Thus for each j = 1, ..., m we can find a } e R, 
a } # 0, such that a^yj lies in (t^,..., u M ). Let a = a x • • • a m be the product. Then 
aM is contained in (v ì9 ..., t?„), and « # 0. The map 

xi— >ax 

is an injective homomorphism, whose image is contained in a free module. 
This image is isomorphic to M, and we conclude from Theorem 7.1 that M is 
free, as desired. 

To get the submodule F we need a lemma. 

Lemma 7.4. Let E, E' be modules , and assume that E' is free. Let f.E^E' 

be a surjective homomorphism. Then there exists a free submodule F ofE such 

that the restriction offtoF induces an isomorphism of F with E', and such that 

E = F ® Ker / 

Proof. Let {x;} l€/ be a basis of E'. For each i, let x t be an element of E such 
that f(x t ) = x\. Let F be the submodule of E generated by all the elements x i9 
i e I. Then one sees at once that the family of elements {Xi} íe/ is linearly inde- 
pendent, and therefore that F is free. Given xeE, there exist elements a t e R 
such that 

f(x) = Z a i x ’i- 

Then x — £ ûjX, lies in the kernel of/ and therefore E = Ker f + F. It is clear 
that Ker/n F = 0, and hence that the sum is direct, thereby proving the lemma. 

We apply the lemma to the homomorphism E —» E/E tor in Theorem 7.3 to 
get our decomposition E = E ior © F. The dimension of F is uniquely determined, 
because F is isomorphic to E/E tor for any decomposition of E into a direct sum 
as stated in the theorem. 

The dimension of the free module F in Theorem 7.3 is called the rank of E. 

In order to get the structure theorem for finitely generated modules over R, 
one can proceed exactly as for abelian groups. We shall describe the dictionary 
which allows us to transport the proofs essentially without change. 

Let E be a module over R. Let xeE. The map a ax is a homomorphism 
of R onto the submodule generated by x, and the kernel is an ideal, which is 
principal, generated by an element me R. We say that m is a period of x. We 
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note that m is determined up to multiplication by a unit (if m # 0). An element 
c e R, c # 0, is said to be an exponent for E (resp. for x) if cE = 0 (resp. cx = 0). 

Let p be a prime element. We denote by E(p) the submodule of E consisting 
ofallelementsxhavinganexponent which is a power p r (r ^ 1). Ap-submodule 
of E is a submodule contained in E(p). 

We select once and for all a system of representatives for the prime elements 
of R (modulo units). For instance, if R is a polynomial ring in one variable over 
a field, we take as representatives the irreducible polynomials with leading 
coefficient 1. 

Let m e R, m # 0. We denote by E m the kernel of the map x mx. It consists 

of all elements of E having exponent m. 

A module E is said to be cyclic if it is isomorphic to R/(a) for some element 
a e R. Without loss of generality if a # 0, one may assume that a is a product of 
primes in our system of representatives, and then we could say that a is the order 
of the module. 

Let r i,..., r s be integers ^ 1. A /7-module E is said to be of type 

(p ri ,...,P rs ) 

if it is isomorphic to the product of cyclic modules R/(p n ) (i = 1,..., s). If p 
is fixed, then one could say that the module is of type (r í9 ..., r s ) (relative to p). 

All the proofs of Chapter I, §8 now go over without change. Whenever we 
argue on the size of a positive integer m , we have a similar argument on the 
number of prime factors appearing in its prime factorization. If we deal with a 
prime power p r 9 we can view the order as being determined by r. The reader 
can now check that the proofs of Chapter I, §8 are applicable. 

However, we shall develop the theory once again without assuming any 
knowledge of Chapter I, §8. Thus our treatment is self-contained. 

Theorem 7.5. Let Ebe a finitely generated torsion module ^ 0. Then E is 

the direct sum 


E = ® E(p), 

P 

taken over all primes p such that E(p) # 0. Each E(p) can be written as a direct 
sum 


E(p) = R/(p Ví )®-'® R/(p Vs ) 

with The sequence v u ..., v s is uniquely determined. 

Proof. Letabeanexponentfor£,andsupposethata = bcwith(b, c) = (1). 
Let x, y e R be such that 


1 = xb + yc. 
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We contend that E = E b © E c . Our first assertion then follows by induction, 
expressing a as a product of prime powers. Let v e E. Then 

v = xbv + ycv. 

Then xbv e E c because cxbv = xav = 0. Similarly, ycv e E b . Finally E b r\E c = 0, 
as one sees immediately. Hence E is the direct sum of E b and E c . 

We must now prove that E(p) is a direct sum as stated. If y í9 ..., y m are 
elements of a module, we shall say that they are independent if whenever we have 
a relation 

«ìJ'i + • • • + a m y m = 0 

with a t e R, then we must have = 0 for all i. (Observe that independent 
does not mean linearly independent.) We see at once that y u ..., y m are inde- 
pendent if and only if the module (y l5 ..., y m ) has the direct sum decomposition 

(yi. • • •. y m ) = (yi) 0 • • • © O'J 

in terms of the cyclic modules (y,), i = 1,..., m. 

We now have an analogue of Lemma 7.4 for modules having a prime power 
exponent. 

Lemma 7.6. Let Ebea torsion module of exponent p r (r ^ 1) for some prime 
element p. Let x t eE be an eìement of period p r . Let E = F/(xj). Let 
ŷŷ m be independent elements of E. Then for each i there exists a repre - 
sentative y t e E of ŷ i9 such that the period of y t is the same as the period of J+ 
The elements x ì9 y Ì9 ..., y m are independent. 

Proof. Let ŷ e E have period p n for some n ^ 1. Let y be a representative of 
ŷ in E. Then p n y e (xj, and hence 

p n y = p s cx í9 ce R 9 pf c, 

for some s ^ r. If s = r, we see that y has the same period as ŷ. If s < r, then 
p s cx j has period p r ~ s , and hence y has period p n+r ~ s . We must have 

n + r — s ^ r, 

because p r is an exponent for E. Thus we obtain n ^ s, and we see that 

y - p s ~ n cx i 

is a representative for ŷ, whose period is p n . 

Let y t be a representative for ŷ, having the same period. We prove that 
*i> • • • > y m are independent. Suppose that a, a í9 ..., a m e R are elements such 

that 


+ 01^1 + *•* + <*mym = 0 . 
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Then 

a iy\ + • • • + a m = 0. 

By hypothesis, we must have = 0 for each i. If p r * is the period of ŷ h then 
p n divides a x . We then conclude that a,y, = 0 for each i, and hence finally 
that ax { =0, thereby proving the desired independence. 

To get the direct sum decomposition of E(p ), we first note that E(p) is 
finitely generated. We may assume without loss of generality that E = E(p). 
Let Xj be an element of E whose period p r ' is such that r { is maximal. Let 
E = £/(*!). We contend that dim E p as vector space over R/pR is strictly less 
than dim E p . Indeed, if ŷ u ..., ŷ m are linearly independent elements of E p 
over R/pR, then Lemma 7.6 implies that dim E p ^ m + 1 because we can always 
find an element of (x^) having period p, independent of y í9 ..., y m . Hence 
dim E p < dim E p . We can prove the direct sum decomposition by induction. 
If E # 0, there exist elements x 2 ,..., x s having periods p r2 ,..., p r * respectively, 
such that r 2 = • • * ^ r s . By Lemma 7.6, there exist representatives x 2 , . . . , x r 
in E such that x, has period p n and x u ..., x r are independent. Since p ri is such 
that r { is maximal, we have r x ^ r 2 , and our decomposition is achieved. 

The uniqueness will be a consequence of a more general uniqueness theorem, 
which we state next. 

Theorem7.7. Let E be a finitely generated torsion module , E # 0. Then 

E is isomorphic to a direct sum of non-zero factors 

R/(qi) 0 * * ■ 0 R/(q r \ 

where q u ..., q r are non-zero elements of R , and q { \q 2 | • • • \q r . The sequence 

of ideals (q J,..., (q r ) is uniquely determined by the above conditions. 

Proof. U sing Theorem 7.5, decompose E into a direct sum of p-submodules, 
say £(Pi) © • • • © E(p t ), and then decompose each £(p,) into a direct sum of 
cyclic submodules of periods p nj . We visualize these symbolically as described 
by the following diagram: 

E(p i) • r 11 = r 12 = ''' 

E(Pl) : r 21 = r 22 = 

E(Pl) : r ll Ú r l2 ú'" 

A horizontal row describes the type of the module with respect to the prime at 
the left. The exponents r^ are arranged in increasing order for each fixed 
i = 1, ..., /. We let q l9 ..., q r correspond to the columns of the matrix of 
exponents, in other words 

qi = pTpT'"P?\ 
qi = pTp? 2 '' • pT , 
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The direct sum of the cyclic modules represented by the first column is then 
isomorphic to R/(q j), because, as with abelian groups, the direct sum of cyclic 
modules whose periods are relatively prime is also cyclic. We have a similar 
remark for each column, and we observe that our proof actually orders the 
by increasing divisibility, as was to be shown. 

Now for uniqueness. Let p be any prime, and suppose that E = R/(pb ) for 
some b e R, b =£ 0. Then E p is the submodule bR/(pb\ as follows at once from 
unique factorization in R. But the kernel of the composite map 

R^bR^ bR/(pb) 

is precisely (p). Thus we have an isomorphism 

R/(p) » bR/(pb). 

Let now E be expressed as in the theorem, as a direct sum of r terms. An 
element 


i? = f?i©---©t; r , v t e R/(q/) 

is in E p if and only if pv t = 0 for all i. Hence E p is the direct sum of the kernel of 
multiplication by p in each term. But E p is a vector space over R/(p), and its 
dimension is therefore equal to the number of terms R/^q/} such that p divides q t . 

Suppose that p is a prime dividing q ì9 and hence q t for each i = 1,..., r. Let 
E have a direct sum decomposition into d terms satisfying the conditions of the 
theorem, say 


E = R/(q\) © * ■ * © R/(q')> 

Then p must divide at least r of the elements q'-, whence r ^ s. By symmetry, 
r = s, and p divides q'j for all j. 

Consider the module pE. By a preceding remark, if we write q x = pb { , then 
pE » R/(b,) © • • • © R/(b r \ 

and b { | • • • \b r . Some of the b t may be units, but those which are not units 
determine their principal ideal uniquely, by induction. Hence if 

(M = ••• = (&,)= i 

but (bj+i) # (1), then the sequence of ideals 


...,(b r ) 
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is uniquely determined. This proves our uniqueness statement, and concludes 
the proof of Theorem 7.7. 

The ideals ( q x ),. . ., ( q r ) are called the invariants of E. 

For one of the main applications of Theorem 7.7 to linear algebra, see Chapter 
XV, §2. 

The next theorem is included for completeness. It is called the elementary 
divisors theorem. 

Theorem 7.8. Let F bea free module over R, and letM be afinitely generated 

submodule 4= 0. Then there exists a basis (B of F, elements e ìf , e m in this 

basis, and non-zero elements a m e R such that: 

(i) The elements a x e x ,..., a m e m forrn a basis of M over R. 

(ii) We have ai\a i+x for i = 1,..., m — 1. 

The sequence of ideals (<*!), ..., (a m ) is uniquely determined by the preceding 

conditions. 

Proof. Write a finite set of generators for M as linear combination of a finite 
number of elements in a basis for F. These elements generate a free submodule 
of finite rank, and thus it suffices to prove the theorem when F has finite rank, 
which we now assume. We let n = rank(F). 

The uniqueness is a corollary of Theorem 7.7. Suppose we have a basis as 
in the theorem. Say a x ,. .., a s are units, and so can be taken to be = 1, and 
a s +j = qj with q x \q 2 1 • • • | q r non-units. Observe that F/M = F is a finitely 
generated module over R , having the direct sum expression 

r 

F/M = F ~ (J) ( R/qjR)êj © free module of rank n — (r + s) 

j= i 

where a bar denotes the class of an element of_F mod M. Thus the direct sum 
over j = I,. .. , r is the torsion submodule of F , whence the elements q x ,..., 
q r are uniquely determined by Theorem 7.7. We have r + s = m, so the rank 
of F/M is n — m, which determines m uniquely. Then s = m - r is uniquely 
determined as the number of units among a x ,... ,a m . This proves the uniqueness 
part of the theorem. Next we prove existence. 

Let À be a functional on F, in other words, an element of Hom^(F, R). We 
let / A = À(M). Then J x is an ideal of R. Select À^ such that Àj(M) is maximal 
in the set of ideals {/ A }, that is to say, there is no properly larger ideal in the 
set {y A }. 

Let Àj(M) = (a x ). Then a x # 0, because there exists a non-zero element of 
M, and expressing this element in terms of some basis for F over R, with some 
non-zero coordinate, we take the projection on this coordinate to get a func- 
tional whose value on M is not 0. Let x x e M be such that À^Xj) = a x . For 
any functional g we must have g(x x ) e(a x ) [immediate from the maximality of 
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Ai(A/)]. Writing x x in terms of any basis of F, we see that its coefficients must 
all be divisible by a x . (If some coefficient is not divisible by a u project on this 
coefficient to get an impossible functional.) Therefore we can write x x = a x e x 
with some element e x eF. 

Next we prove that F is a direct sum 

F = Re x © Ker 

Since AjÍCj) = 1, it is clear that Re x n Ker X x = 0. Furthermore, given xe F 
we note that x — A^x)^ is in the kernel of k v Hence F is the sum of the in- 
dicated submodules, and therefore the direct sum. 

We note that Ker is free, being a submodule of a free module (Theorem 
7.1). Welet 

F x = Ker X x and M x = M fl Ker X x . 

We see at once that M = Rx x ® M x . 

Thus M x is a submodule of F x and its dimension is one less than the dimension 
of M. From the maximality condition on X x (M ), it follows at once that for any 
functional X on F x , the image X(M) will be contained in X x (M) (because otherwise, 
a suitable linear combination of functionals would yield an ideal larger than 
(^!)). We can therefore complete the existence proof by induction. 

In Theorem 7.8, we call the ideals («!),..., (a m ) the invariants of M in F. 
For another characterization of these invariants, see Chapter XIII, Proposition 
4.20. 

Example. First, see examples of situations similar to those of Theorem 7.8 
in Exercises 5,7, and 8, and for Dedekind rings in Exercise 13. 

Example. Another way to obtain a module M as in Theorem 7.8 is as 
a module of relations. Let W be a finitely generated module over F, with genera- 
tors Wj,. . . , w n . By a relation among {vvj,. . ., w n ) we mean an element 
(a x ,.. . , a n ) E R n such that 2 tf/vv,- = 0. The set of such relations is a sub- 
module of R n , to which Theorem 7.8 may be applied. 

It is also possible to formulate a proof of Theorem 7.8 by considering M as 
a submodule of R n , and applying the method of row and column operations to 
get a desired basis. In this context, we make some further comments which may 
serve to illustrate Theorem 7.8. We assume that the reader is acquainted with 
matrices over a ring. By row operations we mean: interchanging two rows; 
adding a multiple of one row to another; multiplying a row by a unit in the ring. 
We define column operations similarly. These row and column operations 
correspond to multiplication with the so-called elementary matrices in the ring. 

Theorem 7.9. Assume that the elementary matrices in R generate GL n (R). 

Let (Xij) be a non-zero matrix with components in R. Then with a finite 

number of row and column operations , it is possible to bring the matrix to 

the form 
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a x 0.0 

0 a 2 .0 


o • o 

0 . 0 ••• 0 

I 0 . 0 

with a\ • ■ ■ a m # 0 and a x \a 2 \ • • • | a m . 


We leave the proof for the reader. Either Theorem 7.9 can be viewed as 
equivalent to Theorem 7.8, or a direct proof may be given. In any case, Theorem 
7.9 can be used in the following context. Consider a system of linear equations 


c n*i + ' * • + c ln x n = 0 


C r l*l Crn^n 

with coefficients in R. Let F be the submodule of R n generated by the vectors 
X = (x ì9 ... , x n ) which are solutions of this system. By Theorem 7.1, we know 
that F is free of dimension ^ n. Theorem 7.9 can be viewed as providing a 
normalized basis for F in line with Theorem 7.8. 

Further example. As pointed out by Paul Cohen, the row and column 
method can be applied to modules over a power series ring o[[X]], where o is 
a complete discrete valuation ring. Cf. Theorem 3.1 of Chapter 5 in my Cyclo- 
tomic Fields I and II (Springer Verlag, 1990). For instance, one could pick o it- 
self to be a power series ring k[[T]] in one variable over a field k 9 but in the 
theory of cyclotomic fields in the above reference, o is taken to be the ring of 
p -adic integers. On the other hand, George Bergman has drawn my attention to 
P. M. Cohn’s “On the structure of GL^ of a ring,” IHES Publ. Math. No. 30 
(1966), giving examples of principal rings where one cannot use row and column 
operations in Theorem 7.9. 


§8. EULER-POINCARÉ MAPS 

The present section may be viewed as providing an example and application 
of the Jordan-Hôlder theorem for modules. But as pointed out in the examples 
and references below, it also provides an introduction for further theories. 

Again let A be a ring. We continue to consider A-modules. Let T be an 
abelian group, written additively. Let cp be a rule which to certain modules 
associates an element of T, subject to the following condition: 
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If 0 -► M' -► M -+ M" -> 0 is exact , í/zen <p(M) is defined if and only if (p(M') 
and cp(M") are defined , and in that case , 

= <p(M') -f- (p(M"). 

Furthermore (p( 0) is defined and equal to 0. 

Such a rule (p will be called an Euler-Poincaré mapping on the category of 
/1-modules. If M' is isomorphic to M, then from the exact sequence 

0->M'->M->0->0 

we conclude that (p(M') is defined if (p(M) is defined, and that (p(M') = (p(M). 
Thus if (p(M) is defined for a module M, <p is defined on every submodule and 
factor module of M. In particular, if we have an exact sequence of modules 

M'-+M-+ M" 

and if (p(M') and (p(M") are defined, then so is (p(M\ as one sees at once by 
considering the kernel and image of our two maps, and using the definition. 

Examples. We could let A = Z, and let (p be defined for all finite abelian 
groups, and be equal to the order of the group. The value of (p is in the multi- 
plicative group of positive rational numbers. 

As another example, we consider the category of vector spaces over a field k. 
We let (p be defined for finite dimensional spaces, and be equal to the dimension. 
The values of <p are then in the additive group of integers. 

In Chapter XV we shall see that the characteristic polynomial may be con- 
sidered as an Euler-Poincaré map. 

Observe that the natural map of a finite module into its image in the Groth- 
endieck group defined at the end of §4 is a universal Euler-Poincaré mapping. 
We shall develop a more extensive theory of this mapping in Chapter XX, §3. 

If M is a module (over a ring A), then a sequence of submodules 
M = Mj i3 M 2 => • • • =3 M r = 0 

is also called a finite fìltration, and we call r the length of the filtration. A module 
M is said to be simple if it does not contain any submodule other than 0 and M 
itself, and if M # 0. A filtration is said to be simple if each MJM i+ ^ is simple. 
The Jordan-Hôlder theorem asserts that two simple filtrations of a module are 
equivalent. 

A module M is said to be of finite length if it is 0 or if it admits a simple 
(finite) filtration. By the Jordan-Hòlder theorem, the length of such a simple 
filtration is the uniquely determined, and is called the length of the module. In 
the language of Euler characteristics, the Jordan-Holder theorem can be re- 
formulated as follows: 
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Theorem8.1. Let cp be a rule which to each simple module associates an 
element of a commutative group T, and such that ifM & M' then 

(p(M) = (p(M'). 

Then (p has a unique extension to an Euler-Poincaré mapping defined on all 
modules of finite length. 

Proof Given a simple filtration 

M = M { => M 2 => • • • => M r = 0 

we define 

ç>(M)=X <p(Mj/M i+1 ). 

i= 1 

The Jordan-Hòlder theorem shows immediately that this is well-defined, and 
that this extension of (p is an Euler-Poincaré map. 

In particular, we see that the length function is the Euler-Poincaré map 
taking its values in the additive group of integers, and having the value 1 for any 
simple module. 


§9. THE SNAKE LEMMA 

This section gives a very general lemma, which will be used many times, 
so we extract it here. The reader may skip it until it is encountered, but already 
we give some exercises which show how it is applied: the five lemma in Exercise 
15 and also Exercise 26. Other substantial applications in this book will occur 
in Chapter XVI, §3 in connection with the tensor product, and in Chapter XX 
in connection with complexes, resolutions, and derived functors. 

We begin with routine comments. Consider a commutative diagram of homo- 
morphisms of modules. 


M'—-—> M 



n — r^N 

Then / induces a homomorphism 

Ker d' Ker d. 

Indeed, suppose d'x' = 0. Then df(x') = 0 because df(x') = hd'(x') = 0. 
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Similarly, h induces a homomorphism 

Coker d' -> Coker d 

in a natoral way as follows. Let y' e N' represent an element of N'/d’M'. Then 
hy' mod dM does not depend on the choice of y' representing the given element, 
because if y" = y' + d'x ', then 

hy" = hy' + hd'x' = hy' + dfx' = hy' mod dM. 

Thus we get a map 

/î*: N'/d'M' = Coker d' -» N/dM = Coker d, 

which is immediately verified to be a homomorphism. 

In practice, given a commutative diagram as above, one sometimes writes/ 
instead of h, so one writes /for the horizontal maps both above and below the 
diagram. This simplifies the notation, and is not so incorrect: we may view 
M', N' as the two components of a direct sum, and similarly for M, N. Then/ 
is merely a homomorphism defined on the direct sum M' © N' into M © N. 

The snake lemma concerns a commutative and exact diagram called a snake 
diagram: 


M’——*M —-—♦ M" -►0 

d' d d" 

0 -> N' -—> N -> N" 

f g 

Let z" e Ker d". We can construct elements of N' as follows. Since g is 
surjective, there exists an element z e M such that gz = z". We now move 
vertically down by d, and take dz. The commutativity d"g = gd shows that 
gdz = 0 whence dz is in the kernel of g in N. By exactness, there exists an 
element z' e N' such that fz' = dz. In brief, we write 

z' =/ _1 ° d o g~ l z". 

Of course, z' is not well defined because of the choices made when taking inverse 
images. However, the snake lemma will state exactly what goes on. 

Lemma 9.1. (Snake Lemma). Given a snake diagram as above , the map 

ô : Ker d" - Coker d' 

given by ôz" = f ~ 1 o d ° g~ l z" is well defined , and we have an exact sequence 
Ker d' -► Ker d -* Ker d" 4 Coker d! Coker d -* Coker d" 
where the maps besides ô are the natural ones. 
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Proof. It is a routine verification that the class of z' mod Im à' is in- 
dependent of the choices made when taking inverse images, whence defining 
the map <5. The proof of the exactness of the sequence is then routine, and 
consists in chasing around diagrams. It should be carried out in full detail 
by the reader who wishes to acquire a feeling for this type of triviality. As an 
example, we shall prove that 

Ker 8 C Im g* 

where g* is the induced map on kernels. Suppose the image of z" is 0 in Coker 
d'. By definition, there exists u e M' such that z’ = d'u'. Then 

dz = fz' = fd'u' = dfu' 


by commutativity. Hence 

d(z - fu ') = 0, 

and z — fu' is in the kernel of d. But g(z — fu ') = gz = z". This means that z" is 
in the image of #*, as desired. All the remaining cases of exactness will be left 
to the reader. 

The original snake diagram may be completed by writing in the kernels 
and cokernels as follows (whence the name of the lemma): 


Ker d' -> Ker d 


M' 


M 


Ker d"- 


M" 


N' 


N 


N" 


Coker d' -► Coker d -► Coker d" 


§10. DIRECT AND INVERSE LIMITS 

We return to limits, which we considered for groups in Chapter I. We now 
consider limits in other categories (rings, modules), and we point out that limits 
satisfy a universal property, in line with Chapter I, §11. 

Let / = {/} be a directed system of indices, defined in Chapter I, §10. Let 
Ofc be a category, and {Aj a family of objects in Cfc. For each pair i , j such that 



160 MODULES 


III , §10 


i ^ j assume given a morphism 

fY-A^Aj 

such that, whenever i g /c, we have 

fí°f)=Û and /; = id. 

Such a family will be called a directed family of morphisms. A direct limit 
for the family {/}} is a universal object in the following category C. Ob(C) 
consists of pairs ( A , (/')) where /1 e Ob(Ct) and (/') is a family of morphisms 
/': i4 f >4, i e /, such that for all i ^ j the following diagram is commutative: 


f\ 

Ai ———>Aj 



(Universal of course means universally repelling.) 

Thus if ( A , (/*)) is the direct limit, and if (£, (g 1 )) is any object in the above 
category, then there exists a unique morphism cp\ Â -► B which makes the 
following diagram commutative: 



For simplicity, one usually writes 


A = Um A h 

i 

omitting the f) from the notation. 

Theorem 10.1. Direct limits exist in the category of abelian groups , or more 
generally in the category of modules over a ring . 

Proof Let {MJ be a directed system of modules over a ring. Let M be 
their direct sum. Let N be the submodule generated by all elements 

Xij = (. . . , 0, *, 0, . . . , -fj(x\ 0, . . .) 
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where, for a given pair of indices (ij) with j ^ i, x u has component x in M h 
f)(x ) in Mj , and component 0 elsewhere. Then we leave to the reader the veri- 
fication that the factor module M/N is a direct limit, where the maps of into 
M/N are the natural ones arising from the composite homomorphism 

Mi~* M M/N. 

Example. Let X be a topological space, and let x e X. The open neigh- 
borhoods of x form a directed system, by inclusion. Indeed, given two open 
neighborhoods U and V , then U fl V is also an open neighborhood contained in 
both U and V. In sheaf theory, one assigns to each U an abelian group A(U) and 
for each pair U D V ahomomorphism hy: A(U) —> A(V) such that if UD VD W 
then hw°hy = hjf. Then the family of such homomorphisms is a directed family. 
The direct limit 


lim A(U) 

U 

is called the stalk at the point x. We shall give the formal definition of a sheaf 
of abelian groups in Chapter XX, §6. For further reading, I recommend at least 
two references. First, the self-contained short version of Chapter II in Hartshorne’s 
Algebraic Geometry , Springer Verlag, 1977. (Do all the exercises of that section, 
concerning sheaves.) The section is only five pages long. Second, I recommend 
the treatment in Gunning’s Introduction to Holomorphic Functions of Several 
Variables , Wadsworth and Brooks/Cole, 1990. 

We now reverse the arrows to define inverse limits. We are again given a 
directed set / and a family of objects A h lîj ^ i we are now given a morphism 

fì'Aj^Ai 


satisfying the relations 


f[ofi=fi and /{ = id, 

if j ^ i and i ^ k. As in the direct case, we can define a category of objects 
(A, f) with f: A -> A t such that for all i, j the following diagram is com- 
mutative: 



A universal object in this category is called an inverse limit of the system (A h f l j). 
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As before, we often say that 


A = lim Aj 


is the inverse limit, omitting the f) from the notation. 

Theorem 10.2. Inverse limits exist in the category of groups, in the category 
of modules over a ring, and also in the category of rings. 

Proof Let {Gj be a directed family of groups, for instance, and let T be 
their inverse limit as defined in Chapter I, §10. Let /7 f -: T —> G t be the projection 
(defined as the restriction from the projection of the direct product, since T is 
a subgroup of n Gi). It is routine to verify that these data give an inverse limit 
in the category of groups. The same construction also applies to the category of 
rings and modules. 

Example. Let p be a prime number. For n^m we have a canonical surjective 
ring homomorphism 

fZ:Z/p n Z-*Z/p"'Z. 

The projective limit is called the ring of p -adic integers, and is denoted by Z p . 
For a consideration of this ring as a complete discrete valuation ring, see Exercise 
17 and Chapter XII. 

Let k be a field. The power series ring k[[T]] in one variable may be viewed 
as the projective limit of the factor polynomial rings k[T]/(T n ), where for 
n ^ m we have the canonical ring homomorphism 

fm- k[T]/(T n ) - k[T]/(T m ). 

A similar remark applies to power series in several variables. 

More generally, let R be a commutative ring and let J be a proper ideal. If 
n ^ m we have the canonical ring homomorphism 

f n :R/J n -^R/J m . 

Let Rj = lim R/J n be the projective limit. Then R has a natural homomorphism 
into Rj. If R is a Noetherian local ring, then by Krull’s theorem (Theorem 5.6 
of Chapter X), one knows that fl J n = {0}, and so the natural homorphism of R 
in its completion is an embedding. This construction is applied especially when 
J is the maximal ideal. It gives an algebraic version of the notion of holomorphic 
functions for the following reason. 

Let R be a commutative ring and J a proper ideal. Define a 7-Cauchy se- 
quence {x n } to be a sequence of elements of R satisfying the following condition. 
Given a positive integer k there exists N such that for all w, m ^ N we have 
x n — x m e J k - Define a null sequence to be a sequence for which given k there 
exists N such that for all n ^ N we have x n e J k . Define addition and multipli- 
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cation of sequences termwise. Then the Cauchy sequences form a ring C, the 
null sequences form an ideal X, and the factor ring C/X is called the 7-adic 
completion of R. Prove these statements as an exercise, and also prove that there 
is a natural isomorphism 

e/x ~ Um R /J n • 

Thus the inverse limit lim R/J n is also called the 7-adic completion. See Chapter 
XII for the completion in the context of absolute values on fields. 

Examples. In certain situations one wants to determine whether there exist 
solutionsofasystemofapolynomialequation/(Xj,. . . ,X n ) = 0 withcoefficients 
in a power series ring k[T], say in one variable. One method is to consider the 
ring mod (T N ), in which case this equation amounts to a finite number of equations 
in the coefficients. A solution of f(X) = 0 is then viewed as an inverse limit of 
truncated solutions. For an early example of this method see [La 52], and for 
an extension to several variables [Ar 68]. 

[La 52] S. Lang, On quasi algebraic closure, Ann of Math. 55 (1952), pp. 373-390 

[Ar 68] M. Artin, On the solutions of analytic equations, Invent. Math. 5 (1968), pp. 
277-291 

See also Chapter XII, §7. 

In Iwasawa theory, one considers a sequence of Galois cyclic extensions K n 
over a number field k of degree p n with p prime, and with K n C K n+Ì . Let G n 
be the Galois group of K n over k. Then one takes the inverse limit of the group 
rings ( Z/p n Z)[G n ], following Iwasawa and Serre. Cf. my Cyclotomic Fields, 
Chapter 5. In such towers of fields, one can also consider the projective limits 
of the modules mentioned as examples at the end of §1. Specifically, consider 
the group of p n -th roots of unity py, and let K n = Q(p p n+i), with K 0 = Qíp^). 
We let 

T p (ii) = lim \i p n 

under the homomorphisms p p «+i \k p n given by ^ i—^ Then T p (\i) becomes 
a module for the projective limits of the group rings. Similarly, one can consider 
inverse limits for each one of the modules given in the examples at the end of 
§1. (See Exercise 18.) The determination of the structure of these inverse limits 
leads to fundamental problems in number theory and algebraic geometry. 

After such examples from real life after basic algebra, we return to some 
general considerations about inverse limits. 

Let (Ai,f{) = (A t ) and (B h g{) = (B^) be two inverse systems of abelian 
groups indexed by the same indexing set. A homomorphism (AJ -> (B t ) is the 
obvious thing, namely a family of homomorphisms 


hi : A t -> B t 
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for each i which commute with the maps of the inverse systems: 


A sequence 


Aj ~ ► Bj 


fi 


B t 


0 - (Ai) - (Bd - (Cd - 0 


is said to be exact if the corresponding sequence of groups is exact for each i. 

Let (A n ) be an inverse system of sets, indexed for simplicity by the positive 
integers, with connecting maps 


“m,n'-A m -► A n for m ^ n. 

We say that this system satisfies the Mittag-Leffler condition ML if for each n, 
the decreasing sequence u m n (A m ) (m ^ n) stabilizes, i.e. is constant for m 
sufficiently large. This condition is satisfied when u m n is surjective for all m, 
n. 

We note that trivially, the inverse limit functor is left exact, in the sense that 
given an exact sequence 


then 


is exact. 


0 - (A n ) - (B n ) -> (C n ) -> 0 
0 -> lim A n -► Hm B n -► Hm C n 


Proposition 10.3. Assume that (A„) satisfies ML. Given an exact sequence 


0 - (A n ) - (B n ) 4 (C n ) - 0 


of inverse systems , then 

0 -> lim A n -> Hm B n -► Hm C„ -► 0 

/s CXÛCÍ. 

Proc/ The only point is to prove the surjectivity on the right. Let (c n ) be 
an element of the inverse limit. Then each inverse image g'^Cn) is a coset of 
A n , so in bijection with A n . These inverse images form an inverse system, and 
the ML condition on (Af) implies ML on (g~ l (c n )). Let S n be the stable subset 

S n = D U m,n(g-'(C m )). 
m^n 
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Then the connecting maps in the inverse system ( S n ) are surjective, and so there 
is an element ( b n ) in the inverse limit. It is immediate that g maps this element 
on the given (c n ), thereby concluding the proof of the Proposition. 

Proposition 10.4. Let ( C n ) be an inverse system ofabelian groups satisfying 
ML, and let ( u m n ) be the system of connecting maps. Then we have an exact 
sequence 


o- Um c,-nc.^nc.-*o. 

Proof. For each positive integer N we have an exact sequence with a finite 
product 


0 -► lim C n -> Y[ -► n 0. 

1 n=l /i=l 

The map u is the natural one, whose effect on a vector is 

(0,..., 0, c m , 0,..., 0) »-► (0,..., 0, u m m _ jc m , 0,..., 0). 

One sees immediately that the sequence is exact. The infinite products are in- 
verse limits taken over N. The hypothesis implies at once that ML is satisfied 
for the inverse limit on the left, and we can therefore apply Proposition 10.3 to 
conclude the proof. 


EXERCISES 

1. Let V be a vector space over a field K , and let U , W be subspaces. Show that 

dim U -I- dim W = dim (U + W) + dim (U n W). 

2. Generalize the dimension statement ofTheorem 5.2 to free modules over a commutative 
ring. [ Hint: Recall how an analogous statement was proved for free abelian groups, 
and use a maximal ideal instead of a prime number.] 

3. Let R be an entire ring containing a field k as a subring. Suppose that R is a finite 
dimensional vector space over k under the ring multiplication. Show that R is a field. 

4. Direct sums. 

(a) Prove in detail that the conditions given in Proposition 3.2 for a sequence to 
split are equivalent. Show that a sequence 0 —> M' -4 M M" —> 0 splits if 

and only if there exists a submodule N of M such that M is equal to the direct 
sum Im/0 N, and that if this is the case, then N is isomorphic to M”. Complete 
all the details of the proof of Proposition 3.2. 
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(b) Let E and E t (i = 1,. .., m) be modules over a ring. Let <p, : E t —» E and 
: E —» £, be homomorphisms having the follovving properties: 

° <P« = id, ỳi °(Pj = 0 if i # j, 

m 

Z <Pi °= id - 


Show that the map x (i/^x,..., </r m x) is an isomorphism of E onto the direct product 
of the £, (i = 1,..., m), and that the map 

+ ••• + <p m x m 

is an isomorphism of this direct product onto E. 

Conversely, if E is equal to a direct product (or direct sum) of submodules 
E' (i = 1, . . ., m), if we let be the inclusion of E t in E , and ipj the projection of 
E on E h then these maps satisfy the above-mentioned properties. 

5. Let A be an additive subgroup of Euclidean space R", and assume that in every bounded 
region of space, there is only a finite number of elements of A. Show that A is a free 
abelian group on ^ n generators. [Hint: Induction on the maximal number of 
linearly independent elements of A over R. Let v u ..., v m be a maximal set of such 
elements, and let A 0 be the subgroup of A contained in the R-space generated by 
v u ... 9 v m _ j. By induction, one may assume that any element of A 0 is a linear integral 
combination of v í9 ..., v m -i. Let S be the subset of elements v e A of the form 
v = a^v i -I- • • • + a m v m with real coefficients û, satisfying 

0 ^ a t < 1 if i = 1,..., m — 1 

o á <!» s i. 

If v m is an element of S with the smallest a m # 0, show that {v í9 ..., v m - í9 v m } is a basis 
of A over Z.] 

Note. The above exercise is applied in algebraic number theory to show that the 
group of units in the ring of integers of a number field modulo torsion is isomorphic 
to a lattice in a Euclidean space. See Exercise 4 of Chapter VII. 

6. (Artin-Tate). Let G be a finite group operating on a finite set S. For vv E S, denote 
1 • vv by [vv], so that we have the direct sum 

Z(5> = 2 Z[w]. 

we5 

Define an action of G on Z (S) by defining cr[vv] = [crvv] (for vv E S), and extending 
cr to Z(S) by linearity. Let Mbea subgroup of Z(S) of rank #[S]. Show that M has 
a Z-basis {y w } wes such that cry w = for all vv E S. (Cf. my Algebraic Number 
Theory, Chapter IX, §4, Theorem 1.) 

7. Let M be a finitely generated abelian group. By a seminorm on M we mean a real- 
valued function v i-> \ v\ satisfying the following properties: 
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|î?| ^ 0 for all v E M; 

\nv\ = \n\ \v\ for n E Z; 

|î? + w | = |î?| + |w| for all v, w E M. 

By the kernel of the seminorm we mean the subset of elements v such that \ v\ = 0. 

(a) Let M 0 be the kernel. Show that M 0 is a subgroup. If M 0 = {0}, then the 
seminorm is called a norm. 

(b) Assume that M has rank r. Let v x ,..., v r e M be linearly independent over 
Z mod M 0 . Prove that there exists a basis {w ly .. ., w r } of M/M 0 such that 

kil = É |Pyl- 

y=i 

[f/iní: An explicit version of the proof of Theorem 7.8 gives the result. 
Without loss of generality, we can asume M 0 = {0}. Let M x = (v x ,.. . , v r ). 
Let d be the exponent of M/M x . Then dM has a finite index in M x . Let n^j 
be the smallest positive integer such that there exist integers n j X ,... , nj j- x 
satisfying 

nj x v x + • • • + nj jVj = dwj for some Wj e M. 

Without loss of generality we may assume 0 ^ n j k ^ d - 1. Then the elements 
w x ,. . . , w r form the desired basis.] 

8. Consider the multiplicative group Q* of non-zero rational numbers. For a non-zero 
rational number jc = a/b with a, b E Z and (a, b) = 1, define the height 

h(x) = log max( \a\, \b\). 

(a) Show that h defines a seminorm on Q*, whose kernel consists of ±1 (the 
torsion group). 

(b) Let Mj be a finitely generated subgroup of Q*, generated by rational numbers 
x x ,. . . , x m . Let M be the subgroup of Q* consisting of those elements jc such 
that jc 5 e M x for some positive integer s. Show that M is finitely generated, 
and using Exercise 7, find a bound for the seminorm of a set of generators 
of M in terms of the seminorms of jc, ,... , jc m . 

Note. The above two exercises are applied in questions of diophantine 
approximation. See my Diophantine approximation on toruses, Am. J. Math. 
86 (1964), pp. 521-533, and the discussion and references I give in Ency- 
clopedia of Mathematical Sciences, Number TheorylII, Springer Verlag, 1991, 
pp. 240-243. 

Localization 

9. (a) Let A be a commutative ring and let M be an A-module. Let S be a multiplicative 

subset of A. Define S~ l M in a manner analogous to the one we used to define 
S~ l A, and show that S~ l M is an S -1 A-module. 

(b) If 0 —> M' -> M -> M" —» 0 is an exact sequence, show that the sequence 
0 -» 5 _1 M' -» S~ l M -» 5 _, M" 0 is exact. 
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10. (a) If p is a prime ideal, and S = A - p is the complement of p in the ring A, then 

S~ X M is denoted by M p . Show that the natural map 

of a module M into the direct product of all localizations M p where p ranges over 
all maximal ideals, is injective. 

(b) Show that a sequence 0 —> M' —> M —> Af" —> 0 is exact if and only if the sequence 
0 —> Mp —> M p —> M" p —> 0 is exact for all primes p. 

(c) Let A be an entire ring and let M be a torsion-free module. For each prime p of 
A show that the natural mapM-> M p is injective. In particular A —> A p is injective, 
but you can see that directly from the imbedding of A in its quotient field K. 

Projective modules over Dedekind rings 

For the next exercise we assume you have done the exercises on Dedekind rings in 
the preceding chapter. We shall see that for such rings, some parts of their module theory 
can be reduced to the case of principal rings by localization. We let o be a Dedekind ring 
and K its quotient field. 

11. Let M be a finitely generated torsion-free module over o. Prove that M is projective. 
[Hint : Given a prime ideal p, the localized module M p is finitely generated torsion- 
free over o p , which is principal. Then M p is projective, so if F is finite free over o, 
and /: F —> M is a surjective homomorphism, then / p : F p —> M p has a splitting 
g p : M p —> F p , such that / p ° g p = id^. There exists c p E o such that c p ^ p and 
c v g p (M) C F. The family {c p } generates the unit ideal o (why?), so there is a finite 
number of elements c Pi and elements jc, E o such that ^x ( c Pi = 1. Let 

g = ’Zx i c Vi g fi . 

Then show that g: M —> F gives a homomorphism such that f°g = id^.] 

12. (a) Let a,b be ideals. Show that there is an isomorphism ofo-modules 

a ® b —> o®ab 

[Hint: First do this when a, b are relatively prime. Consider the homomorphism 
a ® b -> a + b, and use Exercise 10. Reduce the general case to the relatively 
prime case by using Exercise 19 of Chapter II.] 

(b) Let a, b be fractional ideals, and let/: a —> b be an isomorphism (of o-modules, 
of course). Then/has an extension to a K -linear map f K : K —> K. Let c = f K (\). 
Show that b = ca and that /is given by the mapping m c : x -+ cx (multiplication 
by c). 

(c) Let a be a fractional ideal. For each b E a _1 the map m b : a —> o is an element 
of the dual a v . Show that a _1 = a v = Hom 0 (a, o) under this map, and so 
a vv = a. 

13. (a) Let M be a projective finite module over the Dedekind ring o. Show that there 

exist free modules F and F' such that F D M D F\ and F, F' have the same 
rank, which is called the rank of M. 

(b) Prove that there exists a basis {e u ..., e n } of F and ideals a x , . . . , a„ such that 
M = a x e x + • • • + a n e n , or in other words, M ~ ® a t . 
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(c) Prove that M ~ o" -1 © û for some ideal û, and that the association M i-> o 
induces an isomorphism of £ 0 (o) with the group of ideal classes Pic(o). (The 
group Aq(o) is the group of equivalence classes of projective modules defined at 
the end of §4.) 

A few snakes 

14. Consider a commutative diagram of R-modules and homomorphisms such that each 
row is exact: 


M' -> M -> M" -> 0 

f q h 

0-> N' -► /V -► N" 


Prove: 

(a) If f h are monomorphisms then g is a monomorphism. 

(b) If f h are surjective, then g is surjective. 

(c) Assume in addition that 0 -► M' -* M is exact and that N -* N" -* 0 is exact. 
Prove that if any two of f g , h are isomorphisms, then so ìs the third. [Hintv 
Use the snake lemma.] 

15. The five lemma. Consider a commutative diagram of R-modules and homomorph- 
isms such that each row is exact: 



Prove: 

(a) If/j is surjective and/ 2 ,/ 4 are monomorphisms, then/ 3 ìs a monomorphism. 

(b) If/ 5 is a monomorphism and/ 2 ,/ 4 are surjective, then/ 3 is surjective. [Hint: 
Use the snake lemma.] 

Inverse limits 

16. Prove that the inverse limit of a system of simple groups in which the homomorphisms 
are surjective is either the trivial group, or a simple group. 

17. (a) Let n range over the positive integers and let p be a prime number. Show that 

the abelian groups A n = 7jlp n 2> form a projective system under the canonical 
homomorphism if n ^ m. Let Z p be its inverse limit. Show that Z p maps sur- 
jectively on each Z!p n Z\ that Z p has no divisors of 0, and has a unique maximal 
ideal generated by p. Show that Z p is factorial, with only one prime, namely p 
itself. 
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(b) Next consider all ideals of Z as forming a directed system, by divisibility. Prove 
that 

lim Z/(û) = n Z\ p , 

(a) p 

where the limit is taken over all ideals (a), and the product is taken over all 
primes p. 

18. (a) Let {A„} be an inversely directed sequence of commutative rings, and let { M n } 

be an inversely directed sequence of modules, M n being a module over A n such 
that the following diagram is commutative: 

A n +\ x Af n+ i -+M n + l 

•i 1 i 

A n xM n M n 

The vertical maps are the homomorphisms of the directed sequence, and the 
horizontal maps give the operation of the ring on the module. Show that lim M n 
is a module over lim A n . 

(b) Let M be a /?-divisible group. Show that T p (A) is a module over Z p . 

(c) Let M, N be /7-divisible groups. Show that T p (M © N) = T p (M) © T p (N ), as 
modules over Z p . 

Direct limits 

19. Let (/4,,/J) be a directed family of modules. Let a k e A k for some /c, and suppose that 
the image of a k in the direct limit A is 0. Show that there exists some index j ^ k such 
that f k j(a k ) = 0. In other words, whether some element in some group A ,• vanishes 
m the direct limit can already be seen within the original data. One way to see this 
is to use the construction of Theorem 10.1. 

20. Let /, J be two directed sets, and give the product / x J the obvious ordering that 
(ij) rg (/',/) if i ^ i' and j g/. Let A^ be a family of abelian groups, with homo- 
morphisms indexed by / x 7, and forming a directed family. Show that the direct 
limits 

Lim \_mAij and li m \ìn ) A ;j 

* j j i 

exist and are isomorphic in a natural way. State and prove the same result for inverse 
limits. 

21. Let (MJ,/}), (M f , g)) be difected systems of modules over a ring. By a homomorphism 

(M;.) 4 (M,) 

one means a family of homomorphisms w,: Mj -*■ M, for each /' which commute with 
the fj, g). Suppose we are given an exact sequence 

0 -* (M;) 4 (M,) 4 (M") -* 0 


of directed systems, meaning that for each /', the sequence 


0 - m; -* M f -* M" - 0 




III, Ex 


EXERCISES 171 


is exact. Show that the direct limit preserves exactness, that is 
0 -► hm hm M, -► hm M" -> 0 


ìs exact. 

22. (a) Let {MJ be a family of modules over a ring. For any module N show that 

Hom(® M t , N) = f[ Hom(M„ N) 

(b) Show that 

Hom (N, n Md = n Hom (N, M f ). 

23. Let {M,} be a directed family of modules over a ring. For any module N show that 

hm Hom(N, M,) = Hom(N, hm M,) 


24. Show that any module is a direct limit of finitely generated submodules. 

A module M is called finitely presented if there is an exact sequence 

F, ->F o ->M->0 

where F 0 , F x are free with fìnite bases. The image of F { in F 0 is said to be the submodule 
of relations, among the free basis elements of F 0 . 

25. Show that any module is a direct limit of finitely presented modules (not necessarily 
submodules). In other words, given M, there exists a directed system {M f , /j} with M, 
finitely presented for all i such that 

M % lim M,. 

[Hint: Any finitely generated submodule is such a direct limit, since an infinitely 
generated module of relations can be viewed as a limit of finitely generated modules of 
relations. Make this precise to get a proof.] 

26. Let F be a module over a ring. Let {M,} be a directed family of modules. If E is finitely 
generated, show that the natural homomorphism 

lim Hom(F, M,) -> Hom(F, iim M,) 

is injective. IfF is finitely presented, show that this homomorphism is an isomorphism. 
Hint : First prove the statements when F is free with finite basis. Then, say F is 
finitely presented by an exact sequence F x -> F 0 -> F -> 0. Consider the diagram: 

0-> hm Hom(F, M,)-> Hm Hom(F 0 , M,)-> lim Hom(F!, M,) 


0 


Hom(F, Hm M,) 


Hom(F 0 , lim M t ) 


Hom(Fj, linq M,ì 
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Graded Algebras 

Let A be an algebra over a field k. By a filtradon of A we mean a sequence of k- 
vector spaces Ai (i = 0,1,...) such that 

Ao c= A\ c= A 2 c= ■ • • and (J A[ = A, 

and AiAj c= A i+ j for all /, j ^ 0. In particular, A is an /l 0 -algebra. We then call A a fil- 
tered algebra. Let R be an algebra. We say that R is graded if R is a direct sum 
R = 0 Ri of subspaces such that R^Rj c= R i+ j for all /, j ^ 0. 

27. Let A be a filtered algebra. Define for / ^ 0 by Ri = Ai/A^\. By definition, 
A -1 = {0}. Let R = Q)R iy and R{ = gr { (A). Define a natural product on R making 
R into a graded algebra, denoted by gr (A), and called the assocíated graded algebra. 

28. Let A, B be filtered algebras, A = (J A[ and B = (J 2?,. Let L: A —> B be an (Ao, Bo)- 
linear map preserving the filtration, that is L(A,) c= 2?, for all /, and L(ca) = 
L(c)L(a) for c e Aq and a e Ai for all /. 

(a) Show that L induces an (Ao, 5 0 )-linear map 

gr,(^): gTi(A) — gr,(B) for all í. 

(b) Suppose that gr^L) is an isomorphism for all /. Show that L is an (Ao,Bo)- 
isomorphism. 

29. Suppose k has characteristic 0. Let n be the set of all strictly upper triangular ma- 
trices of a given size n x n over k. 

(a) For a given matrix X e n, let D\(X ),. . .,D n (X) be its diagonals, so D\ = 
D\(X) is the main diagonal, and is 0 by the definition of n. Let n, be the 
subset of n consisting of those matrices whose diagonals D\ ,..., Z)„_, are 0. 
Thus rto = {0}, xi\ consists of all matrices whose components are 0 except 
possibly for x nn ; n 2 consists of all matrices whose components are 0 except 
possibly those in the last two diagonals; and so forth. Show that each n, is 
an algebra, and its elements are nilpotent (in fact the (/ + l)-th power of its 
elements is 0). 

(b) Let U be the set of elements 1 -h X with X en. Show that U is a multi- 
plicative group. 

(c) Let exp be the exponential series defined as usual. Show that exp defines a 
polynomial function on n (all but a finite number of terms are 0 when eval- 
uated on a nilpotent matrix), and establishes a bijection 

exp: n —► U. 

Show that the inverse is given by the standard log series. 



CHAPTER IV 

Polynomials 


This chapter provides a continuation of Chapter II, §3. We prove stan- 
dard properties of polynomials. Most readers will be acquainted with some 
of these properties, especially at the beginning for polynomials in one vari- 
able. However, one of our purposes is to show that some of these properties 
also hold over a commutative ring when properly formulated. The Gauss 
lemma and the reduction criterion for irreducibility will show the importance 
of working over rings. Chapter IX will give examples of the importance of 
working over the integers Z themselves to get universal relations. It happens 
that certain statements of algebra are universally true. To prove them, one 
proves them first for elements of a polynomial ring over Z, and then one 
obtains the statement in arbitrary fields (or commutative rings as the case 
may be) by specialization. The Cayley-Hamilton theorem of Chapter XV, 
for instance, can be proved in that way. 

The last section on power series shows that the basic properties of 
polynomial rings can be formulated so as to hold for power series rings. I 
conclude this section with several examples showing the importance of power 
series in various parts of mathematics. 


§1. BASIC PROPERTIES FOR POLYNOMIALS 
IN ONE VARIABLE 

We start with the Euclidean algorithm. 

Theorem 1.1. Let A be a commutative ring , let /, g e A[X~\ be poly- 
nomials in one variable , of degrees ^ 0, and assume that the leading 
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coefficient of g is a unit in A. Then there exist unique polynomials 
q, r e A [X] such that 

f = gq + r 

and deg r < deg g. 

Proof. Write 

f(X) = a n X n + --' + a 0 , 
g(X) = b d X d + ••• + &()» 

where n = deg/, d = deg g so that a n , b d ^0 and b d is a unit in A. We use 
induction on n. 

If n = 0, and deg g > deg /, we let q = 0, r = / If deg # = deg / = 0, then 
we let r = 0 and g = a n bf l . 

Assume the theorem proved for polynomials of degree < n (with n>0). 
We may assume deg g ^ deg / (otherwise, take q = 0 and r = f). Then 

f(X) = a n bf'X n - i g(X)+f i(n 

where /iíA") has degree < n. By induction, we can find r such that 

f(X) = a n bf ì X n - d g(X) + ^(XM*) + r(*) 

and deg r < deg g. Then we let 

q(X) = a n bf'X n - d + q x (X) 

to conclude the proof of existence for q, r. 

As for uniqueness, suppose that 

f = <h9 + ri= 9 + r 2 

with deg r x < deg g and deg r 2 < deg g. Subtracting yields 

(<h - 9i)g = r 2 - ri . 

Since the leading coefficient of g is assumed to be a unit, we have 
deg (q t - q 2 )g = deg^! - q 2 ) + deg g. 

Since deg(r 2 — rj < deg g, this relation can hold only if qi~q 2 = 0, i.e. 
= q 2 , and hence finally r x = r 2 as was to be shown. 

Theorem 1.2. Let k be a field. Then the polynomial ring in one variable 
/c[A"] is principal. 
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Proof. Let a be an ideal of k\_X ], and assume a # 0. Let g be an 
element of a of smallest degree ^ 0. Let / be any element of a such that 
/ # 0. By the Euclidean algorithm we can find q, r e /c[X] such that 

f = qg + r 

and deg r < deg g. But r = f — qg, whence r is in a. Since g had minimal 
degree ^ 0 it follows that r = 0, hence that a consists of all polynomials qg 
(with q e /c[AT]). This proves our theorem. By Theorem 5.2 of Chapter II we 
get: 

Corollary 1 . 3 . The ring /c[A"] is factorial. 

If k is a field then every non-zero element of k is a unit in k, and one sees 
immediately that the units of k[X ] are simply the units of k. (No polyno- 
mial of degree ^ 1 can be a unit because of the addition formula for the 
degree of a product.) 

A polynomial f(X) e /c[2f] is called irreducible if it has degree ^ 1, and if 
one cannot write f(X) as a product 

f(X) = g(X)h(X) 

with g, h e k[X], and both g, h £ k. Elements of k are usually called constant 
polynomials, so we can also say that in such a factorization, one of g or h must 
be constant. A polynomial is called monic if it has leading coefficient 1 . 

Let A be a commutative ring and f(X) a polynomial in A[Xf Let A be 
a subring of B. An element b e B is called a root or a zero of / in B if 
f(b) = 0. Similarly, if (2f) is an n-tuple of variables, an n-tuple (b) is called a 
zero of / if f(b) = 0. 

Theorem 1 . 4 . Let k be a field, and f a polynomial in one variable X in 

k[X~\, of degree n ^ 0. Then f has at most n roots in k, and if a is a root 

of f in k, then X — a divides f(X). 

Proof. Suppose f(á) = 0. Find q, r such that 

f(X) = q(X)(X -a) + r(X) 


and deg r < 1. Then 

0 =f(a) = r(a). 

Since r = 0 or r is a non-zero constant, we must have r = 0, whence X — a 
divides f(X). If a x ,..., a m are distinct roots of / in k, then inductively we see 
that the product 


(X-aJ-tf-aJ 
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divides f(X ), whence m ^ n, thereby proving the theorem. The next corollaries 
give applications of Theorem 1.4 to polynomial functions. 

Corollary 1.5. Let k be a field and T an infinite subset of k. Let 
f(X) e k[X~\ be a polynomial in one variable. If f(a) = 0 for all ae T, then 
/ = 0, i.e. f induces the zero function. 

Corollary 1.6. Let kbe a field , and let S í9 ..., S n be infinite subsets of k. 
Let f(X t ,..., X n ) be a polynomial in n variables over k. If f(a { ,..., a n ) = 0 
for all a { e S t (i = 1,..., n), then f = 0. 

Proof. By induction. We have just seen the result is true for one 
variable. Let n ^ 2, and write 

f(x u ...,x„) = Y d f i (x l ,...,x n _ i )xi 

j 

as a polynomial in X n with coefficients in k{_X ì9 ..., X n _f\. If there exists 
(b í ,...,b.- í )eS 1 x ••• x S„_, 
such that for some j we have ,..., b ^) # 0, then 

f(b í9 ... 9 b n - í9 X n ) 

is a non-zero polynomial in k[X n \ which takes on the value 0 for the infinite 
set of elements S n . This is impossible. Hence f induces the zero function on 
Si x • • • x S n _j for all j 9 and by induction we have /• = 0 for all j. Hence 
/ = 0, as was to be shown. 

Corollary 1.7. Let k be an infinite field and f a polynomial in n variables 
over k. If f induces the zero function on k {n) 9 then f = 0. 

We shall now consider the case of finite fields. Let k be a finite field with 
q elements. Let f(X í9 ..., X n ) be a polynomial in n variables over k. Write 

f(x ì9 ... 9 x n ) = Y j a (v) xi^-x;\ 

If a (v) # 0, we recall that the monomial M {v) (X) occurs in /. Suppose this is 
the case, and that in this monomial M {V) (X) 9 some variable X { occurs with an 
exponent v f ^ q. We can write 

X? = X^ 9 p = integer è 0. 

If we now replace Xp by A/ +1 in this monomial, then we obtain a new 
polynomial which gives rise to the same function as /. The degree of this 
new polynomial is at most equal to the degree of /. 
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Performing the above operation a finite number of times, for all the 
monomials occurring in / and all the variables X l9 ..., X n we obtain some 
polynomial /* giving rise to the same function as / but whose degree in 
each variable is < q. 

Corollary 1.8. Let k be a finite field mth q elements. Let f be a 

polynomial in n variables over k such that the degree of f in each variable 

is <q. If f induces the zero function on k in \ then f = 0. 

Proof. By induction. If n = 1, then the degree of / is < q 9 and hence / 
cannot have q roots unless it is 0. The inductive step is carried out just as 
we did for the proof of Corollary 1.6 above. 

Let / be a polynomial in n variables over the finite field k. A polynomial 
g whose degree in each variable is < q will be said to be reduced. We have 
shown above that there exists a reduced polynomial /* which gives the same 
function as / on k in) . Theorem 1.8 now shows that this reduced polynomial is 
unique. Indeed, if g ì9 g 2 are reduced polynomials giving the same function, 
then — g 2 is reduced and gives the zero function. Hence g Y — g 2 = 0 and 
01 = 02 * 

We shall give one more application of Theorem 1.4. Let k be a field. By 
a multiplicative subgroup of k we shall mean a subgroup of the group /c* 
(non-zero elements of k). 

Theorem 1.9. Let k be a field and let U be a finite multiplicative sub- 

group of k. Then U is cyclic. 

Proof. Write U as a product of subgroups U(p) for each prime p, where 
U(p) is a p-group. By Proposition 4.3(vi) of Chapter I, it will suffice to prove 
that U(p) is cyclic for each p. Let a be an element of U(p) of maximal period 
p r for some integer r. Then x pr = 1 for every element x e U(p\ and hence all 
elements of U(p) are roots of the polynomial 

X pr - 1. 

The cyclic group generated by a has p r elements. If this cyclic group is not 
equal to U(p\ then our polynomial has more than p r roots, which is 
impossible. Hence a generates U (p), and our theorem is proved. 

Corollary 1.10. If k is a finite field , then /c* is cyclic. 

An element Ç in a field k such that there exists an integer n ^ 1 such that 
C" = 1 is called a root of unity, or more precisely an n- th root of unity. Thus 
the set of n-th roots of unity is the set of roots of the polynomial X n — 1. 
There are at most n such roots, and they obviously form a group, which is 
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cyclic by Theorem 1.9. We shall study roots of unity in greater detail 
later. A generator for the group of n-th roots of unity is called a primitive 
n-th root of unity. For example, in the complex numbers, e 2ni/n is a primi- 
tive n-th root of unity, and the n-th roots of unity are of type e 2niv,n with 
1 ^ v ^ n. 

The group of roots of unity is denoted by p. The group of roots of unity 
in a field K is denoted by p (K). 

A field k is said to be algebraically closed if every polynomial in /c[AT] of 
degree ^ 1 has a root in k. In books on analysis, it is proved that the 
complex numbers are algebraically closed. In Chapter V we shall prove that 
a field k is always contained in some algebraically closed field. If k is 
algebraically closed then the irreducible polynomials in /c[A] are the poly- 
nomials of degree 1. In such a case, the unique factorization of a polynomial 
/ of degree ^ 0 can be written in the form 

nx ) = c fi (x - «,r 

í=i 

with c e k, c # 0 and distinct roots <x í9 ..., a r . We next develop a test when 
> 1 . 

Let A be a commutative ring. We define a map 

D: A[_X]-+ AIX'] 

of the polynomial ring into itself. If f(X) = a n X n -h ••• -h a 0 with e A, we 
define the derìvative 


Df(X) =f(X) = £ va v A' v_1 = na„A r " _1 + ••• + «,. 

v = l 

One verifies easily that if / g are polynomials in A[X~\ , then 
(/ + 0)' = /' + 0', (fgï = f'g + fg'. 


and if a e A, then 


(afï = af’. 


Let K be a field and / a non-zero polynomial in K[X]. Let a be a root 
of / in À\ We can write 

f(X) = (X- a) m g(X) 

with some polynomial g(X) relatively prime to X — a (and hence such that 
g(a) # 0). We call m the multiplicity of a in /, and say that a is a multiple 
root if m > 1. 
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Proposition 1.11. Let K , / be as above. The element a of K is a multiple 
root of f if and only if it is a root and f'(a) = 0. 

Proof Factoring / as above, we get 

f'(X) = (X- a) m g'(X) + m(X - aT~'g(X). 

If m > 1, then obviously f'(a) = 0. Conversely, if m = 1 then 
f'(X) = (X- a)g'(X) + g(X\ 

whence f'(a) = g(a) # 0. Hence if f'(a) = 0 we must have m > 1, as desired. 

Proposition 1.12. Let f e K[Xf If K has characteristic 0, and f has 
degree ^ 1, then f' # 0. Let K have characteristic p > 0 and f have 

degree ^ 1. Then f' = 0 if and only if in the expression for f(X) given 

by 

f(X) = f 

V = 1 

p divides each integer v such that a v ^ 0. 

Proof If K has characteristic 0, then the derivative of a monomial a v X v 
such that v ^ 1 and a v # 0 is not zero since it is va v X v ~ l . If K has 

characteristic p > 0, then the derivative of such a monomial is 0 if and only if 

p|v, as contended. 

Let K have characteristic p > 0, and let / be written as above, and be 
such that f'(X) = 0. Then one can write 


d 


f(X) = X b.X^ 
/ì=i 


with b^ e K. 

Since the binomial coefficients 
see that if K has characteristic p, then for a, b e K we have 


are divisible by p for 1 ^ v ^ p — 1 we 


(a + b) p = a p + b p . 


Since obviously (ab) p = a p b p , the map 

xi— >x p 

is a homomorphism of K into itself, which has trivial kernel, hence is 
injective. Iterating, we conclude that for each integer r ^ 1, the map x\-+x pV 
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is an endomorphism of X, called the Frobenius endomorphism. Inductively, if 
c c n are elements of X, then 

(C! +--- + c n y = cf+ ••• + <£. 

Applying these remarks to polynomials, we see that for any element ae K 
we have 

(X - 0)^ = x* r - «r 


If c e X and the polynomial 

X^-c 

has one root úí in X, then = c and 

- c = (X - «r. 

Hence our polynomial has precisely one root, of multiplicity p r . For in- 
stance, (X - 1 ) pr = X^ - 1. 


§2. POLYNOMIALS OVER A FACTORIAL RING 

Let A be a factorial ring, and X its quotient field. Let ae K, a ^ 0. We 
can write ú as a quotient of elements in A , having no prime factor in 
common. If p is a prime element of /1, then we can write 

û = p% 

where b e X, r is an integer, and p does not divide the numerator or 
denominator of b. LFsing the unique factorization in A , we see at once that r 
is uniquely determined by a , and we call r the order of a at p (and write 
r = ord p a). If a = 0, we define its order at p to be 00 . 

If a, a' e X and aa' # 0, then 

ord p (aa') = ord p a + ord p û'. 

This is obvious. 

Let /(X) e X[X] be a polynomial in one variable, written 
/(X) = a 0 + a x X + • * • + a n X n . 


If / = 0, we define ord p / to be 00 . If / # 0, we define ord p / to be 
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ord p / = min ord p a h 

the minimum being taken over all those i such that a t # 0. 

If r = ord p fi we call up r a p-content for /, if u is any unit of A. We define 
the content of / to be the product. 


n p ordpf > 

the product being taken over all p such that ord p / # 0, or any multiple of 

this product by a unit of A. Thus the content is well defined up to 

multiplication by a unit of A. We abbreviate content by cont. 

If b e X, b # 0, then cont (bf) = b cont(/). This is clear. Hence we can 

write 


f(X) = c-MX) 

where c = cont(/), and ffX) has content 1. In particular, all coefficients of 
fi lie in A, and their g.c.d. is 1. We define a polynomial with content 1 to be 

a primitive polynomial. 

Theorem 2.1. (Gauss Lemma). Let A be a factorial ring, and let K be 
its quotient field. Let fige K[X] be polynomials in one variable. Then 

cont (fg) = cont(/) cont(gf). 

Proof. Writing / = cfi and g = dg x where c = cont(/) and d = cont (g) 9 
we see that it suffices to prove: If /, g have content 1, then fg also has 
content 1 , and for this, it suffices to prove that for each prime p, ord p (fg) = 0. 
Let 

f(X) = a n X n + -- + a 0 , a n *0, 

g(X) = b m X m + °- + b 09 b m * 0, 

be polynomials of content 1. Let p be a prime of A. It will suffice to prove 
that p does not divide all coefficients of fg. Let r be the largest integer such 
that 0 ^ r ^ n, a r # 0, and p does not divide a r . Similarly, let b s be the 
coefficient of g farthest to the left, b s # 0, such that p does not divide b s . 
Consider the coefficient of X r+S in f(X)g(X). This coefficient is equal to 

C a r^s ”1” a r+í^s —1 T" 

+ a r-í^s+l + **’ 

and p\a r b s . However, p divides every other non-zero term in this sum since 
in each term there will be some coefficient a^ to the left of a r or some 
coefficient bj to the left of b s . Hence p does not divide c, and our lemma is 
proved. 
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We shall now give another proof for the key step in the above argument, 
namely the statement: 

If f 9 g e A\_X~\ are primitive (i.e. have content 1) then fg is primitive. 

Proof. We have to prove that a given prime p does not divide all the 
coefficients of fg. Consider reduction mod p, namely the canonical homo- 
morphism A -► A/(p) = Â. Denote the image of a polynomial by a bar, so 
fb+f and g*->g under the reduction homomorphism. Then 

Jg=fg * 

By hypothesis, / # 0 and g # 0. Since A is entire, it follows that fg # 0, as 
was to be shown. 

Corollary 2.2. Let f(X) e A[X\ have a factorization f(X) = g(X)h(X) in 
K[X\. If c g = cont(gf), c h = cont (h), and g = c g g í9 h = c h h u then 

f(X) = 

and c g c h is an element of A. In particular , if f g e A[X\ have content 1, 
then he A [X] also. 

Proof The only thing to be proved is c g c h e A. But 
cont(/) = cont (g^hf) = c tf c*, 

whence our assertion follows. 

Theorem 2.3. Let A be a factorial ring. Then the polynomiaì ring A[X\ 
in one variable is factorial. Its prime elements are the primes ofA and poly- 
nomials in A[X] which are irreducible in K[X] and have content 1. 

Proof. Let / e A[X\, f ^ 0. Using the unique factorization in K[X\ 
and the preceding corollary, we can fìnd a factorization 

f(X) = c- Pí (X)--p r (X) 

where c e A, and p l9 ..., p r are polynomials in A[X\ which are irreducible in 
K[X\. Extracting their contents, we may assume without loss of generality 
that the content of Pi is 1 for each i. Then c = cont(/) by the Gauss lemma. 
This gives us the existence of the factorization. It follows that each Pi (X) is 
irreducible in A[X]. If we have another such factorization, say 

f(X) = d qi (X)-q s (X), 

then from the unique factorization in K[X\ we conclude that r = s, and after 
a permutation of the factors we have 


Pi = Qiqi 
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with elements a { e K. Since both p h are assumed to have content 1, it 
follows that a { in fact lies in A and is a unit. This proves our theorem. 

Corollary 2.4. Let A be a factorial ring. Then the ring of polynomials in 
n variables A\_X Í9 ..., X n ~\ is factorial. Its units are precisely the units of 
A, and its prime elements are either primes of A or polynomials which are 
irreducible in K\_X~\ and have content 1. 

Proof. Induction. 

In view of Theorem 2.3, when we deal with polynomials over a factorial 
ring and having content 1, it is not necessary to specify whether such 
polynomials are irreducible over A or over the quotient field K. The two 
notions are equivalent. 

Remark 1. The polynomial ring K\X U ..., X n \ over a fìeld K is not 
principal when n ^ 2. For instance, the ideal generated by X i9 ..,,X n is not 
principal (trivial proof). 

Remark 2. It is usually not too easy to decide when a given polynomial 
(say in one variable) is irreducible. For instance, the polynomial X 4 -h 4 is 
reducible over the rational numbers, because 

X 4 + 4 = (X 2 - 2X + 2)(X 2 + 2X + 2). 

Later in this book we shall give a precise criterion when a polynomial 
X n — a is irreducible. Other criteria are given in the next section. 


§3. CRITERIA FOR IRREDUCIBILITY 

The first criterion is: 

Theorem 3.1. (Eisenstein’s Criterion). Let A be a factorial ring. Let K 
be its quotient field. Let f(X) = a n X n -f • • • + a 0 be a polynomial of degree 
n^. 1 in A\X\. Let p be a prime of A, and assume: 

a n ^0 (mod p), a t = 0 (mod p) for all i < n, 
a o ^0 (mod p 2 ). 


Then f(X) is irreducible in K\X\. 
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Proof. Extracting a g.c.d. for the coefficients of /, we may assume 
without loss of generality that the content of / is 1. If there exists a 
factorization into factors of degree ^ 1 in K[X~\, then by the corollary of 
Gauss’ lemma there exists a factorization in A[X\ , say f(X) = g(X)h(X ), 

g(X) = b d X â + - + bo 9 

h(X) = c m X m + -- + c 0 , 

with d, m ^ 1 and b d c m # 0. Since b 0 c 0 = a 0 is divisible by p but not p 2 , it 
follows that one of b 0 , c 0 is not divisible by p, say b 0 . Then p|c 0 . Since 
c m b d = a n is not divisible by p, it follows that p does not divide c m . Let c r be 
the coefficient of h furthest to the right such that c r # 0 (mod p). Then 

a r = b 0 c r + biC r _! + . 

Since p\b 0 c r but p divides every other term in this sum, we conclude that 
p\a r , a contradiction which proves our theorem. 

Example. Let a be a non-zero square-free integer # ± 1. Then for any 
integer n ^ 1, the polynomial X n — a is irreducible over Q. The polynomials 
3X 5 — 15 and 2X Í0 — 21 are irreducible over Q. 

There are some cases in which a polynomial does not satisfy Eisenstein’s 
criterion, but a simple transform of it does. 


Example. Let p be a prime number. Then the polynomial 
f(X) = X p ~ l +•••+ 1 

is irreducible over Q. 

Proof. It will suffice to prove that the polynomial f(X + 1) is irreducible 
over Q. We note that the binomial coefficients 


fp) = Eí 

W v!(p — v)f 


l^vgp-1, 


are divisible by p (because the numerator is divisible by p and the denomina- 
tor is not, and the coefficient is an integer). We have 

, „_(x + iy-i -X’ + pX '- 1 + - +px 

+ } (X + 1) - 1 X 

from which one sees that f(X + 1) satisfies Eisenstein’s criterion. 


Example. Let E be a field and t an element of some field containing E such 
that t is transcendental over E. Let K be the quotient field of E[t]. 
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For any integer n ^ 1 the polynomial X n — t is irreducible in K[_X~\. This 
comes from the fact that the ring A = E[t\ is factorial and that t is a prime 
in it. 

Theorem 3.2. (Reduction Criterion). Let A, B be entxre rings, and /eí 

(p: A -» B 

be a homomorphism. Let K, L bc í/ic quotient fields of A and B respec- 
tively. Let f e A [X\ be such that cpf # 0 and deg <pf = deg /. // cpf is 
irreducible in L[X\ then f does not have a factorization f(X) = g(X)h(X) 
mth 

g, he A [Z] and deg g, deg ì. 

Proof. Suppose / has such a factorization. Then q>f = (cpg)((ph). Since 
deg (pg ^ deg g and deg (ph ^ deg h, our hypothesis implies that we must 
have equality in these degree relations. Hence from the irreducibility in 
L[X\ we conclude that g or h is an element of A, as desired. 

In the preceding criterion, suppose that A is a local ring, i.e. a ring having 
a unique maximal ideal p, and that p is the kernel of q>. Then from the 
irreducibility of (pf in L[X\ we conclude the irreducibility of / in A[X\. 
Indeed, any element of A which does not lie in p must be a unit in A, so our 
last conclusion in the proof can be strengthened to the statement that g or h 
is a unit in A. 

One can also apply the criterion when A is factorial, and in that case 
deduce the irreducibility of / in K[X\. 

Example. Let p be a prime number. It will be shown later that 
X p — X — 1 is irreducible over the field Z/pZ. Hence X p — X — 1 is irreduc- 
ible over Q. Similarly, 

X 5 - 5X* - 6X - 1 

is irreducible over Q. 

There is also a routine elementary school test whether a polynomial has a 
root or not. 

Proposition 3.3. (Integral Root Test). Let A be a factorial ring and K 
its quotient field. Let 

f(X) = a n X n + --• + a 0 e A [X\. 

Let oce K be a root of f mth oc = b/d expressed mth b, de A and b, d 
relatively prime. Then b\a 0 and d\a n . In particular, if the leading coefficient 
a n is 1, then a root oc must lie in A and divides a 0 . 
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We leave the proof to the reader, who should be used to this one from way 
back. As an irreducibility test, the test is useful especially for a polynomial of 
degree 2 or 3, when reducibility is equivalent with the existence of a root in 
the given field. 


§4. HILBERrS THEOREM 

This section proves a basic theorem of Hilbert concerning the ideals of a 
polynomial ring. We define a commutative ring A to be Noetherian if every 
ideal is finitely generated. 


Theorem 4.1. Let A be a commutative Noetherian ring. Then the polyno- 
mial ring A[X ] is also Noetherian. 

Proof. Let 91 be an ideal of A[X]. Let a t consist of 0 and the set of elements 
a e A appearing as leading coeffìcient in some polynomial 

a Q + a x X + ••• + aX * 

lying in 9Í. Then it is clear that a f is an ideal. (If a, b are in a f , then a±b is 
in a f as one sees by taking the sum and difference of the corresponding 
polynomials. If x e A, then xa e a, as one sees by multiplying the corre- 
sponding polynomial by x.) Furthermore we have 


a 0 c a! <= a 2 c= 


in other words, our sequence of ideals {aj is increasing. Indeed, to see this 
multiply the above polynomial by X to see that a e a i+ì . 

By criterion (2) of Chapter X, §1, the sequence of ideals {aj stops, say at 
a r : 

û 0 c a x c a 2 c c a r = a r+1 = • ••. 

Let 


a 0 i , • • •, a 0no be generators for a 0 , 


a rì ,..., a rny be generators for a r . 

For each i = 0, ..., r and j = 1, ..., n { E let fj be a polynomial in 91, of degree 
U with leading coefficient a u . We contend that the polynomials f i} are a set 
of generators for 91. 

Let / be a polynomial of degree d in 91. We shall prove that / is in the 
ideal generated by the f ip by induction on d. Say d ;> 0. If d > r, then we 
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note that the leading coefïìcients of 


v 'd—rf yi-rf 

A Jr 1> •••> A Jrn r 

generate a d . Hence there exist elements c lt c n e A such that the 
polynomial 

/ - Cí X d ~ r f rí - • • • - c n X d ~ r f rnr 

has degree <d , and this polynomial also lies in 91. If d ^ r, we can subtract 
a linear combination 

/ ™ C ìfdì “ * “ “ C n d fin d 

to get a polynomial of degree < d , also lying in 91. We note that the 
polynomial we have subtracted from / lies in the ideal generated by the f iy 
By induction, we can subtract a polynomial g in the ideal generated by the 
fj such that / — g = 0, thereby proving our theorem. 

We note that if cp\ A -► B is a surjective homomorphism of commutative 
rings and A is Noetherian, so is B. Indeed, let b be an ideal of B , so <p -1 (b) 
is an ideal of A. Then there is a finite number of generators (a Ì9 . . . , a n ) for 
(p~ l (b), and it follows since (p is surjective that b = <p(<p -1 (b)) is generated by 
(p(a x \ ..., </>(«„), as desired. As an application, we obtain: 

Corollary 4.2. Lef A be a Noetherian commutative ring f and let B = 
A[x ì9 ... , x m ] be a commutative ring finitely generated over A. Then B is 
Noetherian. 

Proof. Use Theorem 4.1 and the preceding remark, representing B as a 
factor ring of a polynomial ring. 

Ideals in polynomial rings will be studied more deeply in Chapter IX. 
The theory of Noetherian rings and modules will be developed in Chapter X. 


§5. PARTIAL FRACTIONS 

In this section, we analyze the quotient field of a principal ring, using the 
factoriality of the ring. 

Theorem 5.1. Let A be a principal entire ring , and let P be a set of 
representatives for its irreducible elements. Let K be the quotient field of 
A 9 and let oc e K. For each peP there exists an element oc p e A and an 
integer j(p) ^ 0, such that j(p) = 0 for almost all p e P 9 oc p and p jip) are 
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relatively prime , and 


V a p 


If we have another such expression 


— V 

a " L n i(p)’ 
peP P 

then j(p) = i(p) for all p, and ot p = p p mod p jip) for all p. 

Proof We fìrst prove existence, in a special case. Let a , b be rela- 
tively prime non-zero elements of A. Then there exists x, y e A such that 
xa -h yb = 1. Hence 

1 x y 
ab b a 


Hence any fraction c/ab with ce A can be decomposed into a sum of two 
fractions (namely cx/b and cy/a) whose denominators divide b and a respec- 
tively. By induction, it now follows that any a e K has an expression as 
stated in the theorem, except possibly for the fact that p may divide ol p . 
Canceling the greatest common divisor yields an expression satisfying all the 
desired conditions. 

As for uniqueness, suppose that a has two expressions as stated in the 
theorem. Let q be a fixed prime in P. Then 

^q Pq _ y fip ^p 

n j(q) n i(Q) ~~ n i(p) n j(p )' 

H H pïq P P 

If j(q) = i(q) = 0, our conditions concerning q are satisfied. Suppose one of 
j(q) or i(q) > 0, say j(q\ and say j(q) ^ i(q). Let d be a least common multiple 
for all powers p j(p) and p l{p) such that p # q. Multiply the above equation by 
dq m . We get 

d(oi q - q j(q) ~ i{q) p q ) = q j(q) p 


for some e A. Furthermore, q does not divide d. If i(q) < j(q) then q 
divides 0 L q , which is impossible. Hence i(q) = j(q). We now see that q j{q) 
divides 0L q — P q9 thereby proving the theorem. 

We apply Theorem 5.1 to the polynomial ring k[X~\ over a field k. We 
let P be the set of irreducible polynomials, normalized so as to have leading 
coefficient equal to 1. Then P is a set of representatives for all the irreduc- 
ible elements of /c[X]. In the expression given for a in Theorem 5.1, we can 
now divide ol p by p j{p) , i.e. use the Euclidean algorithm, if deg ol p ^ deg p j{p) . 
We denote the quotient field of /c[X] by k(X) 9 and call its elements rational 
functions. 
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Theorem 5.2. Let A = k[X~\ be the polynomial ring in one variable over a 
field k. Let P be the set of irreducible polynomials in k\_X\ with leading 
coejficient 1. Then any element f of k(X) has a unique expression 

/W = p Ç P p§^ + 3W ’ 

where f p , g are polynomials , f p = 0 if j(p) = 0, f p is relatively prime to p if 
j(p) > 0, and degf p < deg p jip) if j(p) > 0. 

Proof The existence follows at once from our previous remarks. The 
uniqueness follows from the fact that if we have two expressions, with 
elements f p and <p p respectively, and polynomials g , h, then p jip) divides 
f P ~ whence f p - % = 0, and therefore f p = q> p , g = h. 

One can further decompose the term f p /p Ap) by expanding f p according to 
powers of p. One can in fact do something more general. 

Theorem 5.3. Let k be a field and k\_X\ the polynomial ring in one 
variable. Let f g e k[X\, and assume deg g 1. Then there exist unique 
polynomials 

/oJi. 

such that deg f < deg g and such that 

f = fo + /ì G + “ * + fdQ d • 

Proof. We first prove existence. If deg g > deg f then we take / 0 = / 
and f = 0 for i > 0. Suppose deg g ^ deg f. We can find polynomials q , r 
with deg r < deg g such that 

f = qg + r, 

and since deg g ^ 1 we have deg q < deg /. Inductively, there exist polyno- 
mials h 0 , h í9 h s such that 

q = h 0 + h t g + ■■■ + h s g s , 

and hence 

f = r + h 0 g + ■■■ + h s g s+1 , 

thereby proving existence. 

As for uniqueness, let 

f - fo + fiQ + *** + fd9 d - <Po + <Pi9 “í-+ <PmQ m 


be two expressions satisfying the conditions of the theorem. Adding terms 
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equal to 0 to either side, we may assume that m = d. Subtracting, we get 
0 = (/o “ <Po) + “ * + (fd — <Pd)Q d - 

Hence g divides f 0 - cp 0 , and since deg(/ 0 - (p 0 ) < deg g we see that / 0 = cp 0 . 
Inductively, take the smallest integer i such that / # <p t (if such i exists). 
Dividing the above expression by g l we find that g divides / — (p t and hence 
that such i cannot exist. This proves uniqueness. 

We shall call the expression for / in terms of g in Theorem 5.3 the g- adic 
expansion of /. If g(X) = X 9 then the g-adic expansion is the usual expres- 
sion of / as a polynomial. 

Remark. In some sense, Theorem 5.2 redoes what was done in Theorem 
8.1 of Chapter I for Q/Z; that is, express explicitly an element of K/A as a 
direct sum of its p-components. 


§6. SYMMETRIC POLYNOMIALS 

Let A be a commutative ring and let t ì9 ... 9 t n be algebraically indepen- 
dent elements over A. Let X be a variable over A[t í9 ... 9 t n ]. We form the 
polynomial 

F(X) = (X-t í )---(X-t n ) 

= X*- Sl X*- 1 +••• + (- l) n s n9 

where each = Si(t ì9 ..., t n ) is a polynomial in t í9 ..., t n . Then for instance 
s i = h + ‘'' + t n and s n = t^ • • • t n . 

The polynomials s í9 ..., s n are called the elementary symmetric polynomials 

°f 11 J • • • 5 t w . 

We leave it as an easy exercise to verify that s t is homogeneous of degree i 
in t ì9 ..., t n . 

Let (j be a permutation of the integers (1, ... 9 n). Given a polynomial 
f(t) e A[t] = A[t Ì9 ..., t n ] 9 we define cr/to be 

^/(t 1» • • • t w ) — /( ^cr( 1 >* • • • » to■(/!>)• 

If cr, r are two permutations, then crr/ = cr(r/) and hence the symmetric group 
G on n letters operates on the polynomial ring A[t]. A polynomial is called 
symmetric if cr/ = / for allcreG.lt is clear that the set of symmetric 
polynomials is a subring of A[t] 9 which contains the constant polynomials 
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(i.e. A itself) and also contains the elementary symmetric polynomials s 1? ...,s n . 
We shall see below that ^[s^,..., s n ] is the ring of symmetric polynomials. 
Let X ì9 ... 9 X m be variables. We define the weight of a monomial 

x; 1 - jr;- 

to be Vj + 2 v 2 + ••• + nv n . We define the weight of a polynomial 
g(X i9 ..., X n ) to be the maximum of the weights of the monomials occurring 
in g. 

Theorem 6.1. Let f(t) e A[t í9 ..., íj be symmetric of degree d. Then 
there exists a polynomial g(X ì9 ..., X n ) of weight ^ d such that 

f(t) = g(s í9 ... 9 s n ). 

Proof By induction on n. The theorem is obvious if n = 1, because 

Si = íj. 

Assume the theorem proved for polynomials in n — 1 variables. 

If we substitute t n = 0 in the expression for F(X), we find 

(X- tì )--(X- t n - x )X = X n - ( Sí ) oX"- 1 +••• + (- irHsn-i)oX, 

where (s^q is the expression obtained by substituting t n = 0 in s f . We see 
that (sJq, ..., (Sn-Jo are precisely the elementary symmetric polynomials in 

1 1, • • • > t n ~i . 

We now carry out induction on d. If d = 0, our assertion is trivial. 
Assume d > 0, and assume our assertion proved for polynomials of degree 
< d. Let /(íj,..., íj have degree d. There exists a polynomial 
gi(X ì9 ..., X n _ Y ) of weight ^ d such that 

f(t 1 > • • • > t n - 1 , 0) = ^((Sj )o> • • • > ( S n- 1 )o)* 

We note that gfs l9 ..., s n _ x ) has degree ^ d in t l9 ..., t n . The polynomial 
fi(ti,"-,t n ) = f(t ì9 ... 9 t n )-g ì (s í ,... 9 s n - ì ) 
has degree ^ d (in t í9 ..., t n ) and is symmetric. We have 

fi(tii • • •, t n —i , 0 ) = 0 . 

Hence f { is divisible by t n9 i.e. contains t n as a factor. Since / x is symmetric, 
it contains tí • • • t n as a factor. Hence 

/l = 5„/ 2 (í,, t„) 

for some polynomial / 2 , which must be symmetric, and whose degree is 
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^d — n <d. By induction, there exists a polynomial g 2 in n variables and 
weight d — n such that 

f 2 (tu--,tn) = 92(s 1 ,...,s n ). 

We obtain 

f(t) = G\ ( S 1» S n-1 ) "F S h£? 2( S 1» *•'» S n)> 

and each term on the right has weight ^ d. This proves our theorem. 

We shall now prove that the elementary symmetric polynomials s ì9 ... 9 s n 
are algebraically independent over A. 

If they are not, take a polynomial f(X l9 ..., X n ) e A[X~\ of least degree 
and not equal to 0 such that 

f(s 1 »•••> s n) = 0. 

Write / as a polynomial in X n with coefficients in A[X U ..., X n - x \ 

f(x 1 ,..., X n ) = ux, ,...,x n _ í ) + -- + fAXu--->x n - i )xt- 

Then / 0 # 0. Otherwise, we can write 

f(X) = xmx) 


with some polynomial \j/ 9 and hence s n \l/(s l9 ..., s n ) = 0. From this it follows 
that ^(s l5 ..., s„) = 0, and ip has degree smaller than the degree of /. 

We substitute s f for X { in the above relation, and get 

0 = /o(s,, ..., S„_!) + • • • + f d (Si , ■ • •, s„_, )sf 

This is a relation in A[t ì9 ... 9 t n ~\, and we substitute 0 for t n in this relation. 
Then all terms become 0 except the first one, which gives 

0 = /o(( s i)o> ••• > ( s n-i)o)> 

using the same notation as in the proof of Theorem 6.1. This is a non-trivial 
relation between the elementary symmetric polynomials in t í9 ... 9 t n _ ì9 a 
contradiction. 

Example. (The Discriminant). Let f(X) = (X — t t ) ••• (X — t n ). Con- 
sider the product 

s(t )=n (ti - tj)- 

i<j 

For any permutation cr of (1,..., n) we see at once that 


s°(t)= ±ô(t). 
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Hence ô(t) 2 is symmetric, and we call it the discriminant: 

D f = D(s 1 ,...,s„) = Y\(t i - tj) 2 . 

i<j 

We thus view the discriminant as a polynomial in the elementary symmetric 
functions. For a continuation of the general theory, see §8. We shall now 
consider special cases. 

Quadratic case. You should verify that for a quadratic polynomial 
f(X) = X 2 + bX + c, one has 

D = b 2 - 4c. 

Cubic case. Consider f(X) = X 3 + aX + b. We wish to prove that 

D = —4a 3 — 21b 2 . 

Observe first that D is homogeneous of degree 6 in t í9 t 2 . Furthermore, a is 
homogeneous of degree 2 and b is homogeneous of degree 3. By Theorem 
6.1 we know that there exists some polynomial g(X 2 ,X 3 ) of weight 6 such 
that D = g(a , b). The only monomials X^X” of weight 6, i.e. such that 
2m -f 3n = 6 with integers m, n ^ 0, are those for which m = 3, n = 0, or 
m = 0 and n = 2. Hence 

g(X 2 , x 3 ) = vXl + wY 3 2 

where u, w are integers which must now be determined. 

Observe that the integers v, w are universal, in the sense that for any 
special polynomial with special values of a , b its discriminant will be given 
by g(a, b) = va 3 + wò 2 . 

Consider the polynomial 

h(X) = X(X - Ì)(X + 1) = X 3 — X. 

Then a = — 1, b = 0, and D = —va 3 = —v. But also D = 4 by using the 
definition of the discriminant of the product of the differences of the roots, 
squared. Hence we get v = — 4. Next consider the polynomial 


f 2 (X) = X 3 -Ì. 


Then a = 0, b = —1, and D = 2b 2 = w. But the three roots of / 2 are the 
cube roots of unity, namely 

, -l+v+3 -1-V=3 

’ 2 ’ 2 

Using the definition of the discriminant we find the value D = —27. Hence 
we get w = — 27. This concludes the proof of the formula for the dis- 
criminant of the cubic when there is no X 2 term. 
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In general, consider a cubic polynomial 

f(X) = X 3 - Sl X 2 + s 2 X - s 3 = (X - t 2 )(X - t 2 )(X - t 3 ). 

We find the value of the discriminant by reducing this case to the simpler 
case when there is no X 2 term. We make a translation, and let 

y = x-i Sl so x= y + i Sl = y + i(t! + t 2 + t 3 )- 

Then f(X) becomes 

f(X) = f*(Y) = Y 3 + aY + b = (Y - u,)(Y - u 2 )(Y - u 3 ), 

where a = u^u 2 + u 2 u 3 + u^u^ and b = — u^u^u^, while u t + u 2 + u 3 = 0. 
We have 

Ui = ti - ^ for i = 1, 2, 3, 

and u t — Uj = — tj for all i # j, so the discriminant is unchanged, and you 

can easily get the formula in general. Do Exercise 12(b). 


§7. MASON-STOTHERS THEOREM AND THE 
abc CONJECTURE 

In the early 80s a new trend of thought about polynomials started with the 
discovery of an entirely new relation. Let f(t) be a polynomial in one variable 
over the complex numbers if you wish (an algebraically closed field of charac- 
teristic 0 would do). We define 

n 0 (f) = number of distinct roots of /. 

Thus n 0 (f) counts the zeros of / by giving each of them multiplicity 1, and 
n 0 (f) can be small even though deg / is large. 

Theorem 7.1 (Mason-Stothers, [Mas 84], [Sto 81]). Let a(t ), b(t), c(t) be 
relatively prime polynomials such that a + b = c. Then 

maxdeg{a, b, c) ^ no(abc) - 1. 

Proof (Mason) Dividing by c, and letting / = a/c, g = b/c we have 

/ + g = 1, 

where /, g are rational functions. Differentiating we get f' + g' = 0, which 
we rewrite as 
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SO that 

b . = l = -£l 

a f g'/g' 

Let 

û(t)=ci n ( f - fc (o = c 2 n ( f - pjT j > c w= c 3 n (^ - ^) rfc - 

Then by calculus algebraicized in Exercise 11 (c), we get 

y y r k 

b = /'//_ ^t-a, L t-y k 

a g /g y’ Hj _ y 

L t ~ Pj L t-y k 

A common denominator for /'// and g'/g is given by the product 

N 0 -n(í-««)n(í-A)n(t-^ 


whose degree is n 0 (abc). Observe that N 0 f'/f and N 0 g'/g are both polyno- 
mials of degrees at most n 0 (abc) — 1. From the relation 

b _ N 0 f'/f 
a N 0 g'/g ’ 

and the fact that a , b are assumed relatively prime, we deduce the inequality 
in the theorem. 

As an application, let us prove Fermat’s theorem for polynomials. Thus 
let x(í), y(t\ z(t) be relatively prime polynomials such that one of them has 
degree ^ 1, and such that 

x(t) n + y(t) n = z(t) n . 

We want to prove that n ^ 2. By the Mason-Stothers theorem, we get 

n deg x = deg x(í)" ^ deg x(í) -f deg y(t) -f deg z(t) — 1, 

and similarly replacing x by y and z on the left-hand side. Adding, we find 

n(deg x -f deg y -f deg z) ^ 3(deg x 4- deg y -f deg z) — 3. 

This yields a contradiction if n ^ 3. 

As another application in the same vein, one has: 

Davenport’s theorem. Let f g be non-constant polynomials such that 
/ 3 — g 2 # 0. Then 

deg(/ 3 - g 2 ) ^ jdegf- 1. 

See Exercise 13. 
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One of the most fruitful analogies in mathematics is that between the 
integers Z and the ring of polynomials F[í] over a field F. Evolving from 
the insights of Mason [Ma 84], Frey [Fr 87], Szpiro, and others, Masser and 
Oesterle formulated the abc conjecture for integers as follows. Let m be a 
non-zero integer. Define the radical of m to be 

N 0 (m) = [I P’ 

p\m 

i.e. the product of all the primes dividing m, taken with multiplicity 1. 


The abc conjecture. Given e > 0, there exists a positive number C(e) having 
the follomng property. For any non-zero relative prime integers a , ò, c 
such that a + b = c, we have 

max(|a|, \b\, |c|) g C(s)N 0 (abc) 1+e . 

Observe that the inequality says that many prime factors of a, b , c occur to 
the first power, and that if “small” primes occur to high powers, then they 
have to be compensated by “large” primes occurring to the first power. For 
instance, one might consider the equation 

2 " ± 1 =m. 

For m large, the abc conjecture would state that m has to be divisible by 
large primes to the first power. This phenomenon can be seen in the tables 
of [BLSTW 83]. 

Stewart-Tijdeman [ST 86] have shown that it is necessary to have the e in 
the formulation of the conjecture. Subsequent examples were communicated to 
me by Wojtek Jastrzebowski and Dan Spielman as follows. 

We have to give examples such that for all C > 0 there exist natural 
numbers a , ò, c relatively prime such that a + b = c and \a\> CN 0 (abc). But 
trivially, 

2"|(3 2 " - 1). 

We consider the relations a n + b n = c n given by 

3 2 " — 1 = c„. 

It is clear that these relations provide the desired examples. Other examples 
can be constructed similarly, since the role of 3 and 2 can be played by other 
integers. Replace 2 by some prime, and 3 by an integer = 1 mod p. 

The abc conjecture implies what we shall call the 

Asymptotic Fermat Theorem. For all n sufficiently large , the equation 

x n + y n = z n 

has no solution in relatively prime integers # 0. 



IV, §7 


MASON'S THEOREM AND THE abc CONJECTURE 197 


The proof follows exactly the same pattem as for polynomials, except 
that we write things down multiplicatively, and there is a 1 + e floating 
around. The extent to which the abc conjecture will be proved with an 
explicit constant C(e) (or say C(l) to fix ideas) yields the corresponding 
explicit determination of the bound for n in the application. We now go into 
other applications. 

Hall’s conjecture [Ha 71]. If u, v are relatively prime non-zero integers 
such that u 3 — v 2 # 0, then 

| u 3 — v 2 \ » |u| 1/2_c . 

The symbol » means that the left-hand side is ^ the right-hand side times a 
constant depending only on e. Again the proof is immediate from the abc 
conjecture. Actually, the hypothesis that u, v are relatively prime is not 
necessary; the general case can be reduced to the relatively prime case by 
extracting common factors, and Hall stated his conjecture in this more 
general way. However, he also stated it without the epsilon in the exponent, 
and that does not work, as was realized later. As in the polynomial case, 
Hall’s conjecture describes how small \u 3 — v 2 \ can be, and the answer is not 
too small, as described by the right-hand side. 

The Hall conjecture can also be interpreted as giving a bound for integral 
relatively prime solutions of 

v 2 = u 3 -f b with integral b. 

Then we find 

\u\ « \b\ 2+e . 

More generally, in line with conjectured inequalities from Lang-Waldschmidt 
[La 78], let us fix non-zero integers A, B and let w, v, fc, m, n be variable, 
with u, v relatively prime and mv > m + n. Put 

Au m -f Bv n = k. 

By the abc conjecture, one derives easily that 

(1) |«| « N 0 (k) mn -™ m+n)( 1 +e) and |u| « N 0 (k) mn " m+n)( 1 +e) . 

From this one gets 

|*| « N 0 (k) mn - )m+n)i ' +c) . 

The Hall conjecture is a special case after we replace N 0 (k) with |fc|, because 
N 0 (k)£\k\. 

Next take m = 3 and n = 2, but take /1 = 4 and B = — 27. In this case 
we write 


D = 4u 3 — 21v 2 
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and we get 

(2) |m| « N 0 (D) 2+e and \v\«N 0 (D) 3+e . 

These inequalities are supposed to hold at first for u, v relatively prime. 
Suppose we allow u, v to have some bounded common factor, say d. Write 

u = u'd and v = v'd 

with u', v ' relatively prime. Then 

D = 4 d 3 u' 3 - lld 2 v' 2 . 

Now we can apply inequality (1) with A = 4 d 3 and B = —27d 2 , and we find 
the same inequalities (2), with the constant implicit in the sign « depending 
also on d , or on some fixed bound for such a common factor. Under these 
circumstances, we call inequalities (2) the generalized Szpiro conjecture. 

The original Szpiro conjecture was stated in a more sophisticated situa- 
tion, cf. [La 90] for an exposition, and Szpiro’s inequality was stated in the 
form 

\D\«N(D) 6+ \ 

where N(D) is a more subtle invariant, but for our purposes, it is sufficient 
and much easier to use the radical N 0 (D). 

The point of D is that it occurs as a discriminant. The trend of thoughts 
in the direction we are discussing was started by Frey [Fr 87], who asso- 
ciated with each solution of a + b = c the polynomial 

x(x — u)(x -f b), 

which we call the Frey polynomial. (Actually Frey associated the curve 
defined by the equation y 2 = x(x — a)(x -f b), for much deeper reasons, but 
only the polynomial on the right-hand side will be needed here.) The 
discriminant of the polynomial is the product of the differences of the roots 
squared, and so 

D = (abc) 2 . 

We make a translation 

„ b — a 

to get rid of the x 2 -term, so that our polynomial can be rewritten 

^ 3 -72Í-?3, 

where y 29 y 3 are homogeneous in a, b of appropriate weight. The dis- 
criminant does not change because the roots of the polynomial in £ are 
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translations of the roots of the polynomial in x. Then 

D = 4yi - 21yl 

The translation with ( b — a)/ 3 introduces a small denominator. One may 
avoid this denominator by using the polynomial x(x — 3 a)(x — 3ò), so that 
y 2 , y 3 then come out to be integers, and one can apply the generalized Szpiro 
conjecture to the discriminant, which then has an extra factor D = 3 6 (abc) 2 . 

It is immediately seen that the generalized Szpiro conjecture implies 
asymptotic Fermat. Conversely: 

Generalized Szpiro implies the abc conjecture. 

Indeed, the correspondence (a, b)<-+(y 2 , y 3 ) is invertible, and has the “right” 
weight. A simple algebraic manipulation shows that the generalized Szpiro 
estimates on y 2 , y 3 imply the desired estimates on \a\, \b\. (Do Exercise 14.) 
From the equivalence between abc and generalized Szpiro, one can use the 
examples given earlier to show that the epsilon is needed in the Szpiro 
conjecture. 

Finally, note that the polynomial case of the Mason-Stothers theorem and 
the case of integers are not independent, or specifically the Davenport theorem 
and Hall’s conjecture are related. Examples in the polynomial case parametrize 
cases with integers when we substitute integers for the variables. Such examples 
are given in [BCHS 65], one of them (due to Birch) being 

f(t) = t 6 + 4í 4 + 10í 2 -f 6 and g(t) = t 9 -f 6 1 1 + 21í 5 -f 35í 3 + *£t, 

whence 

deg(/(í) 3 - d(t) 2 ) = j deg/ + 1. 

This example shows that Davenport’s inequality is best possible, because the 
degree attains the lowest possible value permissible under the theorem. 
Substituting large integral values of t = 2 mod 4 gives examples of similarly 
low values for x 3 — y 2 . For other connections of all these matters, cf. [La 90]. 
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§8. THE RESULTANT 

In this section, we assume that the reader is familiar with determinants. 
The theory of determinants will be covered later. The section can be viewed 
as giving further examples of symmetric functions. 

Let A be a commutative ring and let u 0 , w 0 , ...,w m be alge- 

braically independent over A. We form two polynomials: 

f v (X) = v 0 X n + 

g w (X) = w 0 X m + ••• + w m . 

We define the resultant of (v, vv), or of /„, g w , to be the determinant 



m + n 


The blank spaces are supposed to be filled with zeros. 
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If we substitute elements ( a ) = ( a 0 , and ( b ) = (b 0 , . b m ) in A for 

(v) and (vv) respectively in the coeflScients of f v and g w , then we obtain 
polynomials f a and g b with coeíficients in A, and we define their resultant to 
be the determinant obtained by substituting (a) for ( v ) and ( b ) for (vv) in the 
determinant. We shall write the resultant of f v , g w in the form 

Res(/„, gj or R(v, vv). 

The resultant Res(/„, g b ) is then obtained by substitution of (û), (b) for (u), (vv) 
respectively. 

We observe that R(v, vv) is a polynomial with integer coeflScients, i.e. we 
may take A = Z. If z is a variable, then 

R(zv, vv) = z m R(v, vv) and R(v, zvv) = z n R(v, vv) 

as one sees immediately by factoring out z from the first m rows (resp. the 
last n rows) in the determinant. Thus R is homogeneous of degree m in its 
first set of variables, and homogeneous of degree n in its second set of 
variables. Furthermore, R(v, vv) contains the monomial 

VoK 

with coeflScient 1, when expressed as a sum of monomials. 

If we substitute 0 for v 0 and vv 0 in the resultant, we obtain 0, because the 
first column of the determinant vanishes. 

Let us work over the integers Z. We consider the linear equations 


X m ~ l f v (X) = v 0 X n+m ' 

x m ~ 2 L(X) = 

1 + v i X n+m - 2 + ••• 
v 0 X n+m ~ 2 + ••• 

+ v^X" 1 - 1 
+ v n X m ~ 2 


L(X) = 

o 

3 

+ ••• 

+ v„ 

X'-'gJX) = w 0 X n+m - 

_1 + w, X n+m - 2 + ••• 

+ vv^"- 1 


X n ~ 2 gJX) = 

w 0 X n+m ~ 2 + ••• 

+ w m X"- 2 


gjx) = 

w 0 X m 

+ ... 

+ w m 


Let C be the column vector on the left-hand side, and let 

C 0 , • • • > C m +„ 

be the column vectors of coeflScients. Our equations can be written 
C = X n+m ~ l C 0 + • • • + 1 • C m+n . 

By Cramer’s rule, applied to the last coeflScient which is = 1, 

R(v, vv) = det(C 0 ,..., C m + n ) = det(C 0 ,..., C m + n _j, C). 
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From this we see that there exist polynomials <p v w and ij/ VtW in Z[v 9 w][A"] 
such that 


<P,v,wfv + ^V t w9w = R(v y w) = Res (f V9 f w ). 

Note that R(v , vv) e Z[y, w] but that the polynomials on the left-hand side 
involve the variable X. 

If X: Z[v, vv] -► A is a homomorphism into a commutative ring A and we 
let À(v) = ( a ), A(w) = (b), then 

+ 4>a,b9b = K(«, í>) = ReS (f a , f b ). 

Thus from the universal relation of the resultant over Z we obtain a similar 
relation for every pair of polynomials, in any commutative ring A. 

Proposition 8.1. Let K be a subfield of a field L, and let / fl , g b be 
polynomials in K[X~\ having a common root Ç in L. Then R(a , b) = 0. 

Proof. If / fl (£) = g b (Ç) = 0, then we substitute £ for X in the expression 
obtained for R(a , b) and find R(a , fc) = 0. 

Next, we shall investigate the relationship between the resultant and the 
roots of our polynomials f v9 g w . We need a lemma. 

Lemma 8.2. Let h(X í9 ..., X n ) be a polynomial in n variables over the 
integers Z. If h has the value 0 when we substitute X x for X 2 and leave 
the other X t fixed (i ^ 2), then h(X ì9 ..., X n ) is divisible by X Y — X 2 in 
Z[X u ... 9 X n l 

Proofi Exercise for the reader. 

Let v 0 , t í9 ..., t n9 w 0 , u i9 ..., u m be algebraically independent over Z and 
form the polynomials 

L = v 0 (X - tl ) - (X - t n ) = v 0 X n + ••• + !;., 

= w 0 (X - Ui) ■ ■ ■ (X - u m ) = w 0 X m + • • • + w m . 

Thus we let 

Vi = (~ 1 )%*&) and wj = (- l/^os/u). 

We leave to the reader the easy verification that 

V 0 , V l9 ..., !?„, W Q , Wj, ..., w m 
are algebraically independent over Z. 

Proposition 8.3. Notation being as above 9 we have 

Res (f v9 g w ) = v%wZ fj ft (U ~ u j\ 

ì=i j=i 
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Proof. Let S be the expression on the right-hand side of the equality in 
the statement of the proposition. 

Since R(v, w) is homogeneous of degree m in its first variables, and 
homogeneous of degree n in its second variables, it follows that 

R = i?5f w£h(t, u) 

where h(t, u) e Z[í, u]. By Proposition 8.1, the resultant vanishes when we 
substitute ti for Uj (i = 1 ,..., n and j = 1 ,..., m), whence by the lemma, view- 
ing R as an element of Z[i? 0 , w 0 , í, u] it follows that R is divisible by - Uj 
for each pair (ij). Hence S divides R in Z[i; 0 , w 0 , í, u], because t^-Uj is 
obviously a prime in that ring, and different pairs (ij) give rise to different 
primes. 

From the product expression for S, namely 

(1) ^=« n ri (h - uj ), 

i=i j =i 

we obtain 

n n m 

n s(td =< n n ( f i - «a 

i =1 i — 1 j =1 

whence 

(2) S = 9(tf). 

i= 1 

Similarly, 

(3) S = (-l) wm w5 f\f(u } ). 

j =i 

From (2) we see that S is homogeneous and of degree n in (w), and from (3) 
we see that S is homogeneous and of degree m in (v). Since R has exactly the 
same homogeneity properties, and is divisible by S, it follows that R = cS for 
some integer c. Since both R and S have a monomial t; 0 w£ occurring in 
them with coefficient 1, it follows that c = 1, and our proposition is proved. 

We also note that the three expressions found for S above now give us a 
factorization of R. We also get a converse for Proposition 8.1. 

Corollary 8.4. Let f a9 g b be polynomials mth coefficients in a fìeld K , such 
that a 0 b 0 # 0, and such that f a9 g b split in factors of degree 1 in K[2Q. 
Then Res(/ a , g b ) = 0 if and only if f a and g b have a root in common. 

Proof. Assume that the resultant is 0. If 

f a = a 0 (X-*J-JX-0L n \ 

g b = b 0 (X -M-iX -p n \ 

is the factorization of f a , g b , then we have a homomorphism 
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Z[r 0 , t, w 0 , u] -+ K 

such that v 0 \->a 0 , w 0 \-*b 0 , i— ► a^, and u ; - for all i,j. Then 
0 = Res(/„, g b ) = a^b " 0 fl T\ («< ~ ft), 

whence / a , f b have a root in common. The converse has already been 
proved. 

We deduce one more relation for the resultant in a special case. Let f v be 
as above, 

f v (X) = v 0 X n + ••• + = v 0 (X - t x ) ••• (X - t n ). 


From (2) we know that if f v is the derivative of f v , then 
(4) Res (/„/;) = 0lm 

i 


Using the product rule for differentiation, we find: 


fv(X) = X v 0 (X - tl )- - (X-t í )-(X- t n ), 

i 

fM = v 0 (t t - ti) •• • (f* - 0 • • • (fi - t n ). 


where a roof over a term means that this term is to be omitted. 
We define the discriminant of f v to be 

D(f v ) = D(V) = (- 1 r (n-m v 2n- 2 {ti _ tj y 

i*j 


Proposition 8.5. Let f v be as above and have algebraically independent 

coefficients over Z. Then 

(5) Res(/„, /„') = v 0 "~ l n (U - tj) = (-1 T (n - m v 0 D(f v ). 

i*j 

Proof One substitutes the expression obtained for f v (t t ) into the prod- 
uct (4). The result follows at once. 

When we substitute 1 for u 0 , we find that the discriminant as we defined 
it in the preceding section coincides with the present definition. In particular, 
we find an explicit formula for the discriminant. The formulas in the special 
case of polynomials of degree 2 and 3 will be given as exercises. 

Note that the discriminant can also be written as the product 

D(f v ) = vi"- 2 n (t,- - tj) 2 . 

i<j 

Serre once pointed out to me that the sign ( —l)' ,(n_1) / 2 was missing in the 
first edition of this book, and that this sign error is quite common in the 
literature, occurring as it does in van der Waerden, Samuel, and Hilbert (but 
not in his collected works, corrected by Olga Taussky); on the other hand 
the sign is correctly given in Weber’s Algebra , Vol. I, 50. 

For a continuation of this section, see Chapter IX, §3 and §4. 
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§9. POVVER SERIES 

Let X be a letter, and let G be the monoid of functions from the set {X} 
to the natural numbers. If v e N, we denote by X v the function whose value 
at X is v. Then G is a multiplicative monoid, already encountered when we 

discussed polynomials. Its elements are X° f X l 9 X 2 ,..., X v ,_ 

Let A be a commutative ring, and let ^[[A"]] be the set of functions 
from G into A, without any restriction. Then an element of ^[[^]] may be 
viewed as assigning to each monomial X v a coefficient a v eA. We denote 
this element by 

00 

Z 

v=0 

The summation symbol is not a sum, of course, but we shall write the above 
expression also in the form 

a 0 X° ^a^X 1 + ••• 

and we call it a formal power series with coefficients in A, in one variable. 
We call a 0f a l9 ... its coefficients. 

Given two elements of ^[[A"]], say 

2 tf v * v and f b M X* 9 

v = 0 n =0 

we define their product to be 

00 

Z CtX‘ 

i-0 

where 

Ci Z ciyb^. 

v+H=i 

Just as with polynomials, one defines their sum to be 

i («v + K)X\ 

v = 0 

Then we see that the power series form a ring, the proof being the same as 
for polynomials. 

One can also construct the power series ring in several variables 
A[[X ìf ..., X n ~]\ in which every element can be expressed in the form 

X a {v) Xl' • • • = X a {v) M {v) (X íf ... f X n ) 

(V) 

with unrestricted coefficients a {v) in bijection with the n-tuples of integers 
(vj,..., v„) such that v f ^ 0 for all i. It is then easy to show that there is an 
isomorphism between A[\X lf ..., X n ]\ and the repeated power series ring 
^[[A"i]] **• [[A" n ]]. We leave this as an exercise for the reader. 
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The next theorem will give an analogue of the Euclidean algorithm for 
power series. However, instead of dealing with power series over a field, it is 
important to have somewhat more general coeíficients for certain applica- 
tions, so we have to introduce a little more terminology. 

Let A be a ring and / an ideal. We assume that 

n i '={«}• 

V = 1 

We can view the powers / v as defining neighborhoods of 0 in A , and we can 
transpose the usual definition of Cauchy sequence in analysis to this situation, 
namely: we define a sequence {a n } in A to be Cauchy if given some power / v 
there exists an integer N such that for all m.n'^.N we have 

«m - tf„e/ v . 

Thus I v corresponds to the given e of analysis. Then we have the usual 
notion of convergence of a sequence to an element of A. One says that A is 
complete in the /-adic topology if every Cauchy sequence converges. 

Perhaps the most important example of this situation is when A is a local 
ring and / = m is its maximal ideal. By a complete local ring, one always 
means a local ring which is complete in the m-adic topology. 

Let fc be a field. Then the power series ring 

R = kllX í9 ... 9 X n -fi 

in n variables is such a complete local ring. Indeed, let m be the ideal 
generated by the variables X l9 ..., X n . Then R/ m is naturally isomorphic to 
the field k itself, so m is a maximal ideal. Furthermore, any power series of 
the form 

f(X) = c 0 -MX) 

with c 0 e fc, c 0 t^O and ffX) e m is invertible. To prove this, one may first 
assume without loss of generality that c 0 = 1. Then 

(i - MX))~ l = ì + f x (X) + MX) 2 + /iW 3 + • • • 

gives the inverse. Thus we see that m is the unique maximal ideal and R is 
local. It is immediately verified that R is complete in the sense we have just 
defined. The same argument shows that if fc is not a field but c 0 is invertible 
in fc, then again f(X) is invertible. 

Again let A be a ring. We may view the power series ring in n variables 
(n > 1) as the ring of power series in one variable X n over the ring of power 
series in n — 1 variables, that is we have a natural identification 

If A = fc is a field, the ring k[f_X ì9 ...,X n _ 1 ]] is then a complete local 
ring. More generally, if o is a complete local ring, then the power series ring 
°[[^]] is a complete local ring, whose maximal ideal is (m, X) where m is 
the maximal ideal of o. Indeed, if a power series £fl v X v has unit constant 
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term a 0 e o*, then the povver series is a unit in o[[X]], because first, without 
loss of generality, we may assume that a 0 = 1, and then we may invert 1 + h 
with h e (m, X) by the geometric series 1 — h + h 2 — h 2 + • * *. 

In a number of problems, it is useful to reduce certain questions about 
power series in several variables over a field to questions about power series 
in one variable over the more complicated ring as above. We shall now 
apply this decomposition to the Euclidean algorithm for power series. 


Theorem 9.1. Let o be a complete local ring mth maximal ideal m. Let 

f(X) = £ ai x‘ 

i= 0 

be a power series in o[[A"]] ( one variable ), such that not all a { lie in m. 
Say a 0 , ..., a n e m, and a n e o* is a unit. Given g e o[[X]] we can solve 
the equation 

g = gf + r 

uniquely with q e o[[A"]], r e o[A"], and deg r ^ n — 1. 

Proof (Manin). Let a and t be the projections on the beginning and 
tail end of the series, given by 

a: X biX 1 ^"X biX' = b 0 + b x X + • + b n . x X n ~\ 

i= 0 

t: Y, biX l \-> £ biX l ~ n = b n + b n+í X + b n + 2 X 2 + *•*. 

i=n 

Note that r(hX") = h for any h e o[[A r ]]; and h is a polynomial of degree 
< n if and only if r(h) = 0. 

The existence of q, r is equivalent with the condition that there exists q 
such that 


*(g) = 

Hence our problem is equivalent with solving 


r(g) = r(qa(f)) + r(qr(f)X") = r(qa(f)) + qr(f). 

Note that r (/) is invertible. Put Z = qr(f). Then the above equation is 
equivalent with 




Note that 


nJ) 

because a(f)/r(f ) e mo[[^f]]. We can therefore invert to find Z, namely 
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which proves both existence and uniqueness and concludes the proof. 


Theorem 9.2. (Weierstrass Preparation). The power series f in the pre- 

vious theorem can be written uniquely in the form 

where e m, and u is a unit in °[ml 
Proof Write uniquely 

X n = qf + r, 

by the Euclidean algorithm. Then q is invertible, because 


q = c 0 + c Y x + •••, 


so that 


/ = ■" + a n X n + •••, 
1 = c 0 a n (mod m), 


and therefore c 0 is a unit in o. We obtain qf = X n — r, and 

f = q'\X n -r\ 


with r = 0 (mod m). This proves the existence. Uniqueness is immediate. 

The integer n in Theorems 9.1 and 9.2 is called the Weierstrass degree of /, 
and is denoted by deg^ /. We see that a power series not all of whose coeffi- 
cients lie in m can be expressed as a product of a polynomial having the given 
Weierstrass degree, times a unit in the power series ring. Furthermore, all 
the coefficients of the polynomial except the leading one lie in the maximal 
ideal. Such a polynomial is called distinguished, or a Weierstrass polynomial. 


Remark. I rather like the use of the Euclidean algorithm in the proof of 
the Weierstrass Preparation theorem. However, one can also give a direct 
proof exhibiting explicitly the recursion relations which solve for the coeffi- 
cients of u, as follows. Write u = £ c^X 1 . Then we have to solve the 
equations 

b 0 c 0 = u 0 , 


b 0 Ci + b x c 0 = a l9 


bo c n-i + * ’ * + K-i c o — a n -1 > 
bo c n + ’ ’ ’ + c 0 a ni 

^0 C n+l + ' * ’ + Ci = U n + 1? 



IV, §9 


POWER SERIES 209 


In fact, the system of equations has a unique solution mod m r for each 
positive integer r, after selecting c 0 to be a unit, say c 0 = 1. Indeed, from 
the first n equations (from 0 to n — 1) we see that b 0 ,..., are uniquely 
determined to be 0 mod m. Then c„, c n+1 ,... are uniquely determined 
mod m by the subsequent equations. Now inductively, suppose we have 
shown that the coeíficients b i9 Cj are uniquely determined mod m r . Then one 
sees immediately that from the conditions a 0 ,..., a n _ x = 0 mod m the first n 
equations define uniquely mod m r+1 because all b, = 0 mod m. Then 
the subsequent equations define c } mod m r+1 uniquely frorn the values of 
bi mod m r+1 and Cj mod m r . The unique system of solutions mod m r for each 
r then defines a solution in the projective limit, which is the complete local 
ring. 

We now have all the tools to deal with unique factorization in one important 
case. 

Theorem 9.3. Let k be a field. Then k[[X Ì9 ..., X n ]] is factorial. 

Proof. Let f(x) = f(X Ì9 . .. 9 X n ) e /:[[X]] be 0. After making a sufficiently 
general linear change of variables (when k is infinite) 

Xi = 2 CijYj with c^ e k , 

we may assume without loss of generality that/(0,... , 0, x n ) 0. (When k is 
fìnite, one has to make a non-linear change, cf. Theorem 2.1 of Chapter VIII.) 
Indeed, if we write f(X) = f d (X) + higher terms, where f d (X) is a homogeneous 
polynomial of degree d ^ 0, then changing the variables as above preserves the 
degree of each homogeneous component of/, and since k is assumed infìnite, 
the coeffìcients c^- can be taken so that in fact each power Yf (i = 1,..., n) 
occurs with non-zero coefficient. 

We now proceed by induction on n. Let R n = k[[X Ì9 ..., X n ]\ be the power 
series in n variables, and assume by induction that R n _ x is factorial. By Theorem 
9.2, write/= gu where u is a unit and g is a Weierstrass polynomial in R n - x [X n ]. 
By Theorem 2.3, R n -\[X n ] is factorial, and so we can write g as a product of 
irreducible elements g ì9 ..., g r e R n _ x [X n ] 9 so/= g x • • • g r u 9 where the factors 
g ( are uniquely determined up to multiplication by units. This proves the existence 
of a factorization. As to uniqueness, suppose/is expressed as a product of 
irreducible elements in R n , f = f x • • • f s . Then f q (0 9 ..., 0, x n ) =£ 0 for each 
q = 1,.. . , s 9 so we can write f q = h q u q where u q is a unit and h q is a Weierstrass 

polynomial, necessarily irreducible in R n _ x [X n ]. Then / = gu= n h q n u q 
with g and all h q Weierstrass polynomials. By Theorem 9.2, we must have 

g = n h q9 and since R n - X [X n ] is factorial, it follows that the polynomials h q 
are the same as the polynomials g i9 up to units. This proves uniqueness. 

Remark. As was pointed out to me by Dan Anderson, I incorrectly stated 
in a previous printing that if 0 is a factorial complete local ring, then £)[[X]] 
is also factorial. This assertion is false, as shown by the example 
k(t)[[X lt X 2 , X 3 ]]/(X? + X\ + X 3 ) 
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due to P. Salmon, Su un problema post da P. Samuel, Atti Acad. Naz. Lincei 
Rend. Cl. Sc. Fis. Matem. 40(8) (1966) pp. 801-803. It is true that if O is a 
regular local ring in addition to being complete, then £)[[X]] is factorial, but this 
is a deeper theorem. The simple proof I gave for the power series over a field 
is classical. I chose the exposition in [GrH 78]. 

Theorem 9.4. IfA is Noetherian, then A[[X ]] is also Noetherian. 

Proof. Our argument will be a modification of the argument used in the 
proof of Hilbert’s theorem for polynomials. We shall consider elements of 
lowest degree instead of elements of highest degree. 

Let 91 be an ideal of A[[X]]. We let a, be the set of elements a e A such 
that a is the coefficient of X 1 in a power series 

aX' + terms of higher degree 

lying in 91. Then is an ideal of A, and a f c= a i+í (the proof of this assertion 
being the same as for polynomials). The ascending chain of ideals stops: 


a 0 c CLt c a 2 c= • • • c a r = a r+ì = • • • 

As before, let a {j (i = 0, ...,r and j = 1, ...,n f ) be generators for the ideals 
a h and let fj be power series in A having as beginning coefficient. 
Given / g 91, starting with a term of degree d, say d g r, we can find 
elements c v . . . , c nd e A such that 

f ~ C \fd\ ~ ~ C njdn á 

starts with a term of degree ^ d + 1. Proceeding inductively, we may as- 
sume that d > r. We then use a linear combination 

f _ c y>x*-'f rl - ■■■ - d d jx d -% nr 

to get a power series starting with a term of degree ^ d + 1. In this way, if 
we start with a power series of degree d > r, then it can be expressed as a 
linear combination of f rl , , f rnr by means of the coefficients 

9ÁX) = t c["X '~ r ,.... g„ r (X) = t c<"X'- r , 

v=d v=d 

and we see that the f i} generate our ideal 91, as was to be shown. 

Corollary 9.5. If A is a Noetherian commutative ring, or a field, then 
/1 XJ] is Noetherian. 

Examples. Power series in one variable are at the core of the theory of 
functions of one complex variable, and similarly for power series in several 
variables in the higher-dimensional case. See for instance [Gu 90]. 

Weierstrass polynomials occur in several contexts. First, they can be used 
to reduce questions about power series to questions about polynomials, in 
studying analytic sets. See for instance [GrH 78], Chapter 0. In a number- 
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theoretic context, such polynomials occur as characteristic polynomials in 
the Iwasawa theory of cyclotomic fields. Cf. [La 90], starting with Chapter 
5. 

Power series can also be used as generating functions. Suppose that to 
each positive integer n we associate a number a(n). Then the generating 
function is the power series £tf(/z)í n . In significant cases, it turns out that 
this function represents a rational function, and it may be a major result to 
prove that this is so. 

For instance in Chapter X, §6 we shall consider a Poincaré series, 
associated with the length of modules. Similarly, in topology, consider a 
topological space X such that its homology groups (say) are finite dimen- 
sional over a field k of coefficients. Let h n = dim H n (X , fc), where H n is the 
n-th homology group. The Poincaré series is defined to be the generating 
series 


Px(t) = I h n t n . 

Examples arise in the theory of dynamical systems. One considers a 
mapping T: X -+ X from a space X into itself, and we let N n be the number 
of fixed points of the n-th iterate T n =ToTo -oT(n times). The generat- 
ing function is XA/ n í' 1 . Because of the number of references I give here, I 
list them systematically at the end of the section. See first Artin-Mazur 
[ArM 65]; a proof by Manning of a conjecture of Smale [Ma 71]; and 
Shub’s book [Sh 87], especially Chapter 10, Corollary 10.42 (Manning’s 
theorem). 

For an example in algebraic geometry, let V be an algebraic variety 
defined over a finite field k. Let K n be the extension of k of degree n (in a 
given algebraic closure). Let N n be the number of points of V in K n . One 
defines the zeta function Z(t) as the power series such that Z(0) = 1 and 


Z'/Z(t) = £ N.f- 1 . 

n =1 

Then Z(t) is a rational function (F. K. Schmidt when the dimension of V is 1, 
and Dwork in higher dimensions). For a discussion and references to the 
literature, see Appendix C of Hartshorne [Ha 77]. 

Finally we mention the partition function p(n\ which is the number of 
ways a positive integer can be expressed as a sum of positive integers. The 
generating function was determined by Euler to be 

1 + f P(n)f = ft (1 - 'T 1 . 

n =1 n =1 

See for instance Hardy and Wright [HardW 71], Chapter XIX. The generat- 
ing series for the partition function is related to the power series usually 
expressed in terms of a variable q , namely 
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A = <? fì 0 - 4") 24 = £ 

n =1 n=1 

which is the generating series for the Ramanujan function x(n). The power 
series for A is also the expansion of a function in the theory of modular 
functions. For an introduction, see Serre’s book [Se 73], last chapter, and 
books on elliptic functions, e.g. mine. We shall mention one application of 
the power series for A in the Galois theory chapter. 

Generating power series also occur in /C-theory, topological and algebraic 
geometric, as in Hirzebruch’s formalism for the Riemann-Roch theorem and 
its extension by Grothendieck. See Atiyah [At 67], Hirzebruch [Hi 66], and 
[FuL 86]. I have extracted some formal elementary aspects having directly 
to do with power series in Exercises 21-27, which can be viewed as basic 
examples. See also Exercises 31-34 of the next chapter. 


[ArM 65] 
[At 67] 
[FuL 85] 
[GrH 78] 
[Gu 90] 
[HardW 71] 

[Hart 77] 
[Hi 66] 

[La 90] 

[Ma 71] 

[Se 73] 

[Sh 87] 
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EXERCISES 

1. Let k be a field and f(X) e /c[X] a non-zero polynomial. Show that the following 
conditions are equivalent: 

(a) The ideal ( f(X )) is prime. 

(b) The ideal (/(X)) is maximal. 

(c) f(X) is irreducible. 

2. (a) State and prove the analogue of Theorem 5.2 for the rational numbers. 

(b) State and prove the analogue of Theorem 5.3 for positive integers. 

3. Let / be a polynomial in one variable over a field k. Let X , Y be two variables. 
Show that in k\_X , 7] we have a “Taylor series” expansion 

f(X+Y)=f(X)+ £ Çí (X)Y\ 

î=1 

where (p t (X ) is a polynomial in X with coefficients in k. If k has characteristic 0, 
show that 


q>i(X) = 


D'f(X) 


4. Generalize the preceding exercise to polynomials in several variables (introduce 
partial derivatives and show that a finite Taylor expansion exists for a polynomial 
in several variables). 

5. (a) Show that the polynomials X 4 + 1 and X 6 + X 3 + 1 are irreducible over the 

rational numbers. 

(b) Show that a polynomial of degree 3 over a field is either irreducible or has a 
root in the field. Is X 3 — 5X 2 + 1 irreducible over the rational numbers? 

(c) Show that the polynomial in two variables X 2 + Y 2 — 1 is irreducible over 
the rational numbers. Is it irreducible over the complex numbers? 

6. Prove the integral root test of §3. 

7. (a) Let k be a finite field with q elements. Let f(X x , ...,X n ) be a polynomial in 

k\X~\ of degree d and assume /(0,..., 0) = 0. An element (a l9 ..., a„) e k (n) 
such that f(a) = 0 is called a zero of /. If n > d , show that / has at least one 
other zero in k {n) . [Hint: Assume the contrary, and compare the degrees of 
the reduced polynomial belonging to 

i-f(xy~ l 

and (1 — Xf' 1 ) ••• (1 — XH~ l ). The theorem is due to Chevalley.] 

(b) Refine the above results by proving that the number N of zeros of / in k (n) is 
= 0 (mod p), arguing as follows. Let i be an integer ^ 1. Show that 


2 >‘= 



if q — 1 divides i, 
otherwise. 


Denote the preceding function of i by ij/(i). Show that 
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I (i-/t*r‘) 

X6 fc<"> 

and for each n-tuple (i t ,..., i n ) of integers ^ 0 that 

£ x[' ■■■ x‘„" = !) </'(!„). 

JC6 

Show that both terms in the sum for N above yield 0 mod p. (The above 
argument is due to Warning.) 

(c) Extend Chevalley’s theorem to r polynomials of degrees d u ...,d r 

respectively, in n variables. If they have no constant term and n > show 
that they have a non-trivial common zero. 

(d) Show that an arbitrary function /: k (n) -* k can be represented by a poly- 
nomial. (As before, k is a finite field.) 

8. Let A be a commutative entire ring and X a variable over A. Let a, be A and 
assume that a is a unit in A. Show that the map X\-+aX 4- b extends to a 
unique automorphism of A[X] inducing the identity on A. What is the inverse 
automorphism? 

9. Show that every automorphism of A[X~\ is of the type described in Exercise 8. 

10. Let K be a field, and A(X) the quotient field of K[X]. Show that every automorphism 
of ÀXX) which induces the identity on K is of type 

Y[ aX + b 
cX + d 

with a , b , c, de K such that (aX + b)/(cX 4- d) is not an element of À, or 
equivalently, ad — bc # 0. 

11. Let A be a commutative entire ring and let K be its quotient field. We show here 
that some formulas from calculus have a purely algebraic setting. Let D: A-> A 
be a derivation, that is an additive homomorphism satisfying the rule for the 
derivative of a product, namely 

D(xy) = xDy 4- yDx for x, y e A. 

(a) Prove that D has a unique extension to a derivation of K into itself, and that 
this extension satisfies the rule 



for x, y e A and y # 0. [Define the extension by this formula, prove that it is 
independent of the choice of x, y to write the fraction x/y, and show that it 
is a derivation having the original value on elements of A.\ 

(b) Let L(x) = Dx/x for x e K*. Show that L(xy) = L(x) + L(y). The homo- 
morphism L is called the logarithmic derivative. 

(c) Let D be the standard derivative in the polynomial ring /c[À] over a field k. 
Let R(X) = cf|(À — a,) m< with a, e k, c e k, and m, e Z, so R(X) is a rational 
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function. Show that 


12. (a) If f(X) = aX 2 + bX + c, show that the discriminant of / is b 2 - 4 ac. 

(b) If f(X) = a 0 X 3 + a x X 2 + a 2 X + a 2 , show that the discriminant of / is 

a\a\ — 4a 0 a\ — 4 a\a 3 — Ha 2 0 a\ + 18 a 0 a x a 2 a^. 

(c) Let f(X) = (X ~t l )---(X-t n ). Show that 

D/-(-r w nm 

i=1 

13. Polynomials will be taken over an algebraically closed field of characteristic 0. 

(a) Prove 

Davenport’s theorem. Let f(t), g(t) be polynomials such that / 3 — g 2 # 0. Then 
deg(/ 3 - 0 2 )^ideg/+ 1 . 

Or put another way, let h = / 3 — g 2 and assume h ^ 0. Then 

deg / ^ 2 deg h — 2 . 

To do this, first assume f g relatively prime and apply Mason’s theorem. In 
general, proceed as follows. 

(b) Let A, B, f, g be polynomials such that Af Bg are relatively prime ^ 0. Let 
h = Af 3 + Bg 2 . Then 

deg / ^ deg A + deg B + 2 deg h — 2. 

This follows directly from Mason’s theorem. Then starting with f g not 
necessarily relatively prime, start factoring out common factors until no 
longer possible, to effect the desired reduction. When I did it, I needed to do 
this step three times, so don’t stop until you get it. 

(c) Generalize (b) to the case of / m — g n for arbitrary positive integer exponents 
m and n. 

14. Prove that the generalized Szpiro conjecture implies the abc conjecture. 

15. Prove that the abc conjecture implies the following conjecture: There are infinitely 
many primes p such that 2 P_1 ^ 1 mod p 2 . [Cf. the reference [Sil 88 ] and [La 90] 
at the end of §7.] 

16. Let vv be a complex number, and let c = max(l, |w|). Let F, G be non-zero 
polynomials in one variable with complex coefficients, of degrees d and d' respec- 
tively, such that |F|, |G| ^ 1. Let R be their resultant. Then 

\R\ ác d+d '[|F(w)| + \G(w)\~\\F\ d '\G\ d (d + d') d+d '. 

(We denote by |F| the maximum of the absolute values of the coeíficients of F.) 

17. Let d be an integer ^ 3. Prove the existence of an irreducible polynomial of 
degree d over Q, having precisely d — 2 real roots, and a pair of complex 
conjugate roots. Use the following construction. Let b l9 ..., h d - 2 be distinct 
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integers, and let a be an integer > 0. Let 

g(X) = (.X 2 + a)(X -b x )- - (X-b d _ J = X à + c 4 . t X d ~ l + • • ■ + c 0 . 
Observe that c, e Z for all i. Let p be a prime number, and let 

g n (X) = g(X) + ^ 

so that g n converges to g (i.e. the coefficients of g n converge to the coefficients 
of g\ 

(a) Prove that g n has precisely d — 2 real roots for n sufficiently large. (You may 
use a bit of calculus, or use whatever method you want.) 

(b) Prove that g n is irreducible over Q. 


Integral-valued polynomials 

18. Let P(X)e Q[Y] be a polynomial in one variable with rational coefficients. It 
may happen that P(ri) e Z for all sufficiently large integers n without necessarily P 
having integer coefficients. 

(a) Give an example of this. 

(b) Assume that P has the above property. Prove that there are integers 
c 0 , c í9 ...» c r such that 

P(X) = Co(*) + Ci( r f j) + + C„ 

where 

(í) = À* ( * -1) ‘" (X-r + 1) 

is the binomial coefficient function. In particular, P(n) e Z for all n. Thus we 
may call P integral valued. 

(c) Let /: Z Z be a function. Assume that there exists an integral valued 
polynomial Q such that the difference function A/ defìned by 

(Af)(n) = f(n) — f(n — 1) 

is equal to Q(n) for all n sufficiently large. Show that there exists an integral- 
valued polynomial P such that f(ri) = P(n) for all n sufficiently large. 


Exercises on symmetric functions 

19. (a) Let X Í9 ...,X H be variables. Show that any homogeneous polynomial in 
Z[X lf ... f X n ] of degree > n(n — 1) lies in the ideal generated by the elemen- 
tary symmetric functions s^,..., s n . 

(b) With the same notation show that Z[X l9 ... 9 X n ] is a free Z[s 1? ..., sj 
module with basis the monomials 

X ir) = X[ l ••■ X n '" 

with 0 ^ r { r ^ n — i. 
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(c) Let <X’ 1 ,...,X II and Y l9 ...,Y m be two independent sets of variables. Let 
S!,...,s n be the elementary symmetric functions of X and si,...,s^ the 
elementary symmetric functions of Y (using vector vector notation). Show 
that Z[X, 7] is free over Z[s, s'] with basis X (r) Y iq \ and the exponents (r), (q) 
satisfying inequalities as in (b). 

(d) Let / be an ideal in Z[s, s']. Let J be the ideal generated by / in Z[X, T]. 
Show that 

J n Z[s, s'] = /. 

20. Let A be a commutative ring. Let t be a variable. Let 

f(t) = £ a it' and 3(t) = Ê 

i=0 i=0 

be polynomials whose constant terms are a 0 = b 0 = 1. If 

) = 1 , 

show that there exists an integer iV (= (m + n)(m -h n — 1)) such that any mono- 
mial 


with Yéì r j > N is equal to 0. [Hint : Replace the a’s and b 's by variables. Use 
Exercise 19(b) to show that any monomial M(a) of weight > N lies in the ideal / 
generated by the elements 

k 

C k = Yj a i^k~i 

i=0 

(letting a 0 = b 0 = 1). Note that c fc is the /c-th elementary symmetric function of 
the m -I- n variables (X, y).] 

[Note: For some interesting contexts involving symmetric functions, see 
Cartier’s talk at the Bourbaki Seminar, 1982-1983.] 


A-rings 

The following exercises start a train of thought which will be pursued in Exercise 
33 of Chapter V; Exercises 22-24 of Chapter XVIII; and Chapter XX, §3. These 
originated to a large extent in Hirzebruch’s Riemann-Roch theorem and its extension 
by Grothendieck who defined A-rings in general. 

Let K be a commutative ring. By 2-operations we mean a family of mappings 

k l : K->K 

for each integer i ^ 0 satisfying the relations for all x e K: 

A°(x) = 1, A*(x) = x, 

and for all integers n ^ 0, and xjeX, 

A"(x + y)=t À i (x)X'~ i (y). 

Ì =0 
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The reader will meet examples of such operations in the chapter on the alternat- 
ing and symmetric products, but the formalism of such operations depends only 
on the above relations, and so can be developed here in the context of formal 
power series. Given a A-operation, in which case we also say that K is a A-ring, 
we define the power series 

A,(x) = t 

i =0 

Prove the following statements. 

21. The map xh->2,(x) is a homomorphism from the additive group of K into the 
multiplicative group of power series 1 + tK[[t]] whose constant term is equal to 
1. Conversely, any such homomorphism such that 2 t (x) = 1 + xt + higher terms 
gives rise to 2-operations. 

22. Let s = at + higher terms be a power series in K[[í]] such that a is a unit in K. 
Show that there is a power series 

t = g(s) = £ hjS 1 with bi e K. 

Show that any power series f(t) eK[[í]] can be written in the form h(s) for some 
other power series with coefficients in K. 

Given a 2-operation on X, define the corresponding Grothendieck prnver series 

y t W = = *m(x) 

where s = t/( 1 — t). Then the map 

x * * 7í(x) 

is a homomorphism as before. We define y'(x) by the relation 

Show that y satisfies the following properties. 

23. (a) For every integer n^0we have 

y"(x + y)=t y i (x)y"" i (j'). 

.1=0 

(b) y,(l) = 1/(1 - ()■ 

(c) y,(- 1) = i - í. 

24. Assume that A'u — 0 for i > 1. Show: 

(a) y,(u - 1) = 1 + (u - l)t. 

(b) Tt (l - u) = £ (1 - u)Y. 

i = 0 

25. Bernoulli numbers. Define the Bernoulli numbers B k as the coefficients in the 
power series 




í! 
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Of course, e' = is the standard power series with rational coefflcients 1/n!. 

Prove: 

(a) B 0 = 1, = — 2 » ^2 = 6* 

(b) F( — t) = í + F{t), and B k = 0 if k is odd # 1. 


26. Bernoulli polynomials. Define the Bernoulli polynomials B k (X) by the power 
series expansion 


tp^ °° t k 


It is clear that B k = B fc (0), so the Bernoulli numbers are the constant terms of the 
Bernoulli polynomials. Prove: 

(a) Bo(A') = 1 , B,(X) = X - *, B 2 (X) = X 2 - X + i 

(b) For each positive integer N, 

(c) B k (X) = X k — \kX k ~ l + lower terms. 

(d) F(t, X + 1) - F(t, X) = te x ' = tZX'L 

(e) Bn(X + 1) - B*(JO = kX k ~ l for k ^ 1. 

27. Let N be a positive integer and let / be a function on Z/NZ. Form the power 
series 


n -i te ia+X)t 

Ff(t, X) = X M-psr—. 

a=0 e — 1 

Following Leopoldt, define the generalized Bernoulli polynomials relative to the 
function / by 

F f (t,X)= Ì B*.,(*)£. 

k=o Kl 

In particular, the constant term of B k f (X) is defined to be the generalized 
Bernoulli number B k f = B fct/ (0) introduced by Leopoldt in cyclotomic fields. 
Prove: 

(a) F f (t, X + k) = e k 'F f (t, X). 

(b) F f (t, X + N)- F f (t, X) = (e m - 1 )F f (t, X). 

(c) £[*./(*■ + N) - B t j-(X)] = X' f(a)(a + Xf' 1 . 

K a = o 

(d) B kif (X)= X i^jB^X"-' 

= B k f + kB k _ uf X + ••• + kB Uf X k ~ x + B 0 f X k . 

Note. The exercises on Bernoulli numbers and polynomials are designed not 
only to give examples for the material in the text, but to show how this material 
leads into major areas of mathematics: in topology and algebraic geometry centering 
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around Riemann-Roch theorems; analytic and algebraic number theory, as in the 
theory of the zeta functions and the theory of modular forms, cf. my Introduction 
to Modular Forms , Springer-Verlag, New York, 1976, Chapters XIV and XV; my 
Cyclotomic Fields , I and II, Springer-Verlag, New York, 1990, Chapter 2, §2; Kubert- 
Lang’s Modular Units, Springer-Verlag, New York, 1981; etc. 


Further Comments, 1996-2001. I was informed by Umberto Zannier that what has 
been called Mason’s theorem was proved three years earlier by Stothers [Sto 81], Theo- 
rem 1.1. Zannier himself has published some results on Davenport’s theorem [Za 95], 
without knowing of the paper by Stothers, using a method similar to that of Stothers, 
and rediscovering some of Stothers’ results, but also going beyond. Indeed, Stothers uses 
the “Belyi method” belonging to algebraic geometry, and increasingly appearing as a 
fundamental tool. Mason gave a very elementary proof, accessible at the basic level of 
algebra. An even shorter and very elegant proof of the Mason-Stothers theorem was 
given by Noah Snyder [Sny 00]. I am much indebted to Snyder for showing me that 
proof before publication, and I reproduced it in [La 99b]. But I recommend looking at 
Snyder’s version. 

[La 99b] S. Lang, Math Talks for Undergraduates , Springer Verlag 1999 

[Sny 00] N. Snyder, An alternate proof of Mason’s theorem, Elemente der Math. 55 
(2000) pp. 93-94 

[Sto 81] W. Stothers, Polynomial identities and hauptmoduln, Quart. J. Math. Oxford 
(2) 32 (1981) pp. 349-370 

[Za 95] U. Zannier, On Davenport’s bound for the degree of/ 3 - g 2 and Riemann’s 
existence theorem, Acta Arithm. LXXI.2 (1995) pp. 107-137 



Part Two 

ALGEBRAIC 

EQUATIONS 


This part is concerned with the solutions of algebraic equations, in one 
or several variables. This is the recurrent theme in every chapter of this 
part, and we lay the foundations for all further studies concerning such 
equations. 

Given a subring A of a ring B , and a finite number of polynomials 
/j, ..., f n in A [Xj,..., X n \ we are concerned with the n-tuples 

W 6 

such that 

fi(b u ...,b n ) = 0 

for i = 1,..., r. For suitable choices of A and B , this includes the general 
problem of diophantine analysis when A, B have an “arithmetic” structure. 

We shall study various cases. We begin by studying roots of one polyno- 
mial in one variable over a field. We prove the existence of an algebraic 
closure, and emphasize the role of irreducibility. 

Next we study the group of automorphisms of algebraic extensions of a 
field, both intrinsically and as a group of permutations of the roots of a 
polynomial. We shall mention some major unsolved problems along the 
way. 

It is also necessary to discuss extensions of a ring, to give the possibil- 
ity of analyzing families of extensions. The ground work is laid in Chapter 
VII. 

In Chapter IX, we come to the zeros of polynomials in several variables, 
essentially over algebraically closed fields. But again, it is advantageous to 
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consider polynomials over rings, especially Z> since in projective space, the 
conditions that homogeneous polynomials have a non-trivial common zero 
can be given universally over Z in terms of their coefficients. 

Finally we impose additional structures like those of reality, or metric 
structures given by absolute values. Each one of these structures gives rise to 
certain theorems describing the structure of the solutions of equations as 
above, and especially proving the existence of solutions in important cases. 



CHAPTER V 

Algebraic Extensions 


In this first chapter concerning polynomial equations, we show that given 
a polynomial over a field, there always exists some extension of the field 
where the polynomial has a root, and we prove the existence of an algebraic 
closure. We make a preliminary study of such extensions, including the 
automorphisms, and we give algebraic extensions of finite fields as examples. 


§1. FINITE AND ALGEBRAIC EXTENSIONS 

Let F be a field. If F is a subfield of a field F, then we also say that E is 
an extension field of F. We may view E as a vector space over F, and we say 
that E is a finite or infinite extension of F according as the dimension of this 
vector space is finite or infinite. 

Let F be a subfield of a field E. An element a of E is said to be algebraic 
over F if there exist elements a 09 ..., a n (n ^ 1) of F, not all equal to 0, such 
that 

a 0 + a Y oi + ••• + a n ot n = 0. 

If a # 0, and a is algebraic, then we can always find elements a t as above 
such that a 0 # 0 (factoring out a suitable power of a). 

Let X be a variable over F. We can also say that a is algebraic over F if 
the homomorphism 

F[*]->F 


223 



224 ALGEBRAIC EXTENSIONS 


V, §1 

which is the identity on F and maps X on oc has a non-zero kernel. In that 
case the kernel is an ideal which is principal, generated by a single polyno- 
mial p(X\ which we may assume has leading coefficient 1. We then have an 
isomorphism 

Fixy(p(X)) * FM, 

and since F[a] is entire, it follows that p(X) is irreducible. Having normal- 
ized p(X) so that its leading coefficient is 1, we see that p(X) is uniquely 
determined by a and will be called THE irreducible polynomial of a over F. 
We sometimes denote it by Irr(a, F , X). 

An extension E of F is said to be algebraic if every element of E is 
algebraic over F. 

Proposition 1.1. Let E be a finite extension of F. Then E is algebraic 
over F. 

Proof. Let a e £, a # 0. The powers of a, 

1, a, a 2 , ..., a", 

cannot be linearly independent over F for all positive integers n, otherwise 
the dimension of E over F would be infinite. A linear relation between these 
powers shows that a is algebraic over F. 

Note that the converse of Proposition 1.1 is not true; there exist infinite 
algebraic extensions. We shall see later that the subfield of the complex 
numbers consisting of all algebraic numbers over Q is an infinite extension 
ofQ. 

If E is an extension of F, we denote by 

ÍE:F] 

the dimension of E as vector space over F. It may be infinite. 


Proposition 1.2. Let kbe a field and F cz E extension fields of k. Then 


lE:k] = [£:F][F:k]. 

// {x f } ie/ is a basis for F over k and {yj} jeJ is a basis for E over F, then 
{ X iyj}aj)eixj is a basis for E over k. 


Proof. Let z e E. 
<Xj = 0, such that 


By hypothesis there exist elements oc } e F, almost all 

z = X «jyj- 

jeJ 


For each j e J there exist elements ò,-, e /c, almost all of which are equal to 0, 
such that 
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a j = £ fy*,. 

iel 

and hence 

z = Z Z Víty 

This shows that {x.y,} is a family of generators for E over k. We must show 
that it is linearly independent. Let {c 0 } be a family of elements of k, almost 
all of which are 0, such that 

I E c o x .Tj = 0- 

J í 

Then for each j, 

Z c « x i = ° 

i 

because the elements y } are linearly independent over F. Finally c í; - = 0 for 
each i because {xj is a basis of F over /c, thereby proving our proposition. 

Corollary 1.3. The extension E of k is finite if and only if E is finite over 
F and F is finite over k. 

As with groups, we define a tower of fields to be a sequence 

F i cF 2 <=---cF n 

of extension fields. The tower is called finite if and only if each step is finite. 

Let k be a field, E an extension field, and cleE. We denote by k(oc) the 
smallest subfield of E containing both k and oc. It consists of all quotients 
f(oc)/g(oc), where /, g are polynomials with coeíficients in k and g(a) # 0. 

Proposition 1.4. Let oc be algebraic over k. Then k(ot) = k[a], and k(oc) is 
finite over k. The degree [/c(a): k ] is equal to the degree of Irr(a, k, X). 

Proof. Let p(X) = Irr(a, /c, X). Let f(X)ek[X ] be such that f(oc) # 0. 
Then p(X) does not divide f(X ), and hence there exist polynomials g(X\ 
h(X) g k\_X~\ such that 

g(X)p(X) + h(X)f(X) = 1. 

From this we get /i(a)/(a) = 1, and we see that f(oc) is invertible in /c[a]. 
Hence /c[a] is not only a ring but a field, and must therefore be equal to 
k(oc). Let d = deg p(X). The powers 

1, a,..., a d ~ l 

are linearly independent over /c, for otherwise suppose 
a® + cfia + * * * + a^-^cc d 1 = 0 
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with cii e /c, not all = 0. Let g(X ) = a 0 + * * * + a á ^ X d ~ l . Then g # 0 and 
g(oì) = 0. Hence p(A") divides 0 (À), contradiction. Finally, let /(a)e/c[a], 
where /(A") e k[X~\. There exist polynomials q(X\ r(X)ek[X~\ such that 
deg r < d and 

f(X) = q(X)p(X) + r(X). 

Then /(a) = r(a), and we see that 1, a,..., a d_1 generate /c[a] as a vector space 
over /c. This proves our proposition. 

Let £, F be extensions of a field /c. If E and F are contained in some field 
L then we denote by EF the smallest subfield of L containing both E and 
F, and call it the compositum of E and F, in L. If F, F are not given as 
embedded in a common field L, then we cannot define the compositum. 

Let k be a subfield of E and let a l9 ..., a w be elements of E. We denote 
by 

the smallest subfield of £ containing k and a t ,..., a„. Its elements consist of 
all quotients 

/(«!.•••.«») 

í(«l, •••,«») 

where /, g are polynomials in n variables with coefficients in /c, and 

ôf(ai,...,a„) # 0 . 

Indeed, the set of such quotients forms a field containing k and aj,...,a n . 
Conversely, any field containing k and 

«i, •••, 

must contain these quotients. 

We observe that E is the union of all its subfields /c(a l5 ..., a„) as 
(a x ,...,a n ) ranges over finite subfamilies of elements of E. We could define 
the compositum of an arbitrary subfamily of subfields of a field L as the 
smallest subfield containing all fields in the family. We say that E is finitely 
generated over k if there is a finite family of elements a l 5 ...,a n of E such 
that 

E = /c(a x ,..., a n ). 

We see that E is the compositum of all its finitely generated subfields over k. 


Proposition 1.5. Let E be a finite extension of k. Then E is finitely 
generated. 


Proof 

certainly 


Let {a^,...,^,,} be a basis of E as vector space over k. Then 
E = k(oc í9 ..., a n ). 
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If F = /c(a <x n ) is finitely generated, and F is an extension of /c, both 
F, E contained in L, then 


EF = F(a x ,..., a M ), 

and £F is finitely generated over F. We often draw the following picture: 

EF 



k 


Lines slanting up indicate an inclusion relation between fìelds. We also call 
the extension EF of F the translation of E to F, or also the lifting of E to 
F. 

Let a be algebraic over the field k. Let F be an extension of k , and 
assume /c(a), F both contained in some field L. Then a is algebraic over F. 
Indeed, the irreducible polynomial for a over k has a fortiori coefficients in 
F, and gives a linear relation for the powers of a over F. 

Suppose that we have a tower of fields: 

k c= /c(a x ) c: /c(aj,a 2 ) c ••• c: /c(a x ,..., aj, 

each one generated from the preceding field by a single element. Assume that 
each a f is algebraic over /c, i = 1, ..., n. As a special case of our preceding 
remark, we note that a i+1 is algebraic over /c(a x ,..., a f ). Hence each step of 
the tower is algebraic. 

Proposition 1.6. Let E = /c(a 1? ..., a n ) be a finitely generated extension of 

a field /c, amí assume a f algebraic over k for each i = 1, ..., n. F/zen E is 

finite algebraic over k. 

Proof. From the above remarks, we know that E can be obtained as the 
end of a tower each of whose steps is generated by one algebraic element, 
and is therefore finite by Proposition 1.4. We conclude that E is finite over k 
by Corollary 1.3, and that it is algebraic by Proposition 1.1. 

Let C be a certain class of extension fields F a E. We shall say that e is 
distinguished if it satisfies the following conditions: 

(1) Let /c c F c £ be a tower of fields. The extension k c= E is in Q if and 
only if /c c= F is in e and F c= E is in C. 

(2) If k c= E is in e, if F is any extension of /c, and F, F are both 
contained in some field, then F <= EF is in G. 

(3) If k c= F and k c= F are in G and F, F are subfields of a common field, 
then /c c= FF is in C. 
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The diagrams illustrating our properties are as follows: 



(1) (2) (3) 

These lattice diagrams of fields are extremely suggestive in handling exten- 
sion fields. 

We observe that (3) follows formally from the first two conditions. 
Indeed, one views EF over fc as a tower with steps k cz F cr EF. 

As a matter of notation, it is convenient to write E/F instead of F cz E to 
denote an extension. There can be no confusion with factor groups since we 
shall never use the notation E/F to denote such a factor group when E is an 
extension field of F. 

Proposition 1.7. The class of algebraic extensions is distinguished , and so 
is the class of finite extensions. 

ProoJ'. Consider first the class of finite extensions. We have already 
proved condition (1). As for (2), assume that E/k is finite, and let F be any 
extension of k. By Proposition 1.5 there exist elements a 1 ,...,a n e£ such 
that E = k(a ì9 ..., a n ). Then EF = F(oc l9 ..., a„), and hence EF/F is finitely 
generated by algebraic elements. Using Proposition 1.6 we conclude that 
EF/F is finite. 

Consider next the class of algebraic extensions, and let 

kczF czE 

be a tower. Assume that E is algebraic over k. Then a fortiori , F is 
algebraic over k and E is algebraic over F. Conversely, assume each step in 
the tower to be algebraic. Let oce E. Then a satisfies an equation 

a n oc n + • • • + a 0 = 0 

with a t e F, not all a { = 0. Let F 0 = k(a n ,..., a 0 ). Then F 0 is finite over k by 
Proposition 1.6, and a is algebraic over F 0 . From the tower 

k cz F 0 = k(a n ,..., a 0 ) cz F 0 (a) 

and the fact that each step in this tower is finite, we conclude that F 0 ( a ) is 
finite over fc, whence a is algebraic over fc, thereby proving that E is algebraic 
over fc and proving condition (1) for algebraic extensions. Condition (2) has 
already been observed to hold, i.e. an element remains algebraic under lifting, 
and hence so does an extension. 
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Remark. It is true that finitely generated extensions form a distinguished 
class, but one argument needed to prove part of (1) can be carried out only 
with more machinery than we have at present. Cf. the chapter on transcen- 
dental extensions. 


§2. ALGEBRAIC CLOSURE 

In this and the next section we shall deal with embeddings of a field into 
another. We therefore define some terminology. 

Let E be an extension of a field F and let 

<j:F-L 

be an embedding (i.e. an injective homomorphism) of F into L. Then o 
induces an isomorphism of F with its image oF, which is sometimes written 
F a . An embedding t of E in L will be said to be over o if the restriction of t 
to F is equal to o. We also say that t extends o. If o is the identity then we 
say that t is an embedding of E over F. 

These definitions could be made in more general categories, since they 
depend only on diagrams to make sensç: 


E —-—► L E —-—► L 



F -► L F 


Remark. Let f(X) e F[X] be a polynomial, and let ot be a root of / in 
E. SsLyf(X) = a 0 + -- + a n X n . Then 

0 = f(a) = a 0 + a y a + • • • -f a n a n . 

If t extends o as above, then we see that xot is a root of f a because 
0 = r(/(a)) = aS + a^(r a) + • • • + <( ta)". 

Here we have written a a instead of o(a). This exponential notation is 
frequently convenient and will be used again in the sequel. Similarly, we 
write F a instead of cr(F) or oF. 

In our study of embeddings it will also be useful to have a lemma 
concerning embeddings of algebraic extensions into themselves. For this we 
note that if o: E -> L is an embedding over k (i.e. inducing the identity on /c), 
then o can be viewed as a /c-homomorphism of vector spaces, because both 
L, L can be viewed as vector spaces over k. Furthermore o is injective. 
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Lemma 2.1. Let E be an algebraic extension of k, and let o: E -► E be an 

embedding of E into itself over k. Then o is an automorphism. 

Proof. Since o is injective, it will suffice to prove that o is surjective. Let 
a be an element of E, let p(X) be its irreducible polynomial over fc, and let E' 
be the subfield of E generated by all the roots of p(X) which lie in E. Then 
E' is finitely generated, hence is a finite extension of fc. Furthermore, o must 
map a root of p(X) on a root of p(X\ and hence o maps E' into itself. We 
can view o as a fc-homomorphism of vector spaces because o induces the 
identity on fc. Since o is injective, its image o(E') is a subspace of E' having 
the same dimension [E’ : fc]. Hence cr(£') = E'. Since a e E', it follows that 
a is in the image of a, and our lemma is proved. 

Let E, F be extensions of a field fc, contained in some bigger field L. We 
can form the nng E[F] generated by the elements of F over E. Then E[F] = 
F[E], and EF is the quotient field of this ring. It is clear that the elements of 
E[F] can be written in the form 

Mi +••• +a n b„ 

with a { e E and ò, e F. Hence EF is the field of quotients of these elements. 

Lemma 2.2. Let E í9 E 2 be extensions of a field fc, contained in some 

bigger field E, and let o be an embedding of E in some field L. Then 

Proof. We apply o to a quotient of elements of the above type, say 

í a ibi + ••• + o n K \ _ a lK + QnK 
\a\b\ + • • • + a' m b'J ~ a'fb'f + • • • + a'°b'°' 

and see that the image is an element of o^EJo^E^). It is clear that the image 
o^E^E^) is o^E^E^). 

Let fc be a field, f(X) a polynomial of degree ^ 1 in fc[X]. We consider 
the problem of finding an extension E of fc in which / has a root. If p(X) is 
an irreducible polynomial in fc[X] which divides f(X), then any root of p(X) 
will also be a root of f(X), so we may restrict ourselves to irreducible 
polynomials. 

Let p(X) be irreducible, and consider the canonical homomorphism 
o:klXl^klXl/{p(X)). 

Then o induces a homomorphism on fc, whose kernel is 0, because every 
nonzero element of fc is invertible in fc, generates the unit ideal, and 1 does 
not lie in the kernel. Let Ç be the image of X under o, i.e. Ç = o(X) is the 
residue class of X mod p(X). Then 

P'it) = P°(X°) = (p(X)Y = 0. 
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Hence ^ is a root of p a 9 and as such is algebraic over ak. We have now 
found an extension of o /c, namely ok(Ç) in which p a has a root. 

With a minor set-theoretic argument, we shall have: 

Proposition 2.3. Let k be a field and f a polynomial in k[X ] of degree 
è 1. Then there exists an extension E of k in which f has a root. 

Proof. We may assume that / = p is irreducible. We have shown that 
there exists a field F and an embedding 

cr: k-> F 

such that p a has a root f in F. Let S be a set whose cardinality is the same 
as that of F — <rk (= the complement of ak in F) and which is disjoint from 
k. Let E = k u S. We can extend a: k -» F to a bijection of E on F. We now 
define a field structure on E. If x, y e E we define 

xy = <r -1 (ffW<7(>0). 
x + y = (t~ 1 (<j(x) + a(y)). 

Restricted to /c, our addition and multiplication coincide with the given 
addition and multiplication of our original field k, and it is clear that k is a 
subfield of E. We let ot = o~ l (Ç). Then it is also clear that p( a) = 0, as 
desired. 

Corollary 2.4. Let k be a field and let f í9 ..., f n be polynomials in /c[X] 
of degrees ^ 1. Then there exists an extension E of k in which each f has 
a root , i = 1,..., n. 

Proof. Let E { be an extension in which / x has a root. We may view / 2 
as a polynomial over E t . Let E 2 be an extension of E Y in which / 2 has a 
root. Proceeding inductively, our corollary follows at once. 

We define a field L to be algebraically closed if every polynomial in L[X] 
of degree ^ 1 has a root in L. 

Theorem 2.5. Let kbe a field. Then there exists an algebraically closedfield 
containing k as a subfield. 

Proof. We first construct an extension E x of k in which every polyno- 
mial in k[X ] of degree ^ 1 has a root. One can proceed as follows (Artin). 
To each polynomial / in /c[^] of degree ^ 1 we associate a letter X f and we 
let S be the set of all such letters X f (so that S is in bijection with the set of 
polynomials in /c[X] of degree ^ 1). We form the polynomial ring /c[S], and 
contend that the ideal generated by all the polynomials f(X f ) in /c[5] is not 
the unit ideal. If it is, then there is a finite combination of elements in our 
ideal which is equal to 1: 

9ifx(X fi ) + -- + g n f n (X f J=í 
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with g t e fc[S]. For simplicity, write X t instead of X fr The polynomials 
will involve actually only a finite number of variables, say X l9 ..., X N (with 
N ^ n ). Our relation then reads 

Ìg i (X u ...,X N )f i (X i )=ì. 

i = 1 

Let F be a finite extension in which each polynomial has a root, 

say a f is a root of f in F, for i = 1, ..., n. Let a^ = 0 for i > n. Substitute a f 
for Xi in our relation. We get 0 = 1, contradiction. 

Let m be a maximal ideal containing the ideal generated by all polyno- 
mials f(X f ) in fc[S]. Then fc[S]/m is a field, and we have a canonical map 

o: fc[S] - fc[S]/m. 

For any polynomial / g fc[A"] of degree ^ 1, the polynomial f a has a root in 
fc[S]/m, which is an extension of <rfc. Using the same type of set-theoretic 
argument as in Proposition 2.3, we conclude that there exists an extension 
E v of fc in which every polynomial /e fc[2f] of degree ^ 1 has a root in E x . 
Inductively, we can form a sequence of fields 

E t c £ 2 c= £ 3 c= ••• c= E n ••• 

such that every polynomial in F w [2f] of degree ^ 1 has a root in £ n+1 . Let E 
be the union of all fields E nì n = 1, 2, .... Then E is naturally a field, for if 
x, ye E then there exists some n such that x, y e £ n , and we can take the 
product or sum xy or x -f y in E n . This is obviously independent of the 
choice of n such that x, y e E n , and defines a field structure on E. Every 
polynomial in £[2f] has its coeíficients in some subfield £ n , hence a root in 
F n+1 , hence a root in £, as desired. 

Corollary 2.6. Let k be a fìeld. There exists an extension fc a which is 
algebraic over k and algebraically closed. 

Proof. Let E be an extension of fc which is algebraically closed and let 
fc a be the union of all subextensions of E , which are algebraic over fc. Then 
fc a is algebraic over fc. If a e E and a is algebraic over fc a then a is algebraic 
over fc by Proposition 1.7. If / is a polynomial of degree ^ 1 in fc a [A"], then 
/ has a root a in £, and a is algebraic over fc a . Hence a is in fc a and fc a is 
algebraically closed. 

We observe that if L is an algebraically closed field, and / e L[X~\ has 
degree ^ 1, then there exists ceL and a x ,..., a B e L such that 

f(X) = c(X- ai )-(2f-a n ). 

Indeed, / has a root a^ in L, so there exists #(20 e L[X\ such that 

/(X) = (X - * x )g(l í). 

If deg 0 ^ 1, we can repeat this argument inductively, and express / as a 
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product of terms (X — a*) (i = 1, n) and an element c e L. Note that c is 
the leading coefficient of / i.e. 

f(X) = cX n + terms of lower degree. 

Hence if the coefficients of / lie in a subfield k of L, then cek. 

Let k be a field and a:k-> L an embedding of k into an algebraically 
closed field L. We are interested in analyzing the extensions of o to algebraic 
extensions E of k. We begin by considering the special case when E is 
generated by one element. 

Let E = k(oc) where a is algebraic over k. Let 

p(X) = Irr(a, k, X). 

Let P be a root of p a in L. Given an element of /c(a) = /c[a], we can write it 
in the form /(a) with some polynomial f(X) e /cfA"]. We define an extension 
of o by mapping 

nmnn 

This is in fact well defined, i.e. independent of the choice of polynomial f(X) 
used to express our element in /c[a]. Indeed, if g(X) is in /cfA'] and such that 
^r(a) = /(a), then (g-f)(<x) = 0 9 whence p(X) divides g(X) - f(X). Hence 
p a (X) divides g a (X) — f a (X), and thus g a (f) = f a (P)- It is now clear that our 
map is a homomorphism inducing o on /c, and that it is an extension of o to 
/c(a). Hence we get: 

Proposition 2.7. The number of possible extensions of o to /c(a) is ^ the 
number of roots of p, and is equal to the number of distinct roots of p. 

This is an important fact, which we shall analyze more closely later. For 
the moment, we are interested in extensions of o to arbitrary algebraic 
extensions of k. We get them by using Zorn’s lemma. 

Theorem 2.8. Let k be a field, E an algebraic extension of k, and 
o:k^*L an embedding of k into an algebraically closed field L. Then 
there exists an extension of o to an embedding of E in L. If E is 
algebraically closed and L is algebraic over ok, then any such extension of 
o is an isomorphism of E onto L. 

Proof. Let S be the set of all pairs (F, t) where F is a subfield of E 
containing k, and t is an extension of o to an embedding of F in L. If (F, t) 
and (F', t') are such pairs, we write (F, t) ^ (F', t') if FcF and t'|F = t. 
Note that S is not empty [it contains (k, cr)], and is inductively ordered: If 
{(Fi, t i)} is a totally ordered subset, we let F = (J F t and define t on F to be 
equal to T f on each F f . Then (F, t) is an upper bound for the totally ordered 
subset. Using Zorn’s lemma, let (K, X) be a maximal element in S. Then X is 
an extension of o, and we contend that K = E. Otherwise, there exists a e E, 
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a $ K. By what we saw above, our embedding Â has an extension to K( a), 
thereby contradicting the maximality of ( K , A). This proves that there exists 
an extension of o to E. We denote this extension again by o. 

If E is algebraically closed, and L is algebraic over crfc, then oE is 
algebraically closed and L is algebraic over crL, hence L = oE. 

As a corollary, we have a certain uniqueness for an “algebraic closure” of 
a field fc. 

Corollary 2.9. Let kbe a field and let £, E ' be algebraic extensions of k. 
Assume that £, E' are algebraically closed. Then there exists an iso- 
morphism 

t : E -+ E' 

of E onto E ' inducing the identity on k. 

Proof. Extend the identity mapping on fc to an embedding of E into E’ 
and apply the theorem. 

We see that an algebraically closed and algebraic extension of fc is 
determined up to an isomorphism. Such an extension will be called an 
algebraic closure of fc, and we frequently denote it by fc a . In fact, unless 
otherwise specified, we use the symbol fc a only to denote algebraic closure. 

It is now worth while to recall the general situation of isomorphisms and 
automorphisms in general categories. 

Let d be a category, and A, B objects in G. We denote by Iso(^, B) the 
set of isomorphisms of A on B. Suppose there exists at least one such 
isomorphism o: A-+B, with inverse o~ l : B A. If (p is an automorphism of 

A , then o o (p: A -► B is again an isomorphism. If ij/ is an automorphism of 

B , then ij/ o a: A -> B is again an isomorphism. Furthermore, the groups 
of automorphisms Aut(^) and Aut(B) are isomorphic, under the mappings 

(p^OO(po o~ l , 

0~ l o \j/ o (j<Hil/, 

which are inverse to each other. The isomorphism o o (p o cr" 1 is the one 
which makes the following diagram commutative: 



<7 


We have a similar diagram for o~ l o \j/ o o. 

Let t: A -> B be another isomorphism. Then t _1 o o is an automorphism 
of A, and t o cr " 1 is an automorphism of B. Thus two isomorphisms differ by 
an automorphism (of A or B). We see that the group Aut(B) operates on the 
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set Iso(,4, B) on the left, and Aut (A) operates on the set Iso(/l, B) on the 
right. 

We also see that Aut(v4) is determined up to a mapping analogous to a 
conjugation. This is quite diíferent from the type of uniqueness given by 
universal objects in a category. Such objects have only the identity auto- 
morphism, and hence are determined up to a unique isomorphism. 

This is not the case with the algebraic closure of a field, which usually 
has a large amount of automorphisms. Most of this chapter and the next is 
devoted to the study of such automorphisms. 

Examples. It will be proved later in this book that the complex numbers 
are algebraically closed. Complex conjugation is an automorphism of C. 
There are many more automorphisms, but the other automorphisms =é id. are 
not continuous. We shall discuss other possible automorphisms in the chapter 
on transcendental extensions. The subfield of C consisting of all numbers which 
are algebraic over Q is an algebraic closure Q a of Q. It is easy to see that Q a 
is denumerable. In fact, prove the following as an exercise: 

If k is a field which is not finite , then any algebraic extension of k has the 

same cardinality as k. 

If k is denumerable, one can first enumerate all polynomials in k, then 
enumerate finite extensions by their degree, and finally enumerate the cardi- 
nality of an arbitrary algebraic extension. We leave the counting details as 
exercises. 

In particular, Q a # C. If R is the field of real numbers, then R a = C. 

If k is a finite field, then algebraic closure k a of k is denumerable. We 
shall in fact describe in great detail the nature of algebraic extensions of 
finite fields later in this chapter. 

Not all interesting fields are subfields of the complex numbers. For 
instance, one wants to investigate the algebraic extensions of a field C(X) 
where X is a variable over C. The study of these extensions amounts to the 
study of ramified coverings of the sphere (viewed as a Riemann surface), and 
in fact one has precise information concerning the nature of such extensions, 
because one knows the fundamental group of the sphere from which a finite 
number of points has been deleted. We shall mention this example again 
later when we discuss Galois groups. 


§3. SPLITTING FIELDS AND 
NORMAL EXTENSIONS 

Let k be a field and let / be a polynomial in k[_X~\ of degree ^ 1. By a 
splitting field K of / we shall mean an extension K of k such that / splits 
into linear factors in K 9 i.e. 
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/(*) = c(*-<*,)•••(*- a„) 

with a f e X, i = 1, n, and such that X = fe(a x ,..., a n ) is generated by all 
the roots of /. 

Theorem 3.1. Let Kbe a splitting field of the polynomial f(X) e k[X\ If 
E is another splitting field of /, then there exists an isomorphism o: £ -► K 
inducing the identity on k. IfkczKcz fc a , where k a is an algebraic closure 
of k, then any embedding of E in 1â inducing the identity on k must be an 
isomorphism of E onto K. 

Proof. Let X a be an algebraic closure of K. Then X a is algebraic over 
fc, hence is an algebraic closure of fc. By Theorem 2.8 there exists an 
embedding 

cr:£-X a 

inducing the identity on fc. We have a factorization 
f(X) = c(x-M'-(x-p n ) 

with Pi e £, i = 1, ..., n. The leading coefficient c lies in fc. We obtain 
f(X) = f a (X) = c(X &Pi )* * * (X op n ). 

We have unique factorization in X a [X]. Since / has a factorization 
/(X) = c(X-a 1 )*-*(X-a n ) 

in X[X], it follows that (op l9 ..., oP n ) differs from (a 1? ..., a„) by a permuta- 
tion. From this we conclude that oP t e K for i= 1, ..., n and hence that 
oE cz K. But K = fc(a 1? ..., aj = k(oP í9 ..., crj? n ), and hence oE = X, because 

E = k(p u ... 9 p n ). 

This proves our theorem. 

We note that a polynomial /(X) g fc[X] always has a splitting field, 
namely the field generated by its roots in a given algebraic closure fc a of fc. 

Let / be a set of indices and let {/} ig / be a family of polynomials in 
fc[X], of degrees ^ 1. By a splitting field for this family we shall mean an 
extension X of fc such that every / splits in linear factors in X[X], and X is 
generated by all the roots of all the polynomials /, i g /. In most applica- 
tions we deal with a finite indexing set /, but it is becoming increasingly 
important to consider infinite algebraic extensions, and so we shall deal with 
them fairly systematically. One should also observe that the proofs we shall 
give for various statements would not be simpler if we restricted ourselves to 
the finite case. 

Let fc a be an algebraic closure of fc, and let X f be a splitting field of / in 
fc a . Then the compositum of the X, is a splitting field for our family, 
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since the two conditions defining a splitting field are immediately satisfied. 
Furthermore Theorem 3.1 extends at once to the infinite case: 

Corollary 3.2. Let K be a splitting field for the family {fi} ieI and let E 

be another splitting field. Any embedding of E into K a inducing the 

identity on k gives an isomorphism of E onto K. 

Proof. Let the notation be as above. Note that E contains a unique 
splitting field of f and K contains a unique splitting field K t of f. Any 
embedding o of E into K a must map E { onto K t by Theorem 3.1, and hence 
maps E into K. Since K is the compositum of the fields K h our map o must 
send E onto K and hence induces an isomorphism of E onto K. 

Remark. If / is finite, and our polynomials are f u ..., /„, then a split- 
ting field for them is a splitting field for the single polynomial f(X) = 
fi(X) **• f n (X) obtained by taking the product. However, even when dealing 
with finite extensions only, it is convenient to deal simultaneously with sets 
of polynomials rather than a single one. 

Theorem 3.3. Let K be an algebraic extension of k, contained in an 

algebraic closure k a of k. Then the follomng conditions are equivalent: 

NOR 1. Every embedding of K in ìâ over k induces an automorphism of K. 

NOR 2. K is the splitting field of a family of polynomials in /cfX]. 

NOR 3. Every irreducible polynomial of /c[X] which has a root in K 
splits into linear factors in K. 

Proof. Assume NOR 1. Let a be an element of K and let p a (X) be its 
irreducible polynomial over k. Let /? be a root of p a in /c a . There exists an 
isomorphism of /c(a) on k(f) over /c, mapping a on f. Extend this iso- 
morphism to an embedding of K in /c a . This extension is an automorphism o 
of K by hypothesis, hence cra = f lies in K. Hence every root of p a lies in K , 
and p a splits in linear factors in K[X~\. Hence K is the splitting field of the 
family {p a } ae x as a ranges over all elements of K , and NOR 2 is satisfied. 

Conversely, assume NOR 2, and let {/} Í6/ be the family of polynomials 
of which K is the splitting field. If a is a root of some / in K , then for any 
embedding cr of X in /c a over k we know that cra is a root of f. Since K is 
generated by the roots of all the polynomials /, it follows that cr maps K 
into itself. We now apply Lemma 2.1 to conclude that cr is an automorphism. 

Our proof that NOR 1 implies NOR 2 also shows that NOR 3 is 
satisfied. Conversely, assume NOR 3. Let cr be an embedding of K in /c a 
over k. Let a e K and let p(X) be its irreducible polynomial over k. If cr is 
an embedding of K in /c a over k then cr maps aona root /? of p(X), and by 
hypothesis f lies in K. Hence cra lies in X, and cr maps X into itself. By 
Lemma 2.1, it follows that cr is an automorphism. 
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An extension K of fc satisfying the hypotheses NOR 1, NOR 2, NOR 3 
will be said to be normal. It is not true that the class of normal extensions is 
distinguished. For instance, it is easily shown that an extension of degree 2 
is normal, but the extension Q(^/2) of the rational numbers is not normal 
(the complex roots of X 4 — 2 are not in it), and yet this extension is obtained 
by successive extensions of degree 2, namely 

E = Q(^2) => F = Q, 

where 

F = Q(a), a = and E = Fi^/oc). 

Thus a tower of normal extensions is not necessarily normal. However, we 
still have some of the properties: 

Theorem 3.4. Normal extensions remain normal under lifting. If 
K 3 E => k and K is normal over fc, then K is normal over E. If K ì9 K 2 
are normal over k and are contained in some field L, then K Y K 2 is normal 
over k , and so is K x nX 2 . 

Proof. For our first assertion, let K be normal over fc, let F be any 
extension of k, and assume K , F are contained in some bigger field. Let o be 
an embedding of KF over F (in F a ). Then o induces the identity on F, hence 
on fc, and by hypothesis its restriction to K maps K into itself. We get 
(KF) a = K a F a = KF whence KF is normal over F. 

Assume that K => E => fc and that K is normal over fc. Let o be an 
embedding of K over E. Then o is also an embedding of K over fc, and 
our assertion follows by definition. 

Finally, if K ì9 K 2 are normal over fc, then for any embedding o of K t K 2 
over fc we have 


o(K x K 2 ) = o(K x )o(K 2 ) 

and our assertion again follows from the hypothesis. The assertion concern- 
ing the intersection is true because 

We observe that if K is a finitely generated normal extension of fc, say 

K = fc(a x ,..., a„), 

and p l9 ...,p n are the respective irreducible polynomials of oc í9 ... 9 oc n over 
fc then K is already the splitting field of the finite family p l9 ... 9 p n . We 
shall investigate later when K is the splitting field of a single irreducible 
polynomial. 
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§4. SEPARABLE EXTENSIONS 

Let E be an algebraic extension of a field F and let 

o: F -> L 

be an embedding of F in an algebraically closed field L. We investigate more 
closely extensions of o to E. Any such extension of o maps E on a subfield 
of L which is algebraic over oF. Hence for our purposes, we shall assume 
that L is algebraic over oF , hence is equal to an algebraic closure of oF. 

Let S a be the set of extensions of o to an embedding of E in L. 

Let L' be another algebraically closed field, and let t: F -> L be an 
embedding. We assume as before that L' is an algebraic closure of xF. 
By Theorem 2.8, there exists an isomorphism À: L -► L' extending the map 
t o cr _1 applied to the field oF. This is illustrated in the following diagram: 

L' * --- L 

< - E 

I 

t F <- F -► oF 

x a 

We let S z be the set of embeddings of E in L' extending t. 

If o* e S a is an extension of o to an embedding of E in L, then À o o* is 
an extension of t to an embedding of E into L', because for the restriction to 
F we have 

X o o* = T o <T -1 o (j = T. 

Thus À induces a mapping from S a into S z . It is clear that the inverse 
mapping is induced by À~\ and hence that S a9 S z are in bijection under the 
mapping 

(J*h2o <T*. 

In particular, the cardinality of S a , S z is the same. Thus this cardinality 
depends only on the extension E/F, and will be denoted by 

IE:FI. 

We shall call it the separable degree of E over F . It is mostly interesting 
when E/F is finite. 

Theorem 4.1. Let L F => k be a tower. Then 

[£:/c] s = [L:F] s [F:fc] s . 

Furthermore , if E is finite over fc, then [E: fc] s is finite and 
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The separable degree is at most equal to the degree. 

Proof. Let o: k -► L be an embedding of k in an algebraically closed field 
L. Let {crjfç/ be the family of distinct extensions of o to F, and for each i, let 
{t ij} be the family of distinct extensions of o ( to E. By what we saw before, 
each o t has precisely [£: F] s extensions to embeddings of E in L. The set of 
embeddings {t fj } contains precisely 

LE:FUF:k] s 

elements. Any embedding of E into L over o must be one of the x ij9 and thus 
we see that the first formula holds, i.e. we have multiplicativity in towers. 

As to the second, let us assume that E/k is finite. Then we can obtain E 
as a tower of extensions, each step being generated by one element: 

k cz /c(a x ) c: k( a x , a 2 ) c: ••• c fc(a t ,..., a r ) = E. 

If we define inductively F v+1 = L v (a v+1 ) then by Proposition 2.7, 

[fv(«v+i) : F vl á [Fv(«v + i): Fvl 

Thus our inequality is true in each step of the tower. By multiplicativity, it 
follows that the inequality is true for the extension E/k, as was to be shown. 

Corollary 4.2. Let E be finite over k, and E => F z> k. The equality 

[L: k] s = [L: /c] 

holds if and only if the corresponding equality holds in each step of the 
tower , i.e. for E/F and F/k. 

Proof. Clear. 

It will be shown later (and it is not difficult to show) that [ E: k] s divides 
the degree [ E: k] when E is finite over k. We define [£: /c] f to be the 
quotient, so that 

LE:k] s LE:k]i = [E:k]. 

It then follows from the multiplicativity of the separable degree and of the 
degree in towers that the symbol [£: /c] f is also multiplicative in towers. We 
shall deal with it at greater length in §6. 

Let E be a finite extension of k. We shall say that E is separable over k if 
LE:k] s = LE:k]. 

An element a algebraic over k is said to be separable over k if /c(a) is 
separable over k. We see that this condition is equivalent to saying that the 
irreducible polynomial Irr(a, /c, X) has no multiple roots. 

A polynomial f(X)ekLX] is called separable if it has no multiple roots. 



SEPARABLE EXTENSIONS 241 


V, §4 

If a is a root of a separable polynomial g(X)ek[X ] then the irreducible 
polynomial of a over k divides g and hence a is separable over k. 

We note that if k C F C K and a e K is separable over k , then a is separable 
overF. Indeed, if/is a separable polynomial in k[X] such that/(a) = 0, then 
/also has coefficients in F, and thus a is separable over F. (We may say that a 
separable element remains separable under lifting.) 

Theorem 4.3. Let E be a finite extension of k. Then E is separable over k 
if and only if each element of E is separable over k. 

Proof. Assume E is separable over k and let a e E. We consider the 
tower 

k c= fe(a) c= E. 

By Corollary 4.2, we must have [fe(a):fe] = [fe(a):fe] 5 whence a is separable 
over fe. Conversely, assume that each element of E is separable over fe. We 
can write E = k(a l9 . .., a n ) where each a, is separable over fe. We consider 
the tower 

fe<= fe(a x ) c= fe(a x , a 2 ) c ••• c fe(a l5 ..., aj. 

Since each a f is separable over fe, each a f is separable over fe(a l5 ..., a^) for 
i ^ 2. Hence by the tower theorem, it follows that E is separable over fe. 

We observe that our last argument shows: If E is generated by a finite 
number of elements, each of which is separable over fe, then E is separable 
over fe. 

Let E be an arbitrary algebraic extension of fe. We define E to be 
separable over fe if every finitely generated subextension is separable over 
fe, i.e., if every extension fe(aj,..., a„) with a^,..., a n e E is separable 
over fc. 

Theorem 4.4. Let E be an algebraic extension of fc, generated by a 
family of elements {(Xi} ieI . If each a t - is separable over k then E is 
separable over k. 

Proof. Every element of E lies in some finitely generated subfield 

k(ot ií 9 ...,a in ) 9 

and as we remarked above, each such subfield is separable over fe. Hence 
every element of E is separable over fe by Theorem 4.3, and this concludes 
the proof. 

Theorem 4.5. Separable extensions form a distinguished class of exten- 
sions. 
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Proof. Assume that E is separable over k and let E F k. Every 
element of E is separable over F, and every element of F is an element of £, 
so separable over k. Hence each step in the tower is separable. Conversely, 
assume that E z> F => k is some extension such that E/F is separable and F/k 
is separable. If E is finite over k , then we can use Corollary 4.2. Namely, we 
have an equality of the separable degree and the degree in each step of the tower, 
whence an equality for E over k by multiplicativity. 

If E is infinite, let oc e E. Then a is a root of a separable polynomial f(X) 
with coeflìcients in F. Let these coeffìcients be a n ,... 9 a 0 . Let F 0 = 
k(a n9 ..., a 0 ). Then F 0 is separable over /c, and a is separable over F 0 . We 
now deal with the finite tower 


k <= F 0 <= F 0 (a) 

and we therefore conclude that F 0 (a) is separable over /c, hence that a 
is separable over k. This proves condition (1) in the definition of 
“distinguished.” 

Let E be separable over k. Let F be any extension of /c, and assume that 
£, F are both subfields of some field. Every element of E is separable over /c, 
whence separable over F. Since EF is generated over F by all the elements 
of E , it follows that EF is separable over F, by Theorem 4.4. This proves 
condition (2) in the definition of “distinguished,” and concludes the proof of 
our theorem. 

Let E be a fìnite extension of k. The intersection of all normal extensions 
K of k (in an algebraic closure F a ) containing E is a normal extension of k 
which contains F, and is obviously the smallest normal extension of k 
containing F. If o l9 ..., a n are the distinct embeddings of F in F a , then the 
extension 


K = (o,E)(o 2 E)-*(o n E\ 

which is the compositum of all these embeddings, is a normal extension of /c, 
because for any embedding of it, say t, we can apply t to each extension 
dfF. Then (ro l9 ..., xo n ) is a permutation of (o u ..., o n ) and thus t maps K 
into itself. Any normal extension of k containing F must contain o t E for 
each ï, and thus the smallest normal extension of k containing E is precisely 
equal to the compositum 

(d^F) ••• (o n E). 

If F is separable over /c, then from Theorem 4.5 and induction we 
conclude that the smallest normal extension of k containing F is also separ- 
able over k. 

Similar results hold for an infinite algebraic extension F of /c, taking an 
infinite compositum. 
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In light of Theorem 4.5, the compositum of all separable extensions of a 
field k in a given algebraic closure /c a is a separable extension, vvhich will be 
denoted by k s or /c sep , and will be called the separable closure of /c. As a 
matter of terminology, if E is an algebraic extension of /c, and cr any 
embedding of E in /c a over /c, then we call oE a conjugate of E in /c a . We can 
say that the smallest normal extension of k containing E is the compositum of 
all the conjugates of E in £ a . 

Let oc be algebraic over k. If o x ,..., o r are the distinct embeddings of /c(a) 
into /c a over /c, then we call cr^a, ..., o r ot the conjugates of a in /c*. These 
elements are simply the distinct roots of the irreducible polynomial of a over 
k. The smallest normal extension of k containing one of these conjugates is 
simply /c^a,..., o r oc). 


Theorem 4.6. (Primitive Element Theorem). Let E be a finite extension 
of a field k. There exists an element ae E such that E = fc(a) if and only 
if there exists only a finite number of fields F such that k cz F a E. If E 
is separable over /c, then there exists such an element a. 

Proof. If k is finite, then we know that the multiplicative group of E is 
generated by one element, which will therefore also generate E over k. We 
assume that k is infinite. 

Assume that there is only a finite number of fields, intermediate between 
k and E. Let a, f e E. As c ranges over elements of /c, we can only have 
a finite number of fields of type /c(a -f cfi). Hence there exist elements c u 
c 2 e k with Cj # c 2 such that 

/c(a -f c^j?) = /c(a -f c 2 jS). 

Note that a + c x j? and a -f c 2 /? are in the same field, whence so is (Cj — c 2 )j8, 
and hence so is /?. Thus a is also in that field, and we see that /c(a, /?) can be 
generated by one element. 

Proceeding inductively, if E = k(oc í9 ...,a n ) then there will exist elements 
c 2 , ..., c n e k such that 

e = m 

where t; = oc^ -f c 2 a 2 H-f c„a n . This proves half of our theorem. 

Conversely, assume that E = k(oc) for some a, and let f(X) = Irr(a, /c, X). 
Let k a F a E. Let g F (X) = Irr(a, F, A"). Then g F divides /. We have unique 
factorization in E[X\ and any polynomial in E[X~\ which has leading 
coefficient 1 and divides f(X) is equal to a product of factors (X — a t ) where 
a l9 ..., a n are the roots of/in a fìxed algebraic closure. Hence there is only a 
fìnite number of such polynomials. Thus we get a mapping 

F\-^g F 

from the set of intermediate fields into a finite set of polynomials. Let F 0 be 
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the subfield of F generated over fc by the coeíficients of g F (X). Then g F has 
coefficients in F 0 and is irreducible over F 0 since it is irreducible over F. 
Hence the degree of a over F 0 is the same as the degree of a over F. Hence 
F = F 0 . Thus our field F is uniquely determined by its associated poly- 
nomials g Fì and our mapping is therefore injective. This proves the first 
assertion of the theorem. 

As to the statement concerning separable extensions, using induction, 
we may assume without loss of generality that E = /c(a, /?) where a, /? are 
separable over k. Let a l9 ..., a n be the distinct embeddings of k( a, fi) in fc a 
over fc. Let 


P(X) = n (Ofl + Xa$ - Oj-a - Xafi\ 

i*j 

Then P(X) is not the zero polynomial, and hence there exists c ek such 
that P(c) # 0. Then the elements cr^a + cfï) (i = 1, n) are distinct, whence 
fc(a + cj?) has degree at least n over fc. But n = [fc(a, /?): fc], and hence 


as desired. 


fc(a, P) = k(oc + cp). 


If E = fc(a), then we say that a is a primitive element of E (over fc). 


§5. FINITE FIELDS 

We have developed enough general theorems to describe the structure of 
finite fields. This is interesting for its own sake, and also gives us examples 
for the general theory. 

Let F be a finite field with q elements. As we have noted previously, we 
have a homomorphism 

Z-> + 

sending 1 on 1, whose kernel cannot be 0, and hence is a principal ideal 
generated by a prime number p since Z/pZ is embedded in F and F has no 
divisors of zero. Thus F has characteristic p, and contains a field isomorphic 
to Z/pZ. 

We remark that Z/pZ has no automorphisms other than the identity. 
Indeed, any automorphism must map 1 on 1, hence leaves every element 
fixed because 1 generates ZjpZ additively. We identify Z/pZ with its image 
in F. Then F is a vector space over Z/pZ, and this vector space must be 
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finite since F is finite. Let its degree be n. Let (o í9 ...,co n be a basis for F 
over Z/pZ. Every element of F has a unique expression of the form 

ûìû>ì + ••• + a n^n 

with a t e Z/pZ. Hence q = p n . 

The multiplicative group F* of F has order q — 1. Every oc e F* satisfies 
the equation X q ~ l = 1. Hence every element of F satisfies the equation 

f(X) = X q - X = 0. 

This implies that the polynomial f(X) has q distinct roots in F, namely all 
elements of F. Hence / splits into factors of degree 1 in F, namely 

xq - x = n (x - 4 

a e F 

In particular, F is a splitting field for /. But a splitting field is uniquely 
determined up to an isomorphism. Hence if a finite field of order p n exists, it 
is uniquely determined, up to an isomorphism, as the splitting field of 
X pn - X over Z/pZ. 

As a matter of notation, we denote Z/pZ by F p . Let n be an integer ^ 1 
and consider the splitting field of 

x pn - X = f(X) 

in an algebraic closure F£. We contend that this splitting field is the set of 
roots of f(X) in F£. Indeed, let a, p be roots. Then 

(a + P) pn - (a + fi) = a pn + p pn - a - fi = 0, 

whence a -f p is a root. Also, 

(ocp) pn — ocp = oc pn p pn -ocp = ocp-ocp = 0 9 

and ocp is a root. Note that 0, 1 are roots of f(X). If p # 0 then 

(p-') pn -P~'=(P pn )-'-p-'=0 


so that P 1 is a root. Finally, 

(-pr-(-P)=(-\rp pn + p. 

If p is odd, then (— l) p " = — 1 and we see that —p is a root. If p is even then 
— 1 = 1 (in Z/2Z) and hence -P = P is a root. This proves our contention. 
The derivative of f(X) is 

f'(X) = p n X pn ~ l - 1 = -1. 

Hence f(X) has no multiple roots, and therefore has p n distinct roots in 
F p . Hence its splitting field has exactly p n elements. We summarize our 
results: 
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Theorem 5.1. For each prime p and each integer n ^ 1 there exists a finite 
field of order p n denoted by F pn , uniquely determined as a subfield of an 
algebraic closure F p . It is the splitting field of the polynomiaì 

X pn - X 9 

and its elements are the roots of this polynomial. Every finite field is 
isomorphic to exactly one field F pn . 

We usually write p n = q and F^ instead of F pn . 

Corollary 5.2. Let be a finite field. Let n be an integer ^l. In a 
given algebraic closure F£, there exists one and only one extension of F q of 
degree n 9 and this extension is the field F^. 

Proof. Let q = p m . Then q n = p mn . The splitting field of X qn — X is 
precisely F pmn and has degree mn over Z/pZ. Since F^ has degree m over 
Z/pZ, it follows that F 9 n has degree n over F^. Conversely, any extension of 
degree n over F^ has degree mn over F p and hence must be F pmn . This proves 
our corollary. 

Theorem 5.3. The multiplicative group of a finite field is cyclic. 

Proof. This has already been proved in Chapter IV, Theorem 1.9. 

We shall determine all automorphisms of a finite field. 

Let q = p n and let F^ be the finite field with q elements. We consider the 

Frobenius mapping 

<P- F, - F, 

such that (p(x ) = x p . Then is a homomorphism, and its kernel is 0 since F^ 
is a field. Hence q> is injective. Since F^ is finite, it follows that cp is 
surjective, and hence that cp is an isomorphism. We note that it leaves F p 
fixed. 

Theorem 5.4. The group of automorphisms of F q is cyclic of degree n, 
generated by cp. 

Proof. Let G be the group generated by cp. We note that (p n = id 
because (p n (x) = x pn = x for all x e F r Hence n is an exponent for (p. Let d 
be the period of (p 9 so d ^ 1. We have (p d (x) = x pd for all x e F q . Hence each 
x e F q is a root of the equation 

x pd - X = 0 . 

This equation has at most p d roots. It follows that d ^ n, whence d = n. 

There remains to be proved that G is the group of all automorphisms of 
F q . Any automorphism of F^ must leave F p fixed. Hence it is an auto- 
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morphism of F^ over F p . By Theorem 4.1, the number of such auto- 
morphisms is ^ n. Hence ¥ q cannot have any other automorphisms except 
for those of G. 

Theorem 5.5. Let m, n be integers ^ 1. Then in any algebraic closure of 
¥ p , the subfield ¥^ is contained in F p m ifand only if n divides m. Ifthat is the 
case, let q = p n , and let m = nd. Then F^m is normal and separable over F q , 
and the group of automorphisms of¥ pm over is cyclic oforder d, generated 
by cp n . 

Proof. All the statements are trivial consequences of what has already been 
proved and will be left to the reader. 


§6. INSEPARABLE EXTENSIONS 

This section is of a fairly technical nature, and can be omitted without 
impairing the understanding of most of the rest of the book. 

We begin with some remarks supplementing those of Proposition 2.7. 

Let f(X) = (X — a ) m g(X) be a polynomial in k[Jf], and assume X — oc 
does not divide g(X). We recall that m is called the multiplicity of a in /. 
We say that a is a multiple root of / if m > 1. Otherwise, we say that a is a 
simple root. 

Proposition 6.1. Let a be algebraic over k, a e /c a , and let 

f(X) = Irr(a, fc, X). 

If char k = 0, then all roots of f have multiplicity 1 (/ is separable). If 

char k = p > 0, 

then there exists an integer p ^ 0 such that every root of f has multiplicity 

pWe have 


[/c(a):k]=p^[/c(a):k] s , 
and u? is separable over k. 

Proof. Let a x , ..., a r be the distinct roots of / in /c a and let a = a x . Let 
m be the multiplicity of a in /. Given 1 ^ i ^ r, there exists an isomorphism 

c\ /c(a) -+ /c(a f ) 

over k such that cra = a f . Extend cr to an automorphism of /c a and denote 




248 ALGEBRAIC EXTENSIONS 


this extension also by a. Since / has coefficients in fc we have f a = /. We 
note that 

r 

f(X) = ÍI ÇX~ aaj) m j 
j= 1 

if YYij is the multiplicity of otj in /. By unique factorization, we conclude that 
YYii = and hence that all yyi x are equal to the same integer yyi. 

Consider the derivative f'(X). If / and /' have a root in common, then a 
is a root of a polynomial of lower degree than deg/. This is impossible 
unless deg/' = —oo, in other words, /' is identically 0. If the characteristic 
is 0, this cannot happen. Hence if / has multiple roots, we are in characteris- 
tic p, and f(X) = g(X p ) for some polynomial g(X) e k\_X~\. Therefore a p is a 
root of a polynomial g whose degree is < deg /. Proceeding inductively, we 
take the smallest integer p ^ 0 such that a p " is the root of a separable 
polynomial in k\X\, namely the polynomial h such that 

nx) = h(xn. 

Comparing the degree of / and g, we conclude that 


[fc(a): k( a p )] = p. 

Inductively, we find 

[ k(z):k(*n\=p p . 


Since h has roots of multiplicity 1, we know that 

[fe(an:fe] í = [fc(a p '):fe]. 


and comparing the degree of / and the degree of h , we see that the num- 
ber of distinct roots of / is equal to the number of distinct roots of h. 
Hence 


[k(a):k\ s = [k(an:k\ 5 . 

From this our formula for the degree follows by multiplicativity, and our 
proposition is proved. We note that the roots of h are 


Corollary 6.2. For any finite extension E of fc, the separable degree 
[E: fc] s divides the degree [E: fc]. The quotient is 1 if the characteristic is 
0, and a power of p if the characteristic is p > 0. 

Proof. We decompose E/k into a tower, each step being generated by 
one element, and apply Proposition 6.1, together with the multiplicativity of 
our indices in towers. 

If E/K is finite, we call the quotient 
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[£:fc] 

[£:fe] s 

the inseparable degree (or degree of inseparability), and denote it by [E : fc] f as 
in §4. We have 

[£:fc] s [£:fc] f = [£:fc]. 


Corollary 6.3. A finite extension is separable if and only if [E : fc] f = 1. 
Proof By definition. 

Corollary 6.4 If E =) F =) fc are two finite extensions , then 
[E : fc] f = [E : F] f [F: fc] f . 


Proof. Immediate by Theorem 4.1. 

We now assume throughout that fc is a field of characteristic p > 0. 

An element oc algebraic over fc is said to be purely inseparable over fc if 
there exists an integer n ^ 0 such that a pn lies in fc. 

Let E be an algebraic extension of fc. We contend that the following 
conditions are equivalent: 

P.Ins. 1. We have [E : fc] s = 1. 

P. Ins. 2. Every element a of E is purely inseparable over fc. 

P. Ins. 3. For every oce E, the irreducible equation of a over fc is of type 
X pn — a = 0 with some 0 and ae k. 

P. Ins. 4. There exists a set of generators {a f } fg/ of E over fc such that 
each a f is purely inseparable over fc. 

To prove the equivalence, assume P. Ins. 1. Let oce E. By Theorem 4.1, 
we conclude that [fc(a): fc] s = 1. Let f(X) = Irr(a, fc, X). Then / has only one 
root since 

[fc(a):fc] s 

is equal to the number of distinct roots of f(X). Let m = [fc(a):fc]. Then 
deg/=m, and the factorization of / over fc(a) is f(X) = (X — oc) m . Write 
m = p n r where r is an integer prime to p. Then 

f(X) = (X pn - oc pn ) r 

= X pnr - roc pn X pn{r - l) + lower terms. 

Since the coelficients of f(X) lie in fc, it follows that 

roc pn 
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lies in fc, and since r # 0 (in fc), then cc pn lies in fc. Let a = a pn . Then a is 
a root of the polynomial X pn — a, which divides f(X). It follows that 
f(X) = X pn - a. 

Essentially the same argument as the preceding one shows that P. Ins. 2 
implies P. Ins. 3. It is trivial that the third condition implies the fourth. 

Finally, assume P. Ins. 4. Let E be an extension generated by purely 
inseparable elements a f (i e /). Any embedding of E over fc maps a f on a root 
of 


f(X) = Irr(a f , fc, X). 


But f(X) divides some polynomial X pn — a, which has only one root. Hence 
any embedding of E over fc is the identity on each a f , whence the identity on 
£, and we conclude that [£: fc] s = 1, as desired. 

An extension satisfying the above four properties will be called purely 
inseparable. 

Proposition 6.5. Purely inseparable extensions form a distinguished class 
of extensions. 

Proof. The tower theorem is clear from Theorem 4.1, and the lifting 
property is clear from condition P. Ins. 4. 

Proposition 6.6. Let E be an algebraic extension of k. Let E 0 be the 
compositum of all subfields F of E such that F => fc and F is separable 
over k. Then E 0 is separable over fc, and E is purely inseparable over 
E 0 . 

Proof Since separable extensions form a distinguished class, we know 
that E 0 is separable over fc. In fact, E 0 consists of all elements of E which 
are separable over fc. By Proposition 6.1, given a e E there exists a power of 
p, say p n such that a pn is separable over fc. Hence E is purely inseparable 
over E 0 , as was to be shown. 

CoroIIary 6.7. // an algebraic extension E of k is both separable and 
purely inseparable , then E = fc. 

Proof. Obvious. 

Corollary 6.8. Let K be normal over k and let K 0 be its maximal separa- 
ble subextension. Then K 0 is also normal over k. 

Proof. Let o be an embedding of K 0 in K a over fc and extend o to an 
embedding of K. Then o is an automorphism of K. Furthermore, oK 0 is 
separable over fc, hence is contained in X 0 , since K 0 is the maximal separa- 
ble subfield. Hence oK 0 = K 0 , as contended. 
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Corollary 6.9. Let F, F be two finite extensions of fc, and assume that 
E/k is separable , F/k is purely inseparable. Assume E, F are subfields of a 
common field. Then 


[£F:F] = [£:fc] = [£F:fc] s , 
[EF : £] = [F: fc] = [FF : fc] f . 

Proofi The picture is as follows: 



The proof is a trivial juggling of indices, using the corollaries of Proposition 
6.1. We leave it as an exercise. 

Corollary 6.10. Let E p denote the field of all elements x p , x e E. Let E 
be a finite extension of fc. If E p k = F, then E is separable over fc. If E is 
separable over fc, then E pn k = F for all nìt 1. 

Proofi Let F 0 be the maximal separable subfield of F. Assume E p k = F. 
Let F = fc(a l5 ..., a„). Since F is purely inseparable over F 0 there exists m 
such that af m e F 0 for each i=l,...,«. Hence E pm c= F 0 . But E pm k = F 
whence F = F 0 is separable over fc. Conversely, assume that F is separable 
over fc. Then F is separable over E p k. Since F is also purely inseparable over 
E p k we conclude that F = E p k. Similarly we get F = F p ”fc/or n ^ 1, as was 
to be shown. 

Proposition 6.6 shows that any algebraic extension can be decomposed 
into a tower consisting of a maximal separable subextension and a purely 
inseparable step above it. Usually, one cannot reverse the order of the 
tower. However, there is an important case when it can be done. 

Proposition 6.11. LetKbe normal over k. Let G be its group ofautomorphisms 
over k. Let K° be the fixed field ofG (see Chapter VI, §1). Then K G is purely 
inseparable over fc, and K is separable over K°. If Ko is the maximal separa- 
ble subextension of K , then K — K G Ko and Ko n K G = k. 

Proof. Let a e K G . Let t be an embedding of fc(a) over fc in K a and 
extend t to an embedding of K f which we denote also by t. Then t is an 
automorphism of K because K is normal over fc. By definition, Ta = a and 
hence t is the identity on fc(a). Hence [fc(a): fc] s = 1 and a is purely in- 
separable. Thus K G is purely inseparable over fc. The intersection of K 0 
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and K g is both separable and purely inseparable over fc, and hence is equal 
to fc. 

To prove that K is separable over K G , assume fìrst that K is finite over 
fc, and hence that G is finite, by Theorem 4.1. Let <xe K. Let a i9 ... 9 <r r be a 
maximal subset of elements of G such that the elements 


^a,..., <r r oc 

are distinct, and such that o\ is the identity, and a is a root of the polynomial 

f(X) =t\(X- 

Í = 1 

For any t g G we note that / T = / because t permutes the roots. We note 
that / is separable, and that its coefficients are in the fixed field K G . Hence a 
is separable over K G . The reduction of the infinite case to the finite case is 
done by observing that every a e K is contained in some finite normal 
subextension of K. We leave the details to the reader. 

We now have the following picture: 


K 


K 



K 0 K 



G 



G 


K 0 nK G = k 


By Proposition 6.6, K is purely inseparable over K 0 , hence purely insepara- 
ble over K 0 K G . Furthermore, K is separable over K G 9 hence separable over 
K 0 K G . Hence K = K 0 K G , thereby proving our proposition. 

We see that every normal extension decomposes into a compositum of 
a purely inseparable and a separable extension. We shall define a Galois ex- 
tension in the next chapter to be a normal separable extension. Then K 0 
is Galois over fc and the normal extension is decomposed into a Galois and a 
purely inseparable extension. The group G is called the Galois group of the 
extension K/k. 

A field fc is called perfect if k p = fc. (Every field of characteristic zero is 
also called perfect.) 

Corollary 6.12. If k is perfect , then every algebraic extension of k is 

separable, and every algebraic extension of k is perfect. 

Proof. Every finite algebraic extension is contained in a normal exten- 
sion, and we apply Proposition 6.11 to get what we want. 
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EXERCISES 

1. Let F = Q(a), where a is a root of the equation 

a 3 + a 2 + a + 2 = 0. 

Express (a 2 + a + l)(a 2 -f a) and (a — l) -1 in the form 

aa 2 + bot + c 


with a, b, c e Q. 

2. Let E = F(a) where a is algebraic over F, of odd degree. Show that E = F(a 2 ). 

3. Let a and /? be two elements which are algebraic over F. Let f(X) = Irr(a, F, X) 
and g(X) = Irr(/?, F, X). Suppose that deg / and deg g are relatively prime. Show 
that g is irreducible in the polynomial ring F(a)[X]. 

4. Let a be the real positive fourth root of 2. Find all intermediate fields in the 
extension Q(a) of Q. 

5. If a is a complex root of X 6 + X 3 + 1, find all homomorphisms a\ Q(a) -+ C. 
[Hint\ The polynomial is a factor of X 9 — 1.] 

6. Show that yjl + y/3 is algebraic over Q, of degree 4. 

7. Let E, F be two finite extensions of a field k, contained in a larger field K. Show 
that 


[FF : k] ^ [F : k] [F : k\ 

If [F: k] and [F: k] are relatively prime, show that one has an equality sign in 
the above relation. 

8. Let /(X) E k[X] be a polynomial of degree n. Let K be its splitting field. Show 
that [K:k\ divides n\ 

9. Find the splitting field of X pS — 1 over the field Z/pZ. 

10. Let a be a real number such that a 4 = 5. 

(a) Show that Q(ia 2 ) is normal over Q. 

(b) Show that Q(a + iot) is normal over Q(ia 2 ). 

(c) Show that Q(a + ia) is not normal over Q. 

11. Describe the splitting fields of the following polynomials over Q, and find the 

degree of each such splitting field. 

(a) X 2 - 2 (b) X 2 - 1 

(c) X 3 - 2 (d) (X 3 - 2)(X 2 - 2) 

(e) X 2 + X + 1 (f) X 6 + X 3 + 1 

(g) * 5 - 7 

12. Let K be a finite field with p n elements. Show that every element of K has a 
unique p-th root in K. 
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13. If the roots of a monic polynomial f(X) E k[X] in some splitting field are distinct, 
and form a field, then char k = p and f(X) = X pn - X for some n ^ 1. 

14. Let char K = p. Let L be a finite extension of K , and suppose [L: K~\ prime to 
p. Show that L is separable over K. 

15. Suppose char K = p. Let ae K. If a has no p-th root in K , show that X pn — a is 
irreducible in K[X~] for all positive integers n. 

16. Let char K = p. Let a be algebraic over K. Show that a is separable if and only 
if K(a) = K(a pn ) for all positive integers n. 

17. Prove that the following two properties are equivalent: 

(a) Every algebraic extension of K is separable. 

(b) Either char K = 0, or char K = p and every element of K has a p-th root in 
K. 

18. Show that every element of a finite field can be written as a sum of two squares 
in that field. 

19. Let E be an algebraic extension of F. Show that every subring of E which 
contains F is actually a field. Is this necessarily true if E is not algebraic over F? 
Prove or give a counterexample. 

20. (a) Let E = F(x) where x is transcendental over F. Let K ^ F be a subfield of E 

which contains F. Show that x is algebraic over K. 

(b) Let E = F(x). Let y = f(x)/g(x) be a rational function, with relatively prime 
polynomials f g e F[x]. Let n = max(deg /, deg g). Suppose n}> 1. Prove 
that 

LF(x):F(y)] = n. 

21. Let Z + be the set of positive integers, and A an additive abelian group. Let 
/: Z + -> A and g:Z + -* A be maps. Suppose that for all n, 

/(«) = Z 9(d). 

d\n 

Let p be the Mòbius function (cf. Exercise 12 of Chapter II). Prove that 

g(n) = z g(n/d)f(d). 

d\n 

22. Let k be a finite field with q elements. Let f(X) e k[X~\ be irreducible. Show that 
f(X) divides X qn — X if and only if deg / divides n. Show the multiplication 
formula 


^-^=nn /ÂX), 

d\n f d irr 

where the inner product is over all irreducible polynomials of degree d with 
leading coefficient 1. Counting degrees, show that 

q" = Z #(«*)> 

d\n 

where i //(d) is the number of irreducible polynomials of degree d. Invert by 
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Exercise 21 and find that 


»'/'(«) = z n(d)q" ld . 

d\n 

23. (a) Let k be a finite field with q elements. Define the zeta function 

z(í)=(i-t) i n(i-t de 8 T i , 

p 

where p ranges over all irreducible polynomials p = p(X) in k\_X~\ with leading 
coefficient 1. Prove that Z(t) is a rational function and determine this rational 
function. 

(b) Let n q (n) be the number of primes p as in (a) of degree ^ n. Prove that 

q q m 

n a (m) ~ —-for m -> oo. 

q - 1 m 


Remark. This is the analogue of the prime number theorem in number theory, 
but it is essentially trivial in the present case, because the Riemann hypothesis is 
trivially verified. Things get more interesting fast after this case. Consider an 
equation y 2 = x 3 + ax + b over a fìnite field of characteristic ^ 2, 3, and 
having q elements. Assume —4 a 3 — 21b 2 ^ 0, in which case the curve defined by 
this equation is called an elliptic curve. Define N n by 

N n — 1 = number of points (x, y) satisfying the above equation with 
x, y g F qn (the extension of of degree n). 

Define the zeta function Z(t) to be the unique rational function such that Z(0) = 1 
and 

Z'/Z(t) = X N n t n ~ l . 


A famous theorem of Hasse asserts that Z(t) is a rational function of the form 


Z(t) = 


(l-aO(l-âí) 
(i - 0(1 - qt) ’ 


where a is an imaginary quadratic number (not real, quadratic over Q), á is its 
complex conjugate, and aâ = q, so |a| = q l/2 . See Hasse, “Abstrakte Bergrundung 
der komplexen Multiplikation und Riemannsche Vermutung in Funktionen- 
kòrpern,” Abh. Math. Sem. Univ. Hamburg 10 (1934) pp. 325-348. 

24. Let k be a field of characteristic p and let t, u be algebraically independent over 
k. Prove the following: 

(a) k(t , u) has degree p 2 over k(t p , u p ). 

(b) There exist infinitely many extensions between k(t, u) and k(t p , u p ). 

25. Let E be a finite extension of k and let p r = \E: k] t . We assume that the 
characteristic is p > 0. Assume that there is no exponent p s with s <r such that 
E pS k is separable over k (i.e., such that <x pS is separable over k for each a in E). 
Show that E can be generated by one element over k. [ Hint: Assume first that 
E is purely inseparable.] 
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26. Let k be a field,/(X) an irreducible polynomial in k[X], and let K be a finite normal 
extension of k. If g , h are monic irreducible factors of f(X) in K[X ], show that there 
exists an automorphism a of K over k such that g = h a . Give an example when this 
conclusion is not valid if K is not normal over k. 

27. Let Xj, x„ be algebraically independent over a field k. Let y be algebraic over 
k(x) = k(x x„). Let P(X n+í ) be the irreducible polynomial of y over k(x). 
Let cp(x) be the least common multiple of the denominators of the coefficients of 
P. Then the coefficients of cp(x)P are elements of /c[x]. Show that the polynomial 

f(X l ...;X n+l ) = <p(X l9 ... 9 X m )P(X n+l ) 

is irreducible over k, as a polynomial in n + 1 variables. 

Conversely, let f(X Y ,--,X n+l ) be an irreducible polynomial over k. Let 
x í9 ...,x n be algebraically independent over k. Show that 

f(x u ...,x n , X n+Ì ) 

is irreducible over k(x x ,. x„). 

If / is a polynomial in n variables, and (b) = (b íy ..., b n ) is an n-tuple of 
elements such that f(b) = 0, then we say that (b) is a zero of f. We say that (b) is 
non-trivial if not all coordinates b t are equal to 0. 

28. Let f(X l9 ..., X n ) be a homogeneous polynomial of degree 2 (resp. 3) over a field 
k. Show that if / has a non-trivial zero Ìn an extension of odd degree (resp. 
degree 2) over k , then / has a non-trivial zero in k. 

29. Let f(X , Y) be an irreducible polynomial in two variables over a field k. Let t be 
transcendental over /c, and assume that there exist integers m, n ^ 0 and elements 
a, bek, ab ^ 0, such that f(at n , bt m ) — 0. Show that after inverting possibly X or 
7, and up to a constant factor, / is of type 

X m Y n - c 

with some cek. 

The answer to the following exercise is not known. 

30. (Artin conjecture). Let / be a homogeneous polynomial of degree d in n vari- 
ables, with rational coefficients. If n > d, show that there exists a root of unity Ç, 
and elements 

X U ..., X n € Q[C] 

not all 0 such that f(x x ,..., x„) = 0. 

31. Difference equations. Let u í9 ..., u d be elements of a field K. We want to solve 
for infinite vectors (x 0 , x x ,..., x„,...) satisfying 

(*) = Wi*„-i + • * * + u d x n - d for n^d. 

Define the characteristic polynomial of the system to be 

+ ••• + «,) =f(X). 
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Suppose a is a root of /. 

(a) Show that x„ = a" (n ^ 0) is a solution of (♦). 

(b) Show that the set of solutions of (*) is a vector space of dimension d. 

(c) Assume that the characteristic polynomial has d distinct roots a 1 ,...,a d . 
Show that the solutions (aj),..., (aj) form a basis for the space of solutions. 

(d) Let x„ = hjaJ + • • • 4- b d aj for n ^ 0, show how to solve for b í9 . .., b d in terms 
of a 1? ..., 0 L d and x 0 ,x d _ t . (Use the Vandermonde determinant.) 

(e) Under the conditions of (d), let F(T) = Y, x nT n - Show that F(T) represents a 
rational function, and give its partial fraction decomposition. 

32. Let d = 2 for simplicity. Given a 0 , a í9 u, v, vv, t e K, we want to find the solutions 
of the system 

a„ = ua n - x - vta n - 2 ~ t n w for n ^ 2. 

Let oc í9 a 2 be the root of the characteristic polynomial, that is 
1 -uX + vtX 2 = (1 - ai X)(l - a 2 X). 

Assume that ct í9 a 2 are distinct, and also distinct from t. Let 

F(X) = £ a„X". 

n=0 

(a) Show that there exist elements A, B, C of K such that 

= Z - ŷ + 1 - y + z - 

1 — a x X 1 cc^X 1 — tX 

(b) Show that there is a unique solution to the difference equation given by 

a„ = Aolì -h BaJ -h Ct n for n ^ 0. 

(To see an application of this formalism to modular forms, as in the work of 
Manin, Mazur, and Swinnerton-Dyer, cf. my Introduction to Modular Forms, 
Springer-Verlag, New York, 1976, Chapter XII, §2.) 

33. Let R be a ring which we assume entire for simplicity. Let 

g(T) = T d — a d _ x T d ~ l — - - — a 0 
be a polynomial in R[T], and consider the equation 
T d = û 0 + a x T -h * * * + Qd-i T d 

Let x be a root of g(T). 

(a) For any integer n ^ d there is a relation 

X " = a 0,n + a l ,n X + ••’ + a d-l,„X d ' X 

with coefficients a itj in Z\_a 0 ,..., a d -{] <= R. 

(b) Let F(T) e R[T~\ be a polynomial. Then 

F(x) = a 0 (F) + a,(F)x + ••• + a (l _ 1 (F)x‘'‘ 1 
where the coefficients a^F) lie in R and depend linearly on F. 
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(c) Let the Vandermonde determinant be 

1 Xj ••• xf" 1 

v ( Xl ,..., Xi )= 1 x . 2 xí ~' =n (*,-*»)• 

: : : «<; 

1 Xj ••• Xj' 1 

Suppose that the equation g(T) = 0 has d roots and that there is a factoriza- 
tion 

g (T)=f\(T- Xl ). 

i—l 

Substituting x, for x with i = 1,..., d and using Cramer’s rule on the resulting 
system of linear equations, yields 

Aaj(F) = Aj(F) 

where A is the Vandermonde determinant, and Aj(F) is obtained by replacing 
the j- th column by ^FfxJ, ...» F(x d )), so 

1 X! •• F( Xl ) ••• xf" 1 

1 x 2 F(x 2 ) ••• x 2 _1 

1 x d • • • F(x d ) • • • x d _1 

If A ^ 0 then we can write 

aj(F) = Aj(F)/A. 

Remark. If F(T) is a power series in R[[T]] and if R is a complete local ring, 
with Xj, ...» x d in the maximal ideal, and x = x ( for some i, then we can evaluate 
F(x) because the series converges. The above formula for the coefficients aj(F) 
remains valid. 

34. Let xx d be independent variables, and let A be the ring 

QtO. . *d]]m/fl (T-x,y 

i=l 

Substituting some x { for T induces a natural homomorphism (p t of A onto 

Q[[z 1( ...,x,,]] = R, 

and the map zi-»((ju 1 (z),..., (p d (z)) gives an embedding of A into the product of R 
with itself d times. 

Let k be an integer, and consider the formal power series 

d (T — x)e^~ Xi * 

F(T) = e- n V--?V = n h(T - x,) 
i=i e — i í=i 

where h(t) = te 1 /(e x — 1). It is a formal power series in T, T — x í9 ..., T — x d . 
Under substitution of some x,- for T it becomes a power series in Xj and Xj — x t , 
and thus converges in Q[[x l5 ..., x d ]]. 
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(a) Verify that 

F(T) = a 0 (F) + ••• + a i - 1 (F)T‘' _1 mod (T - x t ) 

ì=1 

where a 0 (^)» a d-i(f)e Q[[x lf x d ]], and that the formula given in the 

preceding exercise for these coefficients in terms of Vandermonde determi- 
nants is valid. 

(b) Show that a d _i(F) = 0 if -(d - 1) ^ k < 0 and a d _ x (F) = 1 if k = 0. 


Remark. The assertion in (a) is a simple limit. The assertion in (b) is a fact 
which has been used in the proof of the Hirzebruch-Grothendieck-Riemann- 
Roch theorem and as far as I know there was no simple known proof until Roger 
Howe pointed out that it could be done by the formula of the preceding exercise 
as follows. We have 


V(x» 


Furthermore, 


x n )a d ^(F) 


1 

1 x d 


xt 1 F( Xl ) 

F( Xd ) 


F(Xj) = e kx ‘ n 
n*j 


(Xj - x„)e x i x - 
e x r*« - l 


We use the inductive relation of Vandermonde determinants 


V(Xt, .. •, X d ) = V(X X , ..., *,.*J(- 1 Y~> n (Xj - Xn). 

n*j 

We expand the determinant for û d -i(F) according to the last column to get 


j =i n*j e ì — e 


Using the inductive relation backward, and replacing x t by e Xi which we denote 
by y, for typographical reasons, we get 


v(y i i • • • > y d )a d ~i(F) = 


y i ■ 

•• yr 

y k+d 1 


(S 

.. 1 

Sp 
+ . . 
Cu 

1 


If k # 0 then two columns on the right are the same, so the determinant is 0. If 
k = 0 then we get the Vandermonde determinant on the right, so a d ^(F) = 1. 
This proves the desired value. 




CHAPTER VI_ 

Galois Theory 


This chapter contains the core of Galois theory. We study the group of 
automorphisms of a finite (and sometimes infinite) Galois extension at length, 
and give examples, such as cyclotomic extensions, abelian extensions, and even 
non-abelian ones, leading into the study of matrix representations of the Galois 
group and their classifications. We shall mention a number of fundamental 
unsolved problems, the most notable of which is whether given a finite group 
G, there exists a Galois extension of Q having this group as Galois group. Three 
surveys give recent points of view on those questions and sizeable bibliographies: 

B. Matzat, Konstruktive Galoistheorie , Springer Lecture Notes 1284, 1987 

B. Matzat, Úber das Umkehrproblem der Galoisschen Theorie, JahrsberichtDeutsch. 

Mat.-Verein. 90 (1988), pp. 155-183 

J. P. Serre, Topics in Galois theory , course at Harvard, 1989, Jones and Bartlett, 

Boston 1992 

More specific references will be given in the text at the appropriate moment 
concerning this problem and the problem of determining Galois groups over 
specific fields, especially the rational numbers. 


§1. GALOIS EXTENSIONS 

Let K be a field and let G be a group of automorphisms of K. We denote 
by K g the subset of K consisting of all elements xe K such that x a = x for all 
g e G. It is also called the fixed field of G. It is a field because if x, y e K G then 

(x -f yf = x a + f = x + y 
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for all o e G, and similarly, one verifies that K is closed under multiplication, 
subtraction, and multiplicative inverse. Furthermore, K G contains 0 and 1, 
hence contains the prime field. 

An algebraic extension K of a field k is called Galois if it is normal and 
separable. We consider K as embedded in an algebraic closure. The group of 
automorphisms of K over k is called the Galois group of K over k , and is denoted 
by G(K/k ), G K/k , Ga\(K/k), or simply G. It coincides with the set of embeddings 
oïK \nK* over k. 

For the convenience of the reader, we shall now state the main result of the 
Galois theory for finite Galois extensions. 

Theorem 1.1. Leî Kbe a finite Galois extension of k, with Galois group G. 
There is a bijection between the set of subfields E of K containing k, and the 
set of subgroups H of G, given by E = K H . The field E is Galois over k if and 
only if H is normal in G, and if that is the case, then the map o\—>o\E induces 
an isomorphism of G/H onto the Galois group of E over k. 

We shall give the proofs step by step, and as far as possible, we give them for 
infinite extensions. 

Theorem 1.2. Let K be a Galois extension of k. Let G be its Galois group. 
Then k = K G . If F is an intermediate field, k «= F cz K, then K is Galois over 
F. The map 

F^G(K/F) 

from the set of intermediate fields into the set of subgroups of G is injective. 

Proof. Let a e K G . Let o be any embedding of /c(a) in K a , inducing the 
identity on k. Extend o to an embedding of K into K a , and call this extension o 
also. Then o is an automorphism of K over k, hence is an element of G. By 
assumption, o leaves a fixed. Therefore 

[/c(a):fe] s = 1. 

Since a is separable over k, we have /c(a) = k and a is an element of k. Thisproves 
our first assertion. 

Let F be an intermediate field. Then K is normal and separable over F by 
Theorem 3.4 and Theorem 4.5 of Chapter V. Hence K is Galois over F. If H = 
G(K/F) then by what we proved above we conclude that F = K H . If F, F' are 
intermediate fields, and H = G(K/F ), H ' = G(K/F '), then 

F = K H and F' = K H . 

If H = H' we conclude that F = F', whence our map 

F^G(K/F) 

is injective, thereby proving our theorem. 
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We shall sometimes call the group G(K/F) of an intermediate field the group 
associated with F. We say that a subgroup H of G belongs to an intermediate 
field F if H = G(K/F). 

Corollary 1.3. Let K/k be Galois with group G. Let F, F' be two inter- 
mediate fields, and let H, H ' be the subgroups of G belonging to F, F' respec- 
tively. Then H n H' belongs to FF'. 

Proof. Every element of H n H' leaves FF' fixed, and every element of G 
which leaves FF' fixed also leaves F and F' fixed and hence lies in H n H'. 
This proves our assertion. 

Corollary 1.4. Let the notation be as in Corollary 1.3. Thefixed field of the 
smallest subgroup of G containing H, H' is F n F'. 

Proof. Obvious. 

Corollary 1.5. Let the notation be as in Corollary 1.3. Then F a F' if 
and only if H' c H. 

Proof. If F c= F' and oeH' leaves F' fixed then o leaves F fixed, so o lies 
in H. Conversely, if H' c= H then the fixed fìeld of H is contained in the fixed 
field of H', sofc F'. 

Corollary 1.6. Let E be a finite separable extension of a field k. Let K be 
the smallest normal extension of k containing E. Then K is finite Galois over 
k. There is only a finite number of intermediate fields F such that k a F cz E. 

Proof. We know that K is normal and separable, and K is finite over k 
since we saw that it is the finite compositum of the finite number of conjugates 
of E. The Galois group of K/k has only a finite number of subgroups. Hence 
there is only a finite number of subfields of K containing k, whence a fortiori a 
finite number of subfields of E containing k. 

Of course, the last assertion of Corollary 1.6 has been proved in the preceding 
chapter, but we get another proof here from another point of view. 

Lemma 1.7. Let E be an algebraic separable extension of k. Assume that 
there is an integer n ^ 1 such that every element otofE is of degree ^ n over k. 
Then E is finite over k and [E : /c] ^ n. 

Proof. Let a be an element of E such that the degree [k( a): k] is maximal, 
say m ^ n. We contend that k(ot) = E. If this is not true, then there exists an 
element /? e E such that $ k(a), and by the primitive element theorem, there 
exists an element y e k( a, p) such that k(ot, p) = k(y). But from the tower 

k c= k(ot) cz k(ot, p) 

we see that [/c(a, f $): /c] > m whence y has degree > m over k, contradiction. 
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Theorem 1.8. (Artin). Let Kbe a field and let G be a finite group of auto - 
morphisms of K, of order n. Let k = K G be the fixed field. Then K is a finite 
Galois extension of k, and its Galois group is G. We have \_K : fc] = n. 

Proof Let oce K and let o u ..., o r be a maximal set of elements of G such 
that dja, ..., G r cc are distinct. If xeG then (To^a, ..., ra r oc) differs from 
(dja,..., a r oc) by a permutation, because t is injective, and every Tc^a is among 
the set {g^oc, ..., a- r a}; otherwise this set is not maximal. Hence oc is a root of 
the polynomial 


/(X) = fl (* - <r iû£ ) ( 

î= 1 

and for any t e G, / T = /. Hence the coefficients of / lie in K G = k. Further- 
more, / is separable. Hence every element oc of K is a root of a separable 
polynomial of degree ^n with coefficients in k. Furthermore, this poly- 
nomial splits in linear factors in K. Hence K is separable over k , is normal 
over k, hence Galois over k. By Lemma 1.7, we have \_K : /c] ^ n. The Galois 
group of K over k has order K:k ] (by Theorem 4.1 of Chapter V), and hence 
G must be the full Galois group. This proves all our assertions. 

Corollary 1.9. Let Kbe a finite Galois extension of k and let G be its Galois 

group. Then every subgroup of G belongs to some subfield F such that 

k c F c K. 

Proof Let H be a subgroup of G and let F = K H . By Artin’s theorem we 
know that K is Galois over F with group H. 

Remark. When K is an infinite Galois extension of k , then the preceding 
corollary is not true any more. This shows that some counting argument 
must be used in the proof of the finite case. In the present treatment, we have 
used an old-fashioned argument. The reader can look up Artin’s own proof in 
his book Galois Theory. In the infinite case, one defines the Krull topology on 
the Galois group G (cf. exercises 43-45), and G becomes a compact totally 
disconnected group. The subgroups which belong to the intermediate fields are 
the closed subgroups. The reader may disregard the infinite case entirely through- 
out our discussions without impairing understanding. The proofs in the infinite 
case are usually identical with those in the finite case. 

The notions of a Galois extension and a Galois group are defined completely 
algebraically. Hence they behave formally under isomorphisms the way one 
expects from objects in any category. We describe this behavior more explicitly 
in the present case. 

Let K be a Galois extension of k. Let 


À: K XK 



VI, §1 


GALOIS EXTENSIONS 265 


be an isomorphism. Then ÀX is a Galois extension of \k. 

k —aà: 

k - x » A k 

Let G be the Galois group of K over k. Then the map 

g i— ► À o a o A ~ 1 

gives a homomorphism of G into the Galois group of \K over \k , whose inverse 
is given by 


À 1 o T o X <—I T. 

Hence G(\K/\k) is isomorphic to G(K/k) under the above map. We may write 

G(XK/Xk) x = G(K/k) 


or 


G(AiC/Afc) = XG(K/k)X~\ 
where the exponent X is “conjugation,” 

= \~ 1 o a o X. 

There is no avoiding the contravariance if we wish to preserve the rule 


when we compose mappings X and a>. 

In particular, let F be an intermediate field, k c F c= K, and let X :F -+ XF 
be an embedding of F in X, which we assume is extended to an automorphism 
of K. Then XK = K. Hence 


G(K/XF) x = G(K/F) 
and 

G(K/XF ) = XG(K/F)X~ l . 

Theorem 1.10. Let K be a Galois extension of k with group G. Let F be a 
subfield , k c F c X, and let H = G(K/F). Then F is normal over k if and 
only if H is normal in G. IfF is normal over /c, then the restriction map cr i—► <r | F 
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is a homomorphism of G onto the Galois group of F over k, whose kernel is H. 
We thus have G(F/k ) « G/H. 

Proof. Assume F is normal over k, and let G' be its Galois group. The 
restriction map o -> o\F maps G into G', and by definition, its kernel is H. 
Hence H is normal in G. Furthermore, any element t e G' extends to an em- 
bedding of K in K a , which must be an automorphism of K , so the restriction 
map is surjective. This proves the last statement. Finally, assume that F is not 
normal over k. Then there exists an embedding À of F in K over k which is not 
an automorphism, i.e. ÀF # F. Extend X to an automorphism of K over k. 
The Galois groups G(K/ÀF) and G(K/F) are conjugate, and they belong to 
distinct subfields, hence cannot be equal. Hence H is not normal in G. 

A Galois extension K/k is said to be abelian (resp. cyclic) if its Galois group G 
is abelian (resp. cyclic). 

Corollaryl.il. Let K/k be abelian (resp. cyclic). If F is an intermediate 
field, k c F c= K, then F is Galois over k and abelian (resp. cyclic). 

Proof. This follows at once from the fact that a subgroup of an abelian 
group is normal, and a factor group of an abelian (resp. cyclic) group is abelian 
(resp. cyclic). 

Theorem 1.12. Let Kbe a Galois extension of k, let F be an arbitrary exten- 
sion and assume that K, F are subfields of some other field. Then KF is Galois 
over F, and K is Galois over K n F. Let H be the Galois group of KF over F, 
and G the Galois group of K over k. IfoeH then the restriction of o to K is 
in G, and the map 

ot-+o\K 

gives an isomorphism of H on the Galois group of K over K n F. 

Proof. Let oe H. The restriction of o to K is an embedding of K over k, 
whence an element of G since K is normal over k. The map o i—► o | K is clearly a 
homomorphism. If o\K is the identity, then o must be the identity of KF 
(since every element of KF can be expressed as a combination of sums, products, 
and quotients of elements in K and F). Hence our homomorphism o i-> o \ K is 
injective. Let H' be its image. Then H' leaves XnF fixed, and conversely, if an 
element a e K is fixed under H', we see that a is also fixed under H, whence 
a e F and a e K n F. Therefore K n F is the fixed field. If K is finite over k, 
or even KF finite over F, then by Theorem 1.8, we know that H' is the Galois 
group of K over K n F, and the theorem is proved in that case. 

(In the infinite case, one must add the remark that for the Krull topology, 
our map o i-» cr\ K is continuous, whence its image is closed since H is compact. 
See Theorem 14.1; Chapter I, Theorem 10.1; and Exercise 43.) 
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The diagram illustrating Theorem 1.12 is as follows: 



K nF 


k 

It is suggestive to think of the opposite sides of a parallelogram as being equal. 

Corollary 1.13. Let K bea finite Galois extension ofk. Let F be an arbitrary 
extension of k. Then [/CF : F] divides [K : /c]. 

Proof. Notation being as above, we know that the order of H divides the 
order of G, so our assertion follows. 

Warning. The assertion of the corollary is not usually valid if K is not 
Galois over k. For instance, let a = //2 be the real cube root of 2, let £ be a 
cube root of 1, C # 1, say 

r- ~l+y Z 3 

2 

and let /? = £a. Let E = Q(j5). Since j? is complex and a real, we have 

Q(/?) * Q(a). 

Let F = Q(a). Then E n F is a subfield of E whose degree over Q divides 3. 
Hence this degree is 3 or 1, and must be 1 since E # F. But 

EF = Q(oc, P) = Q(a, Q = Q(a, 7^3). 

Hence EF has degree 2 over F. 

Theorem 1.14. Let K j and K 2 be Galois extensions of a field k , vvií/i Galois 
groups G { and G 2 respectively. Assume K u K 2 are subfields of some field. 
Then K { K 2 is Galois over k. Let G be its Galois group. Map G -► Gj x G 2 
by restriction , namely 

a\->(<r\K l9 <t\K 2 ). 

This map is injective. If K x r\K 2 — k then the map is an isomorphism. 
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Proof. Normality and separability are preserved in taking the compositum 
of two fields, so K x K 2 is Galois over fc. Our map is obviously a homomorphism 
of G into Gj x G 2 . If an element oeG induces the identity on K j and K 2 
then it induces the identity on their compositum, so our map is injective. Assume 
that Xj n K 2 = k. According to Theorem 1.12, given an element ctj g Gj there 
exists an element o of the Galois group of K X K 2 over K 2 which induces on 
K v This o is afortiori in G, and induces the identity on K 2 . Hence G { x {e 2 } 
is contained in the image of our homomorphism (where e 2 is the unit element of 
G 2 ). Similarly, {e^} x G 2 is contained in this image. Hence their product is 
contained in the image, and their product is precisely G x x G 2 . This proves 
Theorem 1.14. 



Corollary 1.15. Let K u ...,K n be Galois extensions of k with Galois 
groups Gj,..., G n . Assume that K i+Ì n (K { • • • K t ) = k for each 
i = 1,..., n — 1. Then the Galois group of K { • • • K n is isomorphic to the 
product G x x • • ♦ x G n in the natural way. 

Proof. Induction. 


Corollary 1.16. Let K be a finite Galois extension of k with group G, and 
assume that G can be written as a direct product G = G x x • • • x G n . Let 
Ki be the fixed field of 


Gj x ••• x {1} x ••• x G„ 

where the group with 1 element occurs in the i-th place. Then K t is Galois over 
fc, and K i+Ì n (X x • • • K t ) = fc. Furthermore K = K x • • • K n . 

Proof. By Corollary 1.3, the compositum of all K t belongs to the intersection 
of their corresponding groups, which is clearly the identity. Hence the composi- 
tum is equal to K. Each factor of G is normal in G, so K t is Galois over fc. By 
Corollary 1.4, the intersection of normal extensions belongs to the product of 
their Galois groups, and it is then clear that K i + l n (Xj • • • K t ) = k. 
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Theorem 1.17. Assume all fields contained in some commonfield. 

(i) If L are abelian over /c, so is the composite KL. 

(ii) IfK is abelian over k and E is any extension ofk , then KE is abelian over E. 

(iii) // K is abelian over k and K => E => kwhere Eisanintermediatefield.then 
E is abelian over k and K is abelian over E. 

Proof Immediate from Theorems 1.12 and 1.14. 

If k is a field, the composite of all abelian extensions of k in a given algebraic 
closure k à is called the maximum abelian extension of k , and is denoted by k ab . 

Remark on notation. We have used systematically the notation: 

/c a = algebraic closure of /c; 

/c s = separable closure of /c; 

/c ab = abelian closure of k = maximal abelian extension. 

We have replaced other people’s notation k (and mine as well in the first edition) 
with /c a in order to make the notation functorial with respect to the ideas. 


§2. EXAMPLES AIMD APPLICATIONS 

Let k be a field and f{X) a separable polynomial of degree ^ 1 in k[X\. Let 
f(X) = (X-aJ-'-iX 

be its factorization in a splitting field K over k. Let G be the Galois group of K 
over k. We call G the Galois group of/over k. Then the elements of G permute 
the roots of f Thus we have an injective homomorphism of G into the symmetric 
group S n on n elements. Not every permutation need be given by an element 
of G. We shall discuss examples below. 

Example 1. Quadratic extensions. Let k be a field and a e k. If a is not 
a square in k 9 then the polynomial X 2 - a has no root in k and is therefore 
irreducible. Assume char k =k 2. Then the polynomial is separable (because 
2 =k 0), and if a is a root, then k(a) is the splitting field, is Galois, and its 
Galois group is cyclic of order 2. 

Conversely , given an extension K ofk ofdegree 2, there exists a e k such that 
K = k(a) and a 2 = a. This comes from completing the square and the quadratic 
formula as in elementary school. The formula is valid as long as the characteristic 
of k is =k 2. 
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Example 2. Cubic extensions. Let k be a field of characteristic 4=- 2 or 
3. Let 

f(X) = X 3 + aX + b. 

Any polynomial of degree 3 can be brought into this form by completing the 
cube. Assume that /has no root in k. Then /is irreducible because any factoriza- 
tion must have a factor of degree 1. Let a be a root of f(X). Then 

[k(a): k\ = 3. 

Let K be the splitting field. Since char k 4 2, 3,/is separable. Let G be the 
Galois group. Then G has order 3 or 6 since G is a subgroup of the symmetric 
group S 3 . In the second case, k(a) is not normal over k. 

There is an easy way to test whether the Galois group is the full symmetric 
group. We consider the discriminant. If a l5 a 2 , a 3 are the distinct roots of 
f(X), we let 

ô = (a i - a 2 )( a 2 — a 3 )(a! - a 3 ) and A = ô 2 . 

If G is the Galois group and o e G then cr(á) = ±ô. Hence o leaves A fixed. 
Thus A is in the ground field k , and in Chapter IV, §6, we have seen that 

A = -4a 3 ~ 21 b 1 . 

The set of cr in G which leave 8 fixed is precisely the set of even permutations. 
Thus G is the symmetric group if and only if A is not a square in k. We may 
summarize the above remarks as follows. 

Letf(X) be a cubic polynomial in k[X \, and assume char k 4 2, 3. Then: 

(a) f is irreducible over k if and only iffhas no root in k. 

(b) Assume f irreducible. Then the Galois group offis S 3 if and only if the 
discriminant off is not a square in k. Ifthe discriminant is a square , then 
the Galois group is cyclic oforder 3, equal to the alternating group A 3 as 
a permutation of the roots of f. 

For instance, consider 

f(X) = X 3 - X + 1 

over the rational numbers. Any rational root must be 1 or — 1, and so f(X) is 
irreducible over Q. The discriminant is - 23, and is not a square. Hence the 
Galois group is the symmetricgroup. The splitting field contains a subfield of 
degree 2, namely k(S) = k(VA). 

On the other hand, let f(X) = X 3 - 3X + 1. Then/has no root in Z, whence 
no root in Q, so/is irreducible. The discriminant is 81, which is a square, so 
the Galois group is cyclic of order 3. 

Example 3. We consider the polynomial /(X) = X 4 - 2 over the 
rationals Q. It is irreducible by Eisenstein’s criterion. Let a be a real root. 
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Let i = yf—ï. Then ±a and ±ia are the four roots of f(X\ and 

[Q(«) : Q] = 4. 

Hence the splitting field of f(X) is 

K = Q(a, i ). 

The field Q(a) n Q(i) has degree 1 or 2 over Q. The degree cannot be 2 otherwise 
i e Q(a), which is impossible since a is real. Hence the degree is 1. Hence i has 
degree 2 over Q(a) and therefore [X : Q] = 8. The Galois group off(X) has 
order 8. 

There exists an automorphism t of K leaving Q(a) fixed, sending i to — i, 
because K is Galois over Q(a), of degree 2. Then t 2 = id. 


Q(a, 0 = K 



By the multiplicativity of degrees in towers, we see that the degrees are as 
indicated in the diagram. Thus X 4 - 2 is irreducible over Q(f). Also, K is 
normal over Q(i). There exists an automorphism <r of K over Q(f) mapping the 
root a of X 4 - 2 to the root ia. Then one verifies at once that 1, cr, cr 2 , o 3 are 
distinct and o 4 = id. Thus o generates a cyclic group of order 4. We denote it 
by <cr>. Since t ^ <<x> it follows that G = <<r, t> is generated by <r and t because 
<<r> has index 2. Furthermore, one verifies directly that 

t <r = <t 3 t, 

because this relation is true when applied to a and f which generate K over Q. 
This gives us the structure of G. It is then easy to verify that the lattice of sub- 
groups is as follows: 
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Example4. Let k be a field and let t í9 ..., t„ be algebraically independent 
over k. Let K = k(t lf ..., t n ). The symmetric group G onn letters operates on 
K by permuting (t í9 ... 9 t n ) and its fixed field is the field of symmetric functions, 
by definition the field of those elements of K fixed under G. Let s ì9 ..., s n be the 
elementary symmetric polynomials, and let 

f(X) = fl(AT - t,). 

i = 1 

Up to a sign, the coefficients of/are s l9 ..., s n . We let F = K G . We contend 
that F = /c(sj, ..., s n ). Indeed, 

k(s l9 ... 9 s n ) <= F. 

On the other hand, K is the splitting field of f(X), and its degree over F is n\. 
Its degree over k(s l9 ..., s n ) is ^ n ! and hence we have equality, F = k(s l9 ..., s n ). 

The polynomial /( X) above is called the general polynomial of degree n. 
We have just constructed a Galois extension whose Galois group is the sym- 
metric group. 

Using the Hilbert irreducibility theorem, one can construct a Galois extension 
of Q whose Galois group is the symmetric group. (Cf. Chapter VII, end of §2, 
and [La 83], Chapter IX.) It is unknown whether given a finite group G, there 
exists a Galois extension of Q whose Galois group is G. By specializing para- 
meters, Emmy Noether remarked that one could prove this if one knew that every 
field E such that 


Q(s,,...,s„) c E cz Q(Í!,...,0 

is isomorphic to a field generated by n algebraically independent elements. 
However, matters are not so simple, because Swan proved that the fixed field 
of a cyclic subgroup of the symmetric group is not necessarily generated by 
algebraically independent elements over k [Sw 69], [Sw 83]. 

Example5. We shall prove that the complex numbers are algebraically 
closed. This will illustrate almost all the theorems we have proved previously. 

We use the following properties of the real numbers R: It is an ordered field, 
every positive element is a square, and every polynomial of odd degree in R[X] 
has a root in R. We shall discuss ordered fields in general later, and our argu- 
ments apply toany ordered field having the above properties. 

Let i = (in other words a root of I 2 + 1). Every element in R(i) 

has a square root. If a + bi e R(i), a, b e R, then the square root is given by 
c + di, where 

2 a + yja 2 + b 2 2 ~ a + yja 2 + b 2 

c = --- and d = --. 


Each element on the right of our equalities is positive and hence has a square root 
in R. It is then trivial to determine the sign of c and d so that (c + di) 2 = a + bi. 
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Since R has characteristic 0, every finite extension is separable. Every finite 
extension of R(i) is contained in an extension K which is finite and Galois over 
R. We must show that K = R(i). Let G be the Galois group over R and let H 
be a 2-Sylow subgroup of G. Let F be its fixed field. Counting degrees and 
orders, we find that the degree of F over R is odd. By the primitive element 
theorem, there exists an element a e F such that F = R(a). Then a is the root of 
an irreducible polynomial in R[X~\ of odd degree. This can happen only if this 
degree is 1. Hence G = H is a 2-group. 

We now see that K is Galois over R(i). Let G^ be its Galois group. Since G^ 
is a p-group (with p = 2), if G { is not the trivial group, then G^ has a subgroup 
G 2 of index 2. Let F be the fixed field of G 2 . Then F is of degree 2 over R(/); it 
is a quadratic extension. But we saw that every element of R(/) has a square 
root, and hence that R(/) has no extensions of degree 2. It follows that Gj is the 
trivial group and K = R(/), which is what we wanted. 

(The basic ideas of the above proof were already in Gauss. The variation 
of the ideas which we have selected, making a particularly efficient use of the 
Sylow group, is due to Artin.) 

Example 6. Let f(X) be an irreducible polynomial over the field /c, and 
assume that / is separable. Then the Galois group G of the splitting field is 
represented as a group of permutations of the n roots, where n = deg/ When- 
ever one has a criterion for this group to be the full symmetric group then 
one can see if it applies to this representation of G. For example, it is an easy 
exercise (cf. Chapter I, Exercise 38) that for p prime, S p is generated by 
[123 *••/?] and any transposition. We then have the following result. 

Let f(X) be an irreducible polynomial with rational coefficients and of degree 

p prime. If f has precisely ívvo nonreal roots in the complex numbers , then the 

Galois group off is S p . 

Proof. The order of G is divisible by p, and hence by Sylow’s theorem, G 
contains an element of order p. Since G is a subgroup of S p which has order p!, 
it follows that an element of order p can be represented by a p-cycle [123 •*•/?] 
after a suitable ordering of the roots, because any smaller cycle has order less 
than p, so relatively prime to p. But the pair of complex conjugate roots shows 
that complex conjugation induces a transposition in G. Hence the group is all 
of S p . 

A specific case is easily given. Drawing the graph of 
f(X) = X 5 - 4X + 2 

shows that/has exactly three real roots, so exactly two complex conjugate roots. 
Furthermore /is irreducible over Q by Eisenstein’s criterion, so we can apply 
the general statement proved above to conclude that the Galois group of / 
over Q is S 5 . See also Exercise 17 of Chapter IV. 
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Example 7. The preceding example determines a Galois group by finding 
some subgroups passing to an extension field of the ground field. There are 
other possible extensions of Q rather than the reals, for instance p-adic fields 
which will be discussed later in this book. However, instead of passing to an 
extension field, it is possible to use reduction mod p. For our purposes here, we 
assume the following statement, which will be proved in Chapter VII, theorem 
2.9. 

Let f (2Q e Z[X] be a polynomial mth integral coefficients, and leading 
coefficient 1. Let p be a prime number. Let f(X) = f(X) mod p be the 
polynomial obtained by reducing the coefficients mod p. Assume that f has 
no multiple roots in an algebraic closure of¥ p . Then there exists a bijection 

(a 1 ,...,a M )h->(â 1 ,...,5ê M ) 

of the roots of f onto those of f, and an embedding of the Galois group of f as a 
subgroup of the Galois group of f, xvhich gives an isomorphism of the action of 
those groups on the set of roots. 

The embedding will be made precise in Chapter VII, but here we just want to 
use this result to compute Galois groups. 

For instance, consider X 5 — X — ì over Z. Reducing mod 5 shows that 
this polynomial is irreducible. Reducing mod 2 gives the irreducible factors 

(X 2 + X + \)(X 3 + X 2 + \) (mod 2). 

Hence the Galois group over the rationals contains a 5-cycle and a product of a 
2-cycle and a 3-cycle. The third power of the product of the 2-cycle and 3-cycle 
is a 2-cycle, which is a transposition. Hence the Galois group contains a trans- 
position and the cycle [123 • * • p], which generate S p (cf. the exercises of Chapter 
I on the symmetric group). Thus the Galois group of X 5 - X - 1 is S p . 

Example 8. The technique of reducing mod primes to get lots of elements 
in a Galois group was used by Schur to determine the Galois groups of classical 
polynomials [Schur 31]. For instance, Schur proves that the Galois group over 
Q of the following polynomials over Q is the symmetric group: 

n 

(a) f(X) — 2 X m /m\ (in other words, the truncated exponential series), if 

m =o 

n is not divisible by 4. If n is divisible by 4, he gets the alternating group. 

(b) Let 

HJX) = 

be the m-th Hermite polynomial. Put 

HJX) = Kf\X 2 ) and H 2n + ì (X) = XKj\X 2 ). 

Then the Galois group of K%\X) over Q is the symmetric group S„ for i = 0, 
1, provided n > 12. The remaining cases were settled in [Schulz 37]. 
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Example 9. This example is addressed to those who know something 
about Riemann surfaces and coverings. Let t be transcendental over the com- 
plex numbers C, and let k = C(í). The values of t in C, or oo, correspond to the 
points of the Gauss sphere S, viewed as a Riemann surface. Let P ì9 ...,P n+í be 
distinct points of S. The finite coverings of S - {Pi,..., P n _ x } are in bijection 
with certain finite extensions of C(t), those which are unramified outside 
P i,..., P n -1 • Let K be the union of all these extension fields corresponding to 
such coverings, and let n^ be the fundamental group of 

S-{P l9 ... 9 P n+l }. 

Then it is known that n ( î ] is a free group on n generators, and has an embedding 
in the Galois group of K over C(í), such that the finite subfields of K over 
C(í) are in bijection with the subgroups of n { ” ] which are of finite index. Given a 
finite group G generated by n elements a í9 ..., a n we can find a surjective 
homomorphism n^ —> G mapping the generators of nf^ on ... ,o n . Let H 
be the kemel. Then H belongs to a subfield K H of K which is normal over C(t) 
and whose Galois group is G. In the language of coverings, H belongs to a 
finite covering of 

S — {Pi > • • • j P n +1 }• 

Over the field C(t) one can use analytic techniques to determine the Galois 
group. The Galois group is the completion of a free group, as proved by 
Douady [Dou 64]. For extensions to characteristic p, see [Pop 95]. A funda- 
mental problem is to determine the Galois group over Q(t), which requires 
much deeper insight into the number theoretic nature of this field. Basic con- 
tributions were made by Belyi [Be 80], [Be 83], who also considered the field 
Q(p)(0, where Q(p) is the field obtained by adjoining all roots of unity to the 
rationals. Belyi proved that over this latter field, essentially all the classical fi- 
nite groups occur as Galois groups. See also Conjecture 14.2 below. 

For Galois groups over Q(0» see the survey [Se 88], which contains a 
bibliography. One method is called the rigidity method, first applied by Shih 
[Shi 74], which I summarize because it gives examples of various notions defined 
throughout this book. The problem is to descend extensions of C(f) with a given 
Galois group G to extensions of Q(t) with the same Galois group. If this extension 
is K over Q (t), one also wants the extension to be regular over Q (see the 
definition in Chapter VIII, §4). To give a sufficient condition, we need some 
definitions. Let G be a finite group with trivial center. Let C x , C 2 , C 3 be conjugacy 
classes. Let P = P{C X , C 2 , C 3 ) be the set of elements 

(0i» 02» 03 ) E C\ x C 2 x C 3 

such that 010203 = 1* Let P' be the subset of P consisting of all elements 
(0i» 02 » 03 ) E P suc h that G is generated by g x , g 2ì g 3 . We say that the family 
(Cj, C 2 , C 3 ) is rigid if G operates transitively on P', and P' is not empty. 
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We define a conjugacy class C of G to be rational if given g e C and a 
positive integer s relatively prime to the order of g , then g s e C. (Assuming that 
the reader knows the terminology of characters defined in Chapter XVIII, this 
condition of rationality is equivalent to the condition that every character \ of 
G has values in the rational numbers Q.) One then has the following theorem, 
which is contained in the works of Shih, Fried, Belyi, Matzat and Thompson. 

Rigidity theorem. Let G be a finite group with trivial center, and let 
Cj, C 2 , C 3 be conjugacy classes which are rational , and such that the family 
(C 1? C 2 , C 3 ) is rigid. Then there exists a Galois extension ofQ(t) with Galois 
group G (and such that the extension is regular over Q). 
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§3. ROOTS OF UISIITY 

Let k be a field. By a root of unity (in k) we shall mean an element Çek 
such that C” = 1 for some integer n ^ 1. If the characteristic of k is p, then the 
equation 

X pm = 1 

has only one root, namely 1, and hence there is no p m -th root of unity except 1. 
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Let n be an integer > 1 and not divisible by the characteristic. The polynomial 

X" - 1 

is separable because its derivative is nX n ~ 1 # 0, and the only root of the deriva- 
tive is 0, so there is no common root. Hence in k a the polynomial X n — 1 has n 
distinct roots, which are roots of unity. They obviously form a group, and we 
know that every finite multiplicative group in a field is cyclic (Chapter IV, 
Theorem 1.9). Thus the group of n- th roots of unity is cyclic. A generator for 
this group is called a primitive n-th root of unity. 

denotes the group of all n-th roots of unity in k a and m , n are relatively 
prime integers, then 


Vmn * Vn 

This follows because p m , \i n cannot have any element in common except 1, 
and because p m p„ consequently has mn elements, each of which is an mn- th 
root of unity. Hence = \i mn , and the decomposition is that of a direct 
product. 

As a matter of notation, to avoid double indices, especially in the prime 
power case, we write \i[n] for \k n . So if p is a prime, \k[p r ] is the group of 
p r - th roots of unity. Then \k[p x ] denotes the union of all \k[p r ] for all 
positive integers r. See the comments in §14. 

Let k be any field. Let n be not divisible by the characteristic p. Let ( = 
Ç n be a primitive n- th root of unity in k a . Let a be an embedding of k(Ç) in k a 
over k. Then 


«r = o(n = i 


so that is an n- th root of unity also. Hence <j£ = Ç l for some integer i = i(<j), 
uniquely determined mod n. It follows that o maps k(Q into itself, and hence 
that k(Q is normal over k. If t is another automorphism of /c(Q over k then 

<ttÇ = C ÍÍ<T)ÍÍT) . 

Since o and x are automorphisms, it follows that i(o) and i( t) are prime to n 
(otherwise, oÇ would have a period smaller than n). In this way we get a homo- 
morphism of the Galois group G of /c(Q over k into the multiplicative group 
(Z/nZ)* of integers prime to n , mod n. Our homomorphism is clearly injective 
since i(o) is uniquely determined by o mod n , and the effect of <r on /c(Ç) is 
determined by its effect on £. We conclude that /c(Q is abelian over k. 

We know that the order of (Z/nZ)* is (p(n). Hence the degree [/c(Q: /c] 
divides cp(n). 

For a specific field /c, the question arises whether the image of G k{c)/k in 
(Z/nZ)* is all of (ZlnZ)*. Looking at k = R or C, one sees that this is not 
always the case. We now give an important example when it is the case. 
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Theorem 3.1. Let Ç be a primitive n-th root of unity. Then 

[Q(C):Q ] = </>("), 

where (p is the Euler function. The map cr i-> i(o) gives an isomorphism 

G Q(0/Q ( Z / nZ )*‘ 

Proof. Let f(X) be the irreducible polynomial of C over Q. Then /(X) 
dividesX” - LsayX" - 1 = /(X)/î(X),whereboth/,/ihaveleadingcoefficient 
1. By the Gauss lemma, it follows that f h have integral coefficients. We shall 
now prove that if p is a prime number not dividing n, then Ç p is also a root of f. 
Since C p is also a primitive w-th root of unity, and since any primitive n- th root of 
unity can be obtained by raising Ç to a succession of prime powers, with primes 
not dividing n, this will imply that all the primitive n-th roots of unity are roots 
of f which must therefore have degree ^ (p(n\ and hence precisely (p(n ). 

Suppose C p is not a root of f Then C p is a root of h , and C itself is a root 
of h(X p ). Hence f(X) divides h(X p \ and we can write 

h(X p ) =f(X)g(X). 

Since / has integral coefficients and leading coefficient 1, we see that g has 
integral coefficients. Since a p = a (mod p) for any integer a, we conclude that 

h(X p ) = h(X) p (mod p), 

and hence 

h(X) p =f(X)g(X) (mod p). 

In particular, if we denote by f and h the polynomials in Z/pZ obtained by 
reducing / and h respectively mod p, we see that / and h are not relatively 
prime, i.e. have a factor in common. But X n - T = / (X)h(X\ and hence 
X n — T has multiple roots. This is impossible, as one sees by taking the de- 
rivative, and our theorem is proved. 

Corollary 3.2. // n, m are relative prime integers ^ 1, then 

Q(C„) n Q(Cm) = Q 

Proof. We note that C„ and Ç m are both contained in Q(Ç mn ) since Ç mn is a 
primitive m-th root of unity. Furthermore, Ç m Ç n is a primitive mn-th root of 
unity. Hence 

Q(CJQ(Cm) = Q(Cm„). 

Our assertion follows from the multiplicativity (p(mn) = (p(m)(p(n). 

Suppose that n is a prime number p (having nothing to do with the character- 
istic). Then 


x p - 1 = (X - ì)(X p ~ l +•••+!). 
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Any primitive p-th root of unity is a root of the second factor on the right of this 
equation. Since there are exactly p - 1 primitive p- th roots of unity, we con- 
clude that these roots are precisely the roots of 

+ ••• + 1 . 

We saw in Chapter IV, §3 that this polynomial could be transformed into 
an Eisenstein polynomial over the rationals. This gives another proof that 
[Q (£p): Q1 =P - 1. 

We investigate more closely the factorization of X n - 1, and suppose that 
we are in characteristic 0 for simplicity. 

We have 


-1 = ri(x - o, 

{ 

where the product is taken over all n- th roots of unity. Collect together all terms 
belonging to roots of unity having the same period. Let 

o/x) = n (x - o 

period £=d 


Then 


x« -1 = n<w. 

d\n 


We see that «^(X) = X — 1, and that 


<*>„(X) = 


X” - 1 

n <mx) 

d\n 

d<n 


From this we can compute 3> (X) recursi vely, and we see that «^(X) is a polynomial 
in Q[X] because we divide recursively by polynomials having coefficients in Q. 
All our polynomials have leading coefficient 1, so that in fact ^ n (X) has integer 
coefficients by Theorem 1.1 of Chapter IV. Thus our construction is essentially 
universal and would hold over any field (whose characteristic does not divide 
n). 

We call í> n (X) the n- th cyclotomic polynomial. 

The roots of <!>„ are precisely the primitive n-th roots of unity, and hence 

deg <P n = <p(n). 

From Theorem 3.1 we conclude that d> n is irreducible over Q, and hence 
$ n (X) = Irr(£„, Q, X). 
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We leave the proofs of the following recursion formulas as exercises: 

1. If p is a prime number, then 

<& p (X) = XP~ l 4 - XP - 2 + • • • + 1 , 
and for an integer r > 1, 

%r(X) = %(XP r ~\ 

2. Let n = p\' • • • p s s be a positive integer with its prime factorization. Then 

%iX) = % l .. Pa (XX'~ l *•"). 


3. If n is odd > 1, then <Ï> 2 /î(^ 0 = O w (—X). 

4 . If p is a prime number, not dividing n , then 

%(XP) 


%n(X) = 


%(*) 


On the other hand, if p\n, then = OnCYO. 

5. We have 

0„(X) = Y\(X n/d - l)^ w >. 

d\n 


As usual, p is the Mòbius function: 

1 0 if n is divisible by p 2 for some prime p, 

( — l) r if n = p x • • • p r is a product of distinct primes, 
1 if n = 1 . 


As an exercise, show that 



if n = 1, 
if n > 1. 


Example. In light of Exercise 21 of Chapter V, we note that the association 
n i-» ^(X) can be viewed as a function from the positive integers into the 
multiplicative group of non-zero rational functions. The multiplication formula 
X n - 1 = can therefore be inverted by the general formalism of 

convolutions. Computations of a number of cyclotomic polynomials show that 
for low values of n, they have coefficients equal to 0 or ±1. However, I am 
indebted to Keith Conrad for bringing to my attention an extensive literature on 
the subject, starting with Bang in 1895. I include only the first and last items: 

A. S. Bang, Om Ligningen 4> m (X) = 0, Nyt Tidsskrift for Matematik (B) 6 (1895), 

pp. 6-12 

H. L. Montgomery and R. C. Vaughn, The order of magnitude of the m-th coef- 
ficients of cyclotomic polynomials, Glasgow Math. J. 27 (1985), pp. 143-159 
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In particular, if 4> n (X) = 'Za nj X j , define L(j ) = log max n | a nj |. Then Montgomery 
and Vaughn prove that 

/ 1/2 / 1/2 
îSbr <s L(J> ' K 4ŷr 

where the sign « means that the left-hand side is at most a positive constant 
times the right-hand side for j —» Bang also points out that <t> ì05 (X) is a 
cyclotomic polynomial of smallest degree having coefficients 0 or ±1: the 
coefficient of X 1 and X 41 is -2 (all others are 0 or ±1). 

If C is an n-th root of unity and £ # 1, then 

1 - i n 

= i + C + --- + C " -1 =o. 

This is trivial, but useful. 

Let ¥ q be the finite field with q elements, q equal to a power of the odd prime 
number p. Then F* has q - 1 elements and is a cyclic group. Hence we have 
the index 

(F* : F* 2 ) = 2. 

If v is a non-zero integer not divisible by p, let 

/v\ f 1 if v = x 2 (mod p ) for some x , 

[p/ l - 1 if V ^ X 2 (mod p) for all jc. 

This is known as the quadratic symbol, and depends only on the residue class 
of v mod p. 

From our preceding remark, we see that there are as many quadratic residues 
as there are non-residues mod p. 


Theorem 3.3. Let Ç be a primitive p-th root of unity , and let 

S 

the sum being taken over non-zero residue classes mod p. Then 

S 2 




= (t> 


Every quadratic extension of Q is contained in a cyclotomic extension. 

Proof. The last statement follows at once from the explicit expression of 
± p as a square in Q(C), because the square root of an integer is contained in the 
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field obtained by adjoining the square root of the prime factors in its factoriza- 
tion,and alsoFurthermore, for the prime 2, we have(1 + í) 2 = 2i. We 
now prove our assertion concerning S 2 . We have 


5 2 


'i 





v + n 


= 1 



rv + n 


As v ranges over non-zero residue classes, so does vp for any fixed \i , and hence 
replacing v by v\i yields 



But 1 -f ( + • • • -f C p 1 = 0, and the sum on the right over \i consequently 
yields —1. Hence 


as desired. 



We see that Q(^p) is contained in Q(Ç, y/—î) or Q(Q, depending on the 
sign of the quadratic symbol with —1. An extension of a field is said to be 
cyclotomic if it is contained in a field obtained by adjoining roots of unity. 
We have shown above that quadratic extensions of Q are cyclotomic. A 
theorem of Kronecker asserts that every abelian extension of Q is cyclotomic, 
but the proof needs techniques which cannot be covered in this book. 


§4. LINEAR INDEPENDENCE OF 
CHARACTERS 

Let G be a monoid and K a field. By a character of G in K (in this chapter), 
we shall mean a homomorphism 


X-G^K* 

of G into the multiplicative group of K. The trivial character is the homo- 
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morphism taking the constant value 1. Functions /•: G -* K are called linearly 
independent over K if whenever we have a relation 


a \f\ + ‘ ‘ + a nfn = 0 


with Qi e K , then all a f = 0. 

Examples. Characters will occur in various contexts in this book. First, 
the various conjugate embeddings of an extension field in an algebraic closure 
can be viewed as characters. These are the characters which most concern us in 
this chapter. Second, we shall meet characters in Chapter XVIII, when we shall 
extend the next theorem to a more general kind of character in connection with 
group representations. 

Next, one meets characters in analysis. For instance, given an integer m, the 
function/: R/Z —» C* such that/(jc) = e 2mmx is a character on R/Z. It can be 
shown that all continuous homomorphisms of R/Z into C* are of this type. 
Similarly, given a real number y , the function x i-> e 2mxy is a continuous character 
on R, and it is shown in Fourier analysis that all continuous characters of absolute 
value 1 on R are of this type. 

Further, let X be a compact space and let R be the ring of continuous complex- 
valued functions on X. Let R* be the group of units of R. Then given jc e X the 
evaluation map/i-> /( jc) is a character of R* into C*. (Actually, this evaluation 
map is a ring homomorphism of R onto C.) 

Artin found a neat way of expressing a linear independence property which 
covers all these cases, as well as others, in the following theorem [Ar 44]. 

Theorem 4.1. (Artin). Let G be a monoid and K a field. Let X\* • - Xn 

be distinct characters of G in K. Then they are linearly independent over K. 

Proof. One character is obviously linearly independent. Suppose that we 
have a relation 


a lXl + •• + a nXn= 0 

with a t e K, not all 0. Take such a relation with n as small as possible. Then 
n ^ 2, and no a t is equal to 0. Since Xu Xi are distinct, there exists zeG such 
that xfz) # z 2 ( z )* F° r all x e G we have 

a iXi(xz) + ••• + a„x„(xz) = 0, 

and since Xì is a character, 

a lXl(z)Xl + ■■■ + a„Xn(z)Xn = 0. 

Divide by Xi( z ) and subtract from our first relation. The term a x 'Xi cancels, and 
we get a relation 


(“ 2 _ “ 3 ) x2+ 
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The first coefficient is not 0, and this is a relation of smaller length than our first 
relation, contradiction. 

As an application of Artin’s theorem, one can consider the case when K is a 
finite normal extension of a field k, and when the characters are distinct auto- 
morphisms o x , ..., o n of K over k, viewed as homomorphisms of K* into K*. 
This special case had already been considered by Dedekind, who, however, 
expressed the theorem in a somewhat different way, considering the determinant 
constructed from o x cúj where coj is a suitable set of elements of K, and proving in 
a more complicated way the fact that this determinant is not 0. The formulation 
given above and its particularly elegant proof are due to Artin. 

As another application, we have: 

Corollary 4.2. Let <x ì9 ..., <x n be distinct non-zero elements of afield K. If 

a x ,..., a n are elements of K such that for all integers v ^ 0 we have 

a x (x\ + ••• + a n (x x n = 0 

then a { = 0 for all i. 

Proof We apply the theorem to the distinct homomorphisms 

v i—► <x? 

of Z^ 0 into K*. 

Another interesting application will be given as an exercise (relative in- 
variants). 


§5. THE NORM AND TRACE 

Let E be a finite extension of k. Let [E : /c] s = r, and let 

P^ = ÍE:k]i 

if the characteristic is p > 0, and 1 otherwise. Let o l9 ..., o r be the distinct 
embeddings of E in an algebraic closure k a of k. If <x is an element of E, we 
define its norm from E to k to be 

N E/k (a) = Nf(oi) = Ó <T v« p " = ( fl 

V = 1 \v= 1 

Similarly, we define the trace 


^[E:k]i 


Tr E/k (a) = Trfo) = [£:*],£*,«. 

v = 1 

The trace is equal to 0 if [E : /c] t > 1, in other words, if E/k is not separable. 
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Thus if E is separable over /c, we have 

Nfa) = 

a 

where the product is taken over the distinct embeddings of E in /c a over k. 
Similarly, if Ejk is separable, then 

Tif(a) = £ <r*. 

a 

Theorem 5.1. Let E/k be a finite extension. Then the norm Nf is a multi- 
plicative homomorphìsm of E* into k* and the trace is an additive homo- 
morphism of E into k. lfE^F^>kisa tower of fields, then the two maps are 
transitive, in other words, 

Nf = N r k o Nf and Tr k £ = Trf o Trf. 

IfE = k(a), andf(X) = Irr(a, k, X) = X" + a„_ 1 A' n ~ 1 + • • • + a 0 , then 

N k k '*\a) = (— l)"a 0 and 7r kM (a) = 

Proof. For the first assertion, we note that a plA is separable over k if 
p p = [£: fc] t . On the other hand, the product 

t[o v a p “ 

V — 1 

is left fixed under any isomorphism into /c a because applying such an iso- 
morphism simply permutes the factors. Hence this product must lie in k since 
a pM is separable over k. A similar reasoning applies to the trace. 

For the second assertion, let {t ; } be the family of distinct embeddings of F 
into /c a over k. Extend each r } to an automorphism of fc 3 , and denote this 
extension by Xj also. Let {aj be the family of embeddings of E in /c a over F. 
(Without loss of generality, we may assume that E c k a .) If cr is an embedding 
of E over k in /c a , then for some / i j l o leaves F fixed, and hence tJ l o = cr, for 
some i. Hence cr = TjOi and consequently the family {t^ crj gives all distinct 
embeddings of E into k a over k. Since the inseparability degree is multiplicative 
in towers, our assertion concerning the transitivity of the norm and trace is 
obvious, because we have already shown that Np maps E into F, and similarly 
for the trace. 

Suppose now that E = /c(a). We have 


/(X) = ((*- ai )...(*- a r )) [£;fel * 


if a j,..., a r are the distinct roots of/. Looking at the constant term of f gives us 
the expression for the norm, and looking at the next to highest term gives us the 
expression for the trace. 

We observe that the trace is a /c-linear map of E into /c, namely 

Trf(ca) = c Trf(a) 
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for all a e E and cek. This is clear since c is fixed under every embedding of 
E over k. Thus the trace is a /c-linear functional of E into k. For simplicity, 
we write Tr = Trf. 

Theorem 5.2. Let E be afinite separable extension of k. Then Tr: E -* k is 
a non-zero functional. The map 

(x, y) i ► Tr(xy) 

ofE x E -> kis bilinear , and identifies E mth its dual space. 

Proofi That Tr is non-zero follows from the theorem on linear indepen- 
dence of characters. For each xeE, the map 

Tr x : E - k 

such that Tr x (y) = Tr(xy) is obviously a /c-linear map, and the map 

x i—► Tr x 

is a /:-homomorphism of E into its dual space £ v . (We don’t write E* for the 
dual space because we use the star to denote the multiplicative group of E.) 
If Tr x is the zero map, then Tr(x£) = 0. If x # 0 then xE = E. Hence the 
kernel of x i—► Tr x is 0. Hence we get an injective homomorphism of E into 
the dual space Ê. Since these spaces have the same finite dimension, it follows 
that we get an isomorphism. This proves our theorem. 

Corollary 5.3. Let (o x , ..., (o n be a basis of E over k. Then there exists a 
basis co\,... ,(ú' n of E over k such that Tr(co,coj) = <5^-. 

Proofi The basis a>\, ..., co n is none other than the dual basis which we 
defined when we considered the dual space of an arbitrary vector space. 

Corollary 5.4. Let E be a finite separable extension of k, and let a u ... ,a n 
be the distinct set of embeddings of E into /c a over k. Let w l5 ..., w n be ele- 
ments of E. Then the vectors 

Él = (fflVV!, . . . , (X^VV,,), 


L = {a„w u ...,a n w n ) 

are linearly independent over E if w í9 ..., w n form a basis of E over k. 

Proofi Assume that w l5 ..., w n form a basis of E/k. Let oc u ..., oc n be ele- 
ments of E such that 


«ìÉi + ••• + oi n Ç n = 0. 


Then we see that 


û(i<7i + ••• 4* cc n a, 
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applied to each one ofw l9 ...,w n gives the value 0. But a í9 . .., a n are linearly 
independent as characters ofthe multiplicative group £* into /c a *. It follows that 
a, = 0 for i = 1 ,..., n, and our vectors are linearly independent. 

Remark. In characteristic 0, one sees much more trivially that the trace is 
not identically 0. Indeed, if c e k and c # 0, then Tr(c) = nc where n = [E : /c], 
and n # 0. This argument also holds in characteristic p when n is prime to p. 


Proposition 5.5. Let E = /c(a) be a separable extension. Let 


f(X ) = Irr(a, /c, X), 
and let f\X) be its derivative. Let 

a) = + + ’' ’ + Pn-\X n ~ 1 

with pi e E. Then the dual basis of 1, a,..., a" _ 1 is 

Jo_ Pn^ 

/'(a)’ *• •’/'(<*)• 


Prco/. Let a j,..., a n be the distinct roots of/ Then 


” f(X) q? 
Â (Jf " a<) / (a,) 


for 0 ^ r ^ n — 1. 


To see this, let #(X) be the difference of the left- and right-hand side of this 
equality. Then g has degree ^ n — 1, and has n roots a l9 ..., a n . Hence g is 
identically zero. 

The polynomials 

JVO a? 

(X - a f ) /'(a f ) 


are all conjugate to each other. If we define the trace of a polynomial with 
coefficients in E to be the polynomial obtained by applying the trace to the 
coefficients, then 


Tr 


; f(X) q r ~ 

(X - a) /'(a/ 


= x r . 


Looking at the coefficients of each power of X in this equation, we see that 



thereby proving our proposition. 

Finally we establish a connection with determinants, whose basic properties 
we now assume. Let E be a finite extension of k, which we view as a finite 
dimensional vector space over k. For each a e E we have the k -linear map 
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multiplication by a, 

m a : E —» E such that m a (x) = ax. 

Then we have the determinant det (m a ), which can be computed as the determinant 
of the matrix M a representing m a with respect to a basis. Similarly we have the 
trace Tr (m a ) 9 which is the sum of the diagonal elements of the matrix M a . 

Proposition 5.6. Let E be a finite extension ofk and let a e E. Then 

det (m a ) = N E/k (a) and Tr (m a ) = Tr Elìc (a). 

Proof. Let F = k(a). If [F : k] = d , then 1, a,. . . , c^ _1 is a basis 
for F over k. Let {w u ..., w r } be a basis for E over F. Then {a l Wj} 
(i = 0,..., d — 1; j = 1,.. ., r) is a basis for E over k. Let 

f(X) =X d + a d - { X d ~ 1 + ... + «o 

be the irreducible polynomial of a over k. Then N F/k (a) = (— ì) d a 0 , an< l by the 
transitivity of the norm, we have 

NE/k ( °ì) = N F/k (a) r . 

The reader can verify directly on the above basis that N F / k (u) r is the determinant 
of m a on F, and then that N F/k (a) d is the determinant of m a on E , thus concluding 
the proof for the determinant. The trace is handled exactly in the same way, 
except that Tr E/k (a) = r • Tr F/k (a). The trace of the matrix for m a on F is equal 
to —a d - x . From this the statement identifying the two traces is immediate, as it 
was for the norm. 


§6. CYCLIC EXTENSIONS 

We recall that a finite extension is said to be cyclic if it is Galois and its 
Galois group is cyclic. The determination of cyclic extensions when enough roots 
of unity are in the ground field is based on the following fact. 

Theorem 6.1. (Hilbert’s Theorem 90). Let Kjk be cyclic of degree n 
with Galois group G. Let <j be a generator of G. Let f e K. The norm 
N k (fi) = N(p) is equal to 1 if and only if there exists an element a # 0 in K 
such that P = a/cra. 

Proof. Assume such an element a exists. Taking the norm of we get 
N(a)/N((j<x). But the norm is the product over all automorphisms in G. Inserting 
o just permutes these automorphisms. Hence the norm is equal to 1. 

It will be convenient to use an exponential notation as follows. If t, t' g G 
and Çe K we write 

+T ' = ÇÇ'. 
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By Artin’s theorem on characters, the map given by 
id + pa + p í+a a 2 + ••• + p í+a + 

on K is not identically zero. Hence there exists 9 e K such that the element 

a = e + pd a 4- p l+a e a2 + ••• + p l +*+-+<r"- 2 0*»- 1 

is not equal to 0. It is then clear that /íot a = a using the fact that iV(j?) = 1, and 
hence that when we apply o to the last term in the sum, we obtain 9. We divide 
by ot a to conclude the proof. 

Theorem 6.2. Let k be a field, n an integer > 0 prime to the characteristic 
of k, and assume that there is a primitive n-th root of unity in k. 

(i) Let Kbe a cyclic extension of degree n. Then there exists ote K such that 
K = k( a), and ot satisfies an equation X n — a = 0 for some aek. 

(ii) Conversely , let aek. Let otbe a root of X n — a. Then k(ot) is cyclic over 
k, of degree d,d\n, and ot d is an element of k. 

Proof. Let Ç be a primitive n-th root of unity in k, and let K/k be cyclic with 
groupG. Let o be a generator of G. We have N(Ç~ l ) = (Ç -1 )” = 1. ByHilbert’s 
theorem 90, there exists ote K such that cra = £a. Since £ is in k, we have 
o l ot = C'a for i = 1 ,... ,n. Hence the elements £ l a are n distinct conjugates of ot 
over k, whence [/c(a): /c] is at least equal to n. Since [K : k ] = n, it follows that 
K = k(ot). Furthermore, 


o(ot n ) = o(ot) n = (Ça)" = ot n . 


Hence ot n is fixed under cr, hence is fixed under each power of cr, hence is fixed 
under G. Therefore ot n is an element of k, and we let a = ot n . This proves the 
first part of the theorem. 

Conversely, let a e k. Let abea root of X n - a. Then aÇ' is also a root for 
each i = 1 ,. . ., n, and hence all roots lie in k(a) which is therefore normal over 
k. All the roots are distinct so k(a) is Galois over k. Let G be the Galois group. 

If cr is an automorphism of k(ot)/k then cra is also a root of X n — a. Hence 
oot = co a ot where co a is an n- th root of unity, not necessarily primitive. The map 
cr i—► is obviously a homomorphism of G into the group of n- th roots of unity, 
and is injective. Since a subgroup of a cyclic group is cyclic, we conclude that 
G is cyclic, of order d, and d\n. The image of G is a cyclic group of order d. 
If cr is a generator of G, then co^. is a primitive d\h root of unity. Now we get 


o(ot d ) = (oot) d = (co a ot) d = ot d . 


Hence ot d is fixed under o, and therefore fixed under G. It is an element of k, and 
our theorem is proved. 
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We now pass to the analogue of Hilbert’s theorem 90 in characteristic p for 
cyclic extensions of degree p. 

Theorem 6.3. (Hilbert’s Theorem 90, Additive Form). Let k be a field and 
K/k a cyclic extension of degree n with group G. Let o be a generator of G. 
Let /? g K. The trace Tr£(/?) is equal to 0 if and only if there exists an element 
ote K such that /? = ot — cra. 

Proof If such an element a exists, then we see that the trace is 0 because 
the trace is equal to the sum taken over all elements of G, and applying c per- 
mutes these elements. 

Conversely, assume Tr(j?) = 0. There exists an element 0 e K such that 
Tr(0) # 0. Let 

a = W Qa + (P + ^ +- + (/) + <r/)+- + <J n ~ 2 ii)e an -'l 

From this it follows at once that /? = a - cra. 

Theorem 6.4. (Artin-Schreier) Let kbe a field of characteristic p. 

(i) Let Kbea cyclic extension of k of degree p. Then there exists a e K such 
that K = /c(a) and a satisfies an equation X p — X — a = 0 with some 
ae k. 

(ii) Conversely , given aek , the polynomial f(X) = X p — X — a either has 
one root in /c, in which case all its roots are in /c, or it is irreducible. In 
this latter case , if oc is a root then k(ot) is cyclic of degree p over k. 

Proof Let K/k be cyclic of degree p. Then Trf( — 1) = 0 (it is just the sum 
of — 1 with itself p times). Let c be a generator of the Galois group. By the 
additive form of Hilbert’s theorem 90, there exists ote K such that cot — ot = 1, 
or in other words, cot = ot + 1. Hence c'ot = a + i for all integers i = 1,..., p 
and ot has p distinct conjugates. Hence [/c(a): /c] ^ p. It follows that K = k(ot). 
We note that 

cr(a p — a) = c(ot) p — c(ot) = (a -F ì) p — (a -f 1) = ot p — ot. 

Hence a p — ot is fixed under cr, hence it is fixed under the powers of cr, and 

therefore under G. It lies in the fixed field k. If we let a = ot p — ot we see that 
our first assertion is proved. 

Conversely, let aek. If ot is a root of X p — X — a then ot + i is also a 

root for i = 1,..., p. Thus f(X) has p distinct roots. If one root lies in k 

then all roots lie in k. Assume that no root lies in k. We contend that the 
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polynomial is irreducible. Suppose that 

f(X) = g(X)h(X) 

with g,he k\_X] and 1 ^ deg g < p. Since 

f(X) =f\(X-cc- 0 

i= 1 

we see that g(X) is a product over certain integers i. Let d = deg g. The co- 
efficient of X d ~ l in g is a sum of terms —(a + i) taken over precisely d integers 
i. Hence it is equal to — da -f j for some integer j. But d 0 in k, and hence 
a lies in k y because the coefficients of g lie in k , contradiction. We know therefore 
that/(X) is irreducible. All roots lie in k(a ), which is therefore normal over k. 
Sin ct f(X) has no multiple roots, it follows that k(cc) is Galois over k. There 
exists an automorphism o of k(oc) over k such that oot = a + 1 (because a + 1 
is also a root). Hence the powers o l of o give o l a = a + i for i = 1,..., p and 
are distinct. Hence the Galois group consists of these powers and is cyclic, 
thereby proving the theorem. 

For cyclic extensions of degree p r , see the exercises on Witt vectors and the 
bibliography at the end of §8. 


§7. SOLVABLE AND RADICAL EXTENSIONS 

A finite extension E/k (which we shall assume separable for convenience) is 
said to be solvable if the Galois group of the smallest Galois extension K of k 
containing E is a solvable group. This is equivalent to saying that there exists a 
solvable Galois extension L of k such that k cz E cz L. Indeed, we have 
k cz E c= K cz L and G(K/k) is a homomorphic image of G(L/k). 

Proposition 7.1. Solvable extensions form a distinguished class of extensions. 

Proof. Let E/k be solvable. Let F be a field containing k and assume £, F 
are subfields of some algebraically closed field. Let K be Galois solvable over fc, 
and E a K. Then KF is Galois over F and G(KF/F) is a subgroup of G(K/k) 
by Theorem 1.12. Hence EF/F is solvable. It is clear that a subextension of a 
solvable extension is solvable. Let E F => fc be a tower, and assume that E/F 
is solvable and F/k is solvable. Let K be a finite solvable Galois extension of fc 
containing F. We just saw that EK/K is solvable. Let L be a solvable Galois 
extension of K containing EK. If o is any embedding of L over fc in a given 
algebraic closure, then oK = K and hence oL is a solvable extension of K. We 
let M be the compositum of all extensions oL for all embeddings o of L over fc. 
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Then M is Galois over /c, and is therefore Galois over K. The Galois group of 
M over K is a subgroup of the product 

n g(vl/k) 

a 

by Theorem 1.14. Hence it is solvable. We have a surjective homomorphism 
G(M/k) -► G(K/k) by Theorem 1.10. Hence the Galois group of M/k has a 
solvable normal subgroup whose factor group is solvable. It is therefore 
solvable. Since E a M, our proof is complete. 


EK 



E | 

I K 



F 


k 

A finite extension F of k is said to be solvable by radicalsif it is separable and 
if there exists a finite extension E of k containing F, and admitting a tower 
decomposition 

U£ 0 c£ 1 c£ 2 c...c£ m = £ 
such that each step E i+ ,/£,■ is one of the following types: 

1. It is obtained by adjoining a root of unity. 

2. It is obtained by adjoining a root of a polynomial X n — a with ae E^ and 
n prime to the characteristic. 

3. It is obtained by adjoining a root of an equation X p — X — a with 
a e Ei if p is the characteristic > 0. 

One can see at once that the class of extensions which are solvable by 
radicals is a distinguished class. 

Theorem7.2. Let E be a separable extension of k. Then E is solvable by 

radicals if and only if E/k is solvable. 

Proof Assume that E/k is solvable, and let K be a finite solvable Galois 
extension of k containing E. Let m be the product of all primes unequal to the 
characteristic dividing the degree \_K : /c], and let F = /c(Ç) where C is a primitive 
m-th root of unity. Then F/k is abelian. We lift K over F. Then KF is solvable 
over F. There is a tower of subfields between F and KF such that each step is 
cyclic of prime order, because every solvable group admits a tower of sub- 
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groups of the same type, and we can use Theorem 1.10. By Theorems 6.2 and 
6.4, we conclude that KF is solvable by radicals over F, and hence is solvable 
by radicals over k. This proves that E/k is solvable by radicals. 


KF 



Conversely, assume that E/k is solvable by radicals. For any embedding cr 
of E in £ a over /c, the extension aE/k is also solvable by radicals. Hence the 
smallest Galois extension K of E containing /c, which is a composite of E and 
its conjugates is solvable by radicals. Let m be the product of all primes unequal 
to the characteristic dividing the degree \_K : /c] and again let F = k(Ç) where C 
is a primitive m-th root of unity. It will suffice to prove that KF is solvable over 
F, because it follows then that KF is solvable over k and hence G(K/k) is solvable 
because it is a homomorphic image of G(KF/k). But KF/F can be decomposed 
into a tower of extensions, such that each step is prime degree and of the type 
described in Theorem 6.2 or Theorem 6.4, and the corresponding root of unity 
is in the field F. Hence KF/F is solvable, and our theorem is proved. 

Remark. One could modify our preceding discussion by not assuming 
separability. Then one must deal with normal extensions instead of Galois 
extensions, and one must allow equations X p — a in the solvability by radicals, 
with p equal to the characteristic. Then we still have the theorem corresponding 
to Theorem 7.2. The proof is clear in view of Chapter V, §6. 

For a proof that every solvable group is a Galois group over the rationals, I 
refer to Shafarevich [Sh 54], as well as contributions of Iwasawa [Iw 53]. 

[Iw 53] K. Iwasawa, On solvable extension of algebraic number fields, Ann. ofMath. 
58 (1953), pp. 548-572 

[Sh 54] I. Shafarevich, Construction of fields of algebraic numbers with given solvable 
Galois group, Izv. Akad. Nauk SSSR 18 (1954), pp. 525-578 ( Amer. Math. 
Soc. Transl. 4 (1956), pp. 185-237) 


§8. ABELIAN KUMMER THEORY 

In this section we shall carry out a generalization of the theorem concerning 
cyclic extensions when the ground field contains enough roots of unity. 

Let k be a field and m a positive integer. A Galois extension K of k with 
group G is said to be of exponent m if cr m = 1 for all cr e G. 
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We shall investigate abelian extensions of exponent m. We first assume 
that m is prime to the characteristic of /c, and that k contains a primitive m-th 
root of unity. We denote by \i m the group of m-th roots of unity. We assume 
that all our algebraic extensions in this section are contained in a fixed algebraic 
closure /c a . 

Let aek. The symbol a ìlm (or j/a) is not well defined. If a m = a and ( is 
an m-th root of unity, then (Ca) m = a also. We shall use the symbol a ì,m to 
denote any such element a, which will be called an m-th root of a. Since the 
roots of unity are in the ground field, we observe that the field /c(a) is the same 
no matter which m-th root a of a we select. We denote this field by k(a í,m ). 

We denote by /c* m the subgroup of /c* consisting of all m-th powers of non- 
zero elements of k. It is the image of /c* under the homomorphism x i—► x m . 

Let B be a subgroup of /c* containing /c* m . We denote by k(B í,m ) or K B the 
composite of all fields k(a í,m ) with ae B. It is uniquely determined by B as a 
subfield of /c a . 

Let aeB and let a be an m-th root of a. The polynomial X m — a splits into 
linear factors in K Bì and thus K B is Galois over /c, because this holds for all 
aeB. Let G be the Galois group. Let aeG. Then cra = (o a oc for some m-th 
root of unity co a e \i m c= /c*. The map 

G\-^CO a 

is obviously a homomorphism of G into p m , i.e. for t , g e G we have 


tcra = co x co a Qí = co a co x oc. 

We may write co a = gol/ol. This root of unity co a is independent of the choice 
of m-th root of a , for if a' is another m-th root, then ot' = £oc for some (e\i m , 
whence 

cra'/a' = £goc/£oc = goc/oc. 

We denote co a by <<j, a}. The map 

(g, a) i ► <(J, a > 

gives us a map 

G x B^\i m . 

If a, b e B and a m = a, p m = b then (aj?) m = ab and 

G(ocp)/ocp = (G(x/oc)(Gp/p). 

We conclude that the map above is bilinear. Furthermore, if a e /c* m it follows 
that <cr, a} = 1. 


Theorem 8.1. Let kbea field , m an integer > 0 prime to the characteristic of 
/c, and assume that a primitive m-th root of unity lies in k. Let Bbea subgroup 
of /c* containing /c* m and let K B = k(B ì,m ). Then K B is Galois , and abelian 
of exponent m. Let G be its Galois group. We have a bilinear map 

G x B \i m given by (<r, a) i-> <<j, a}. 
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IfasG and aeB , and oc m = a then <<r, a > = aoc/oc. The hernel on the left is 1 
and the kerne\ on the right is /c* m . The extension K B /k isfinite if and only if 
(B : k* m ) is finite. If that is the case, then 

B/k* m « G A , 

and in particular we have the equality 

[K B :k] = (B:k* m ). 

Proof. Let oeG. Suppose <ít, a} = 1 for all ae B. Then for every gener- 
ator oc of K B such that oc m = ae Bwe have ooc = oc. Hence cr induces the identity 
on K b and the kernel on the left is L Let ae B and suppose <cr, a} = 1 for all 
oeG. Consider the subfield k(a í/m ) of K B . If a ì/m is not in there exists an 
automorphism of k(a ì/m ) over k which is not the identity. Extend this auto- 
morphism to K B , and call this extension cr. Then clearly <cr, a} # 1. This 
proves our contention. 

By the duality theorem of Chapter I, §9 we see that G is finite if and only 
if B/k* m is finite, and in that case we have the isomorphism as stated, so that 
in particular the order of G is equal to (B : k* m ), thereby proving the theorem. 

Theorem 8.2. Notation being as in Theorem 8.1, the map B^ K B gives a 
bijection of the set of subgroups ofk* containing k* m and the abelian extensions 
of k of exponent m. 

Proof. Let B u B 2 be subgroups of k* containing k* m . If B x c= B 2 then 
k(B\ /m ) c= k(B\ /m ). Conversely, assume that k(B\ /m ) c= k(B\ /m ). We wish to 
prove B x c= B 2 . Let b e B x . Then k(b í/m ) c= k(B\ /m ) and k(b ì/m ) is contained in 
a finitely generated subextension of k(B\ /m ). Thus we may assume without loss 
of generality that B 2 /k* m is finitely generated, hence finite. Let B 3 be the sub- 
group of k* generated by B 2 and b. Then k(B\ /m ) = k(B\ /m ) and from what we 
saw above, the degree of this field over k is precisely 

(B 2 : k* m ) or (B 3 :k* m ). 

Thus these two indices are equal, and B 2 = B 3 . This proves that B x c= B 2 . 

We now have obtained an injection of our set of groups B into the set of 
abelian extensions of k of exponent m. Assume finally that K is an abelian 
extension of k of exponent m. Any finite subextension is a composite of cyclic 
extensions of exponent m because any finite abelian group is a product of 
cyclic groups, and we can apply Corollary 1.16. By Theorem 6.2, every cyclic 
extension can be obtained by adjoining an m-th root. Hence K can be obtained 
by adjoining a family of m-th roots, say m-th roots of elements {bj} jeJ with 
bj e k*. Let B be the subgroup of k* generated by all bj and k* m . If b' = ba m 
with a,bek then obviously 

k(b ,í/m ) = k(b ì/m ). 

Hence k(B í/m ) = K , as desired. 
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When we deal with abelian extensions of exponent p equal to the char- 
acteristic, then we have to develop an additive theory, which bears the same 
relationship to Theorems 8.1 and 8.2 as Theorem 6.4 bears to Theorem 6.2. 

If/c is a field, we define the operator p by 

p(x) = x p - x 

for x e k. Then p is an additive homomorphism of k into itself. The subgroup 
p(k) plays the same role as the subgroup /c* m in the multiplicative theory, 
whenever m is a prime number. The theory concerning a power of p is slightly 
more elaborate and is due to Witt. 

We now assume k has characteristic p. A root of the polynomial X p — X — a 
with aek will be denoted by p~ l a. If B is a subgroup of k containing pk 
we let K b = k(p~ l B) be the field obtained by adjoining p~ l a to k for all a e B. 
We emphasize the fact that B is an additive subgroup of k. 

Theorem 8.3. Let kbe a field of characteristic p. The map B\-> k(p~ l B) 
is a bijection between subgroups of k containing pk and abelian extensions of 
k of exponent p. Let K = K B = k(p~ l B), and let G be its Galois group. 
If o eG and a e B, and pot = a, ìet <<r, a} = <ra - a. Then we have a bilinear 
map 

G x B -+ Z/pZ given by (o , a) -► <a, a). 

The kernel on the ìeft is 1 and the kernel on the right is pk. The extension 
K B /k is finite if and only if (B : pk) is finite and if that is the case , then 


ÍK B :k] = (B : pk). 


Proof. The proof is entirely similar to the proof of Theorems 8.1 and 8.2. 
It can be obtained by replacing multiplication by addition, and using the “ p-th 
root” instead of an m-th root. Otherwise, there is no change in the wording of 
the proof. 

The analogous theorem for abelian extensions of exponent p n requires 
Witt vectors, and will be developed in the exercises. 

Bibliography 

[Wi 35 ] E. Witt, Der Existenzsatz fiir abelsche Funktionenkôrper, J. reine angew. 
Math. 173 ( 1935 ), pp. 43-51 

[Wi 36 ] E. Witt, Konstruktion von galoisschen Kòrpern der Charakteristik p mit 
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237-245 

[Wi 37] E. Witt, Zyklische Kòrper und Algebren der Charakteristik p vom Grad p n . 

Struktur diskret bewerteter perfekter Korper mit vollkommenem Restklas- 
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§9. THE EQUATION X n - a = 0 

When the roots of unity are not in the ground field, the equation X n - a = 0 
is still interesting but a little more subtle to treat. 

Theorem 9.1. Let kbea field and n an integer ^ 2. Let a e k, a # 0. Assume 
that for all prime numbers p such that p\n we have a$k p , and if 4\n then 
a$ — 4/c 4 . Then X n — a is irreducible in k[X~\. 

Proof Our first assumption means that a is not a p-th power in k. We 
shall reduce our theorem to the case when n is a prime power, by induction. 
Write n = p r m with p prime to m , and p odd. Let 

m 

X" - a = n (* - «.) 

V= 1 

be the factorization of X m — a into linear factors, and say a = a,. Substituting 
X pr for X we get 


X” — a = X p " m - a = n ( X p " - a v ). 

v = 1 

We may assume inductively that X m — a is irreducible in k[X ]. We contend 
that a is not a p-th power in /c(a). Otherwise, a = f e /c(a). Let iV be the 
norm from /c(a) to k. Then 


-a = (- l) m yV(a) = (- 1 ) m N(P P ) = (- l) m N(P) p . 


If m is odd, a is a p-th power, which is impossible. Similarly, if m is even and p 
is odd, we also get a contradiction. This proves our contention, because m is 
prime to p. If we know our theorem for prime powers, then we conclude that 
X pr — a is irreducible over /c(a). If A is a root of X pr — a then k c= /c(a) c= k(A) 
gives a tower, of which the bottom step has degree m and the top step has degree 
p r . It follows that A has degree n over k and hence that X n — a is irreducible. 

We now suppose that n = p r is a prime power. 

If p is the characteristic, let a be a p-th root of a. Then X p — a = (X — a) p 
and hence X pr — a = (2f pr_ 1 — a) p if r ^ 2. By an argument even more trivial 
than before, we see that a is not a p-th power in k(a), hence inductively 
XP r — a is irreducible over k(a). HenceX^ — a is irreducible over k. 

Suppose that p is not the characteristic. We work inductively again, and 
let abea root of X p — a. 

Suppose a is not a p-th power in k. We claim that X p — a is irreducible. 
Otherwise a root a of X p — a generates an extension k(a) of degree d < p 
and a p = a. Taking the norm from k(a) to k we get N(a) p = a d . Since d is 
prime to p, it follows that a is a p-th power in k , contradiction. 
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Let r ^ 2. We let a = a r We have 

p 

X p -a = YUx-aJ 

v — 1 

and 

p 

XP r - a= n (XP r ~' - a„). 

V=l 

Assume that a is not a p-th power in /c(a). Let A be a root of X pr ~ 1 — a. If p 
is odd then by induction, A has degree p r ~ 1 over /c(a), hence has degree p r over 
k and we are done. If p = 2, suppose a = — 4/? 4 with /? e /c(a). Let N be the 
norm from £(a) to k. Then — 0 = A(a) = 16A(/3) 4 , so —a is a square in /:. Since 
p = 2 we get V-T e &(a) and a = (V—T 2/3 2 ) 2 , a contradiction. Hence again 
by induction, we fìnd that A has degree p r over k. We therefore assume that 
a = & p with some /3 e k(a ), and derive the consequences. 

Taking the norm from /c(a) to /c we find 

-a = (_iyw(a) = (-l) p N(jS p ) = (- 1) Í W(/?) P . 

If p is odd, then a is a p-th power in /c, contradiction. Hence p = 2, and 

-0 = N (/?) 2 

is a square in k. Write —a = b 2 with bek. Since u is not a square in k we con- 
clude that — 1 is not a square in k. Let i 2 = — 1. Over k(i) we have the factoriza- 
tion 

JY 2r - a = AT 2r Hh b 2 = (X 2r ~ l + ib)(X 2r " - ib). 

Each factor is of degree 2 r ~ 1 and we argue inductively. If A" 2r ~ 1 ± ib is reducible 
over k(i) then ± ib is a square in k(i) or lies in — 4(/c(i)) 4 . In either case, ± ib is a 
square in k(i), say 


± ib = (c + di) 2 = c 2 + 2 cdi — d 2 

with c, d e k. We conclude that c 2 = d 2 or c = ±d, and ±ib = 2cJ/ = ±2 c 2 i. 
Squaring gives a contradiction, namely 

a = — b 2 = -4 c 4 . 

We now conclude by unique factorization that X r + b 2 cannot factor in 
/c[A"], thereby proving our theorem. 

The conditions of our theorem are necessary because 

X 4 + 4b 4 = (X 2 + 2 bX + lb 2 )(X 2 - 2bX + 2 b 2 ). 

If n = 4m and a e —4 /c 4 then X n — a is reducible. 
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Corollary 9.2. Let k be afield and assume that aek,a^0, and that a is not 
a p-th powerfor some prime p. If p is equal to the characteristic , or if p is odd, 
thenfor every integer r ^ 1 the polynomial X pr - a is irreducible over k. 

Proof. The assertion is logically weaker than the assertion of the theorem. 

Corollary 9.3. Let kbe a field and assume that the algebraic closure k a of k 
is of finite degree > 1 over k. Then k a = k(í) where i 2 = -1, and k has 
characteristic 0. 

Proof. We note that fc* is normal over k. If k* is not separable over k, so 
char k = p > 0, then k? is purely inseparable over some subfield of degree > 
1 (by Chapter V, §6), and hence there is a subfield E containing k , and an element 
a e E such that X p - a is irreducible over E. By Corollary 9.2, k* cannot be of 
finite degree over E. (The reader may restrict his or her attention to characteristic 
0 if Chapter V, §6 was omitted.) 

We may therefore assume that k a is Galois over k. Let /cj = k(i). Then /c a 
is also Galois over k v Let G be the Galois group of /c a //c,. Suppose that there 
is a prime number p dividing the order of G, and let H be a subgroup of order p. 
Let F be its fixed field. Then [/c a : F] = p. I ïp is the characteristic, then Exercise 
29 at the end of the chapter will give the contradiction. We may assume that p 
is not the characteristic. The p-ih roots of unity # 1 are the roots of a poly- 
nomial of degree ^ p — 1 (namely X p ~ 1 -f ■ • • -f 1), and hence must lie in F. 
By Theorem 6.2, it follows that k a is the splitting field of some polynomial 
X p — a with ae F. The polynomial X pl — a is necessarily reducible. By the 
theorem, we must have p = 2 and a = -4Z? 4 with beF. This implies 

k a = F(a íl2 ) = F( i). 

But we assumed i e /c,, contradiction. 

Thus we have proved k* = k(i). It remains to prove that char k = 0, and for 
this I use an argument shown to me by Keith Conrad. We first show that a sum 
of squares in k is a square. It suffices to prove this for a sum of two squares, 
and in this case we write an element x + iy e jfc(i) = k? as a square. 

x + iy = (u + /v) 2 , x, y, u, v e k , 

and then x 2 + y 2 = (u 2 + v 2 ) 2 . Then to prove k has characteristic 0, we merely 
observe that if the characteristic is > 0, then — 1 is a finite sum 1 + ... + 1, 
whence a square by what we have just shown, but kf = k(i) y so this conciudes 
the proof. 

Corollary 9.3 is due to Artin; see [Ar 24], given at the end of Chapter XI. 
In that chapter, much more will be proved about the field k. 

Example 1. Let k = Q and let Gq = G(Q a /Q). Then the only non-trivial 
torsion elements in Gq have order 2. It follows from Artin’s theory (as given 
in Chapter XI) that all such torsion elements are conjugate in Gq. One uses 
Chapter XI, Theorems 2.2, 2.4, and 2.9.) 
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Example 2. Let k be a field of characteristic not dividing n. Let a e k 9 
a # 0 and let K be the splitting field of X n — a. Let a be one root of 
X n - a 9 and let Ç be a primitive n-t h root of unity. Then 

K = k(a, £) = k(a, §i n ). 

We assume the reader is acquainted with matrices over a commutative ring. Let 
a e G K/k . Then ( crá) n = a, so there exists some integer b = b(&) uniquely 
determined mod n, such that 

cr(a) = atf^. 

Since crinduces an automorphism of the cyclic group ix^, there exists an integer 
d(cr) relatively prime to n and uniquely determined mod n such that a(Ç) = 
£^ (<7) . Let G(n) be the subgroup of GL 2 (Z/nZ) consisting of all matrices 

M = ^ with b e Z/nZ and d E (Z/nZ)*. 

Observe that #G(n) = n<p(n). We obtain an injective map 

j°J ° fc “ 

which is immediately verified to be an injective homomorphism. The question 
arises, when is it an isomorphism? The next theorem gives an answer over some 
fields, applicable especially to the rational numbers. 

Theorem 9.4. Let kbe a field. Let n be an odd positive integer prime to the 
characteristic , and assume that [^(ix^): k] = <p(n). Let a e k, and suppose that 
for each prime p\n the element a is not a p-th power in k. Let K be the splitting 
field ofX n — a over k. Then the above homomorphism a i—> M(a) is an 
isomorphism of G K/k with G(n). The commutator group is Gaì(K/k(\k n )), so 
/cíix^) is the maximal abelian subextension of K. 

Proof. This is a special case of the general theory of § 11, and Exercise 39, 
taking into account the representation of G K/k in the group of matrices. One need 
only use the fact that the order of G K/k is n<p(n), according to that exercise, and 
so #(G K/k ) = #G(n), so G K/k = G(n). However, we shall given an independent 
proof as an example of techniques of Galois theory. We prove the theorem by 
induction. 

Suppose first n = p is prime. Since [^(ix^) : k\ = p - 1 is prime to p, it 
follows that if a is a root of X p - a, then k(a) fl k(\x p ) = k because 
[k(a) :k\= p. Hence [K : k\ = p(p - 1), so G K/k = G(p). 

A direct computation of a commutator of elements in G(n) for arbitrary n 
shows that the commutator subgroup is contained in the group of matrices 

(‘ ?),0eZ/„Z, 
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and so must be that subgroup because its factor group is isomorphic to (Z/nZ)* 
under the projection on the diagonal. This proves the theorem when n = p. 

Now \ztp\n and write n = pm. Then [A:(|x m ) : k\ = <p(m), immediately from 
the hypothesis that [^(ix^) : k\ = <p(n). Let a be a root of X n - a , and let 
P = a p . Then (3 is a root of X m — a, and by induction we can apply the theorem 
to X m — a. The fìeld diagram is as follows. 



Since a has degree pm over k , it follows that a cannot have lower degree than 
p over k(P), so [k(a) : k((3)\ = p and X p - (3 is irreducible over k((3). We apply 
the fìrst part of the proof to X p - (3 over k((3). The property concerning the 
maximal abelian subextension of the splitting field shows that 

k(a) fl k((3, |i w ) = k((3). 

Hence [ k(a , |xj : k((3, [i n )\ = p. By induction, [k((3, pi n ) : /:(|x w )] = m, again 
because of the maximal abelian subextension of the splitting field of X m - a 
over k. This proves that [K: k\ = n<p(n), whence G K/k = G(n ), and the commutator 
statement has already been proved. This concludes the proof of Theorem 9.4. 

Remarks. When n is even, there are some complications, because for 
instance Q(V2) is contained in Q(|x 8 ), so there are dependence relations among 
the fields in question. The non-abelian extensions, as in Theorem 9.4, are of 
intrinsic interest because they constitute the first examples of such extensions 
that come to mind, but they arose in other important contexts. For instance, 
Artin used them to give a probabilistic model for the density of primes p such 
that 2 (say) is a primitive root mod p (that is, 2 generates the cyclic group 
(Z/pZ)*. Instead of 2 he took any non-square integer ± 1. At first, Artin did 
not realize explicitly the above type of dependence, and so came to an answer 
that was off by some factor in some cases. Lehmer discovered the discrepancy 
by computations. As Artin then said, one has to multiply by the “obvious” factor 
which reflects the field dependencies. Artin never published his conjecture, but 
the matter is discussed in detail by Lang-Tate in the introduction to his collected 
papers (Addison-Wesley, Springer Verlag). 

Similar conjectural probabilistic models were constructed by Lang-Trotter in 
connection with elliptic curves, and more generally with certain p- adic repre- 
sentations of the Galois group, in “Primitive points on elliptic curves”, Bull. 
AMS 83 No. 2 (1977), pp. 289-292; and [LaT 75] (end of §14). 

For further comments on the p- adic representations of Galois groups, see §14 
and §15. 
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§10. GALOIS COHOMOLOGY 

Let G be a group and A an abelian group which we write additively for the 
general remarks which we make, preceding our theorems. Let us assume that 
G operates on A, by means of a homomorphism G -► Aut(>l). By a 1-cocycle of 
G in A one means a family of elements {a a } a e G with a a e A, satisfying the relations 


a a -f aa x = a ax 

for all o, x e G. If {oc a } aeG an d {PoÌozg are 1-cocycles, then we can add them to 
get a 1-cocycle {a a -f P a } aeG - It is then clear that 1-cocycles form a group, 
denoted by Z\G, A). By a 1-coboundary of G in A one means a family of ele- 
ments such that there exists an element jSe A for which a a = op — f 

for all o e G. It is then clear that a 1-coboundary is a 1-cocycle, and that the 
1-coboundaries form a group, denoted by B l (G, A). The factor group 

Z\G, A)/B\G, A) 

is called the first cohomology group of G in A and is denoted by H l (G, A). 
Remarks. Suppose G is cyclic. Let 

Tr G : A A be the homomorphism a i-> 2 cr(a). 

cr<=G 


Let y be a generator of G. Let (1 - y)A be the subgroup of A consisting of all 
elements a - y(a) with a e A. Then (1 - y)A is contained in ker Tr G . The 
reader will verify as an exercise that there is an isomorphism 

ker Tr G /(l - y)A « H\G, A). 

Then the next theorem for a cyclic group is just Hilbert’s Theorem 90 of §6. 
Cf. also the cohomology of groups, Chapter XX, Exercise 4, for an even more 
general context. 

Theorem 10.1. Let K/k be a finite Galois extension with Galois group G. 
Then for the operation of G on X* we have H\G , X*) = 1 ,and for the 
operation of G on the additive group of X we have H l (G , X) = 0. In other 
words , the first cohomology group is trivial in both cases. 

Proof. Let {a a } aeG be a 1-cocycle of G in X*. The multiplicative cocycle 
relation reads 
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By the linear independence of characters, there exists 6 e K such that the element 

0 = Z «t T ( 0 ) 

t e G 


is # 0. Then 


Op = Z = Z a <rra„ lffT ( 0 ) 

reG reG 


= «; 1 Z a <TT<n(û) = a n l P- 

r e G 


We get a, = P/op, and using /T 1 instead of /? gives what we want. 

For the additive part of the theorem, we find an element 6 e K such that the 
trace Tr(0) is not equal to 0. Given a 1-cocycle {a^} in the additive group of K , 
we let 


P = 


1 

TW 


Z a t T ( 0 )- 

re G 


It follows at once that a ff = /? - crjS, as desired. 

The next lemma will be applied to the non-abelian Kummer theory of the 
next section. 


Lemma 10.2. (Sah). Let G be a group and let E be a G-module. Letxbein 
the center of G. Then H l (G , E) is annihilated by the map x i—► rx — x on E. 
In particular , if this map is an automorphism of E , then H l (G , £) = 0. 

Proo/. Let / be a 1-cocycle of G in E. Then 

/(<*) ^/(tíjt" 1 ) =/(t) + T(/(aT _1 ) 

= /(t) + t[/(ct) + (j/(t _1 )]. 


Therefore 


t/((J) — f(ó) = -<Jt/(t ^) -/(t). 
But/(1) =/(l) +/(1) implies/(1) = 0, and 

0 =/(l) =/(tt _1 ) =/(t) + t/(t _1 ). 


This shows that (t — l)/(cr) = (<r — 1)/(t), so/is a coboundary. This proves 
the lemma. 
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§11. IMOIM-ABELIAN KUMMER EXTENSIONS 

We are interested in the splitting fields of equations X n — a = 0 when the 
n-th roots of unity are not contained in the ground field. More generally, we 
want to know roughly (or as precisely as possible) the Galois group of simul- 
taneous equations of this type. For this purpose, we axiomatize the pattern 
of proof to an additive notation, which in fact makes it easier to see what is 
going on. 

We fix an integer N > 1, and we let M range over positive integers divid- 
ing N. We let P be the set of primes dividing N. We let G be a group, and let: 

A = G-module such that the isotropy group of any element of A is of finite 
index in G. We also assume that A is divisible by the primes p\N, 
that is 


pA = A for all pe P. 

T = finitely generated subgroup of A such that T is pointwise fixed by G. 
We assume that A N is finite. Then -J- T is also finitely generated. Note that 


Example. For our purposes here, the above situation summarizes the 
properties which hold in the following situation. Let K be a finitely generated 
field over the rational numbers, or even a finite extension of the rational numbers. 
We let A be the multiplicative group of the algebraic closure K a . We let G = G K 
be the Galois group Gal(K a /.K). We let T be a fìnitely generated subgroup of 
the multiplicative group K*. Then all the above properties are satisfied. We 

see that A N = \i N is the group of /V-th roots of unity. The group written ^ T 

in additive notation is written T 1/N in multiplicative notation. 

Next we define the appropriate groups analogous to the Galois groups of 
Kummer theory, as follows. For any G-submodule B of A, we let: 

G(B) = image of G in Aut (B), 

G(N) = G(A n ) = image of G in Aut(/1 N ), 

H(N) = subgroup of G leaving A N pointwise fixed, 


H r (M, N) (for M\N) = image of H(N) in Aut/^ f\ 
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Then we have an exact sequence: 

r + a n ^J -* G(N) -* o. 

Example. In the concrete case mentioned above, the reader will easily 
recognize these various groups as Galois groups. For instance, let A be the 
multiplicative group. Then we have the following lattice of field extensions 
with corresponding Galois groups: 


0 -► H r (M, N)-+G 


i 

\M 


G(r 1/M Hiv) 


k(h n , r 1/M )| 
K(fi N ) \ 


#r(M, N) 
G(N) 


In applications, we want to know how much degeneracy there is when we trans- 
late K(\í Mì T 1/M ) over K(\i N ) with M\N. This is the reason we play with the 
pair M, N rather than a single N. 

Let us return to a general Rummer representation as above. We are in- 
terested especially in that part of (Z/NZ)* contained in G(N\ namely the group 
of integers n (mod N) such that there is an element [n] in G(N) such that 

[rî]a = na for all a e Á N . 

Such elements are always contained in the center of G(N\ and are called 

homotheties. 

Write 


n = n p n(p) 

Let S be a subset of P. We want to make some non-degeneracy assumptions 
about G(N). We call S the special set. 

There is a product decomposition 

(Z/NZ)* = n (Z/p n{p) Z)*. 

p\N 

If 21 N we suppose that 2 e S. For each peS we suppose that there is an integer 
c(p) = p f(p) with f(p) ^ 1 such that 

g ( a n >=> n^, P , x n(z/p" (p, z)*, 

peS piS 

where U c(p) is the subgroup of Z(p" (p) ) consisting of those elements = 1 mod c(p). 



306 GALOIS THEORY 


VI, §11 


The product decomposition on the right is relative to the direct sum decom- 
position 

A N = A p n(p). 
p\N 

The above assumption will be called the non-degeneracy assumption. The 
integers c(p) measure the extent to which G(A N ) is degenerate. 

Under this assumption, we observe that 

[2] e G(A m ) if M | N and M is not divisible by primes of 5; 

[1 + c] e G(A m ) if M\N and M is divisible only by primes of S, 

where 


c = c(s) = n c(p )• 

peS 

We can then use [2] - [1] = [1] and [1 + c] — [1] = [c] in the context of 
Lemma 10.2, since [1] and [c] are in the center of G. 

For any M we define 


c(M) = n c ( p ). 


P\M 
peS 


Defìne 

and the exponent 

c(ryr) = smallest positive integer e such that cT' cz T. 

It is clear that degeneracy in the Galois group H r (M , N) defined above can 
arise from lots of roots of unity in the ground field, or at least degeneracy in 
the Galois group of roots of unity; and also if we look at an equation 

X M - a = 0, 

from the fact that a is already highly divisible in K . This second degeneracy 
would arise from the exponent c(ryr), as can be seen by looking at the Galois 
group of the divisions of T. The next theorem shows that these are the only 
sources of degeneracy. 

We have the abelian Kummer pairing for M \ N , 

H r (M , N) x T/MT -► A m given by (t, x) ► xy — y, 
where y is any element such that My = x. The value of the pairing is indepen- 
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dent of the choice of y. Thus for x e T, we have a homomorphism 

<P X : H r(M , N) -* A m 

such that 

(p x ( t) = xy — y, where My = x. 

Theorem 11.1. LetM\N. Let cp be the homomorphism 
(p:T^> Hom (H r (M, N), A M ) 

í/nd /cí be its hernel. Let e M (V) = g.c.d. (c(ryr), M). Under the non- 
degeneracy assumption , we have 

«Mìednr. c Àf r. 

Proí?/. Let x g T and suppose = 0. Let My = x. For <r g G let 

= °y - y. 

Then {y CT } is a 1-cocycle of G in Á M , and by the hypothesis that (/) x = 0, this 
cocycle depends only on the class of a modulo the subgroup of G leaving the 
elements of A N fixed. In other words, we may view {y a } as a cocycle of G(N) in 
A m . Let c = c(N). By Lemma 10.2, it follows that { cy a } splits as a cocycle of 
G(N) in A m . In other words, there exists t 0 e A M such that 

cy a = °t 0 — t 0i 

and this equation in fact holds for cr g G. Let t be such that ct = t 0 . Then 

coy - cy = oct - cy , 

whence c(y — t) is fixed by all o e G, and therefore lies in ~ T. Therefore 


e(r/T)c(y-t)er. 

We multiply both sides by M and observe that cM(y — t) = cMy = cx. This 
shows that 


c(/vHryr)r^ c mt. 

Since T/MT has exponent M, we may replace c(ryr) by the greatest common 
divisor as stated in the theorem, and we can replace c(N) by c(M) to conclude 
the proof. 

Corollary 11.2. Assume that M is prime to 2(T': T) and is not divisible by 
any primes of the special set S. Then we have an injection 


(p : T/MT Hom(//p(M, /V), A M ). 
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Ifin addition T is free mth basis a r }, and we let (p ( = (p a ., then the map 

H r (M , N) -> >455 tfïuen by z -► ((pfz\ . .., <p r (i)) 

is injective. If A M is cyclic of order M, this map is an isomorphism. 

Proof Under the hypotheses of the corollary, we have c(M) = 1 and 
c M (T) = 1 in the theorem. 

Example. Consider the case of Galois theory when A is the multiplicative 
group of K a . Let a u ... 9 a r be elements of K* which are multiplicatively inde- 
pendent. They generate a group as in the corollary. Furthermore, A M — \i M 
is cyclic, so the corollary applies. If M is prime to 2(P : T) and is not divisible 
by any primes of the special set 5, we have an isomorphism 

(p: r/MT - Hom (H r (M, N\ \i M ). 


§12. ALGEBRAIC INDEPENDENCE OF 
HOMOMORPHISMS 

Let A be an additive group, and let K be a fìeld. Let k u ..., X n : A -► K be 
additive homomorphisms. We shall say that k u ..., X n are algebraically 
dependent (over K) if there exists a polynomial f(X l9 ...,X n ) in 
K \_X l9 . . . , Xn\ such that for all x e A we have 

/(Aj(x),..., X n (x)) = 0, 

but such that / does not induce the zero function on K (n \ i.e. on the direct 
product of K with itself n times. We know that with each polynomial we can 
associate a unique reduced polynomial giving the same function. If K is 
infinite, the reduced polynomial is equal to / itself. In our definition of de- 
pendence, we could as well assume that / is reduced. 

A polynomial f(X u ..., X n ) will be called additive if it induces an additive 
homomorphism of K {n) into K. Let (y) = (Y u ..., Y n ) be variables inde- 
pendent from (A"). Let 

g(X, Y)=f(X+ Y )-f(X)-f(Y) 

where X -f Y is the componentwise vector addition. Then the total degree of 
g viewed as a polynomial in (X) with coefficients in X[T] is strictly less than 
the total degree of f and similarly, its degree in each X t is strictly less than the 
degree of/ in each X u One sees this easily by considering the difference of 
monomials, 
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M (V) (X + Y) - M {V) (X) - M iv) (Y) 

= (X x + y i r---(X w + YJ"-X\'---Xï- YÏ---YÏ. 

A similar assertion holds for g vievved as a polynomial in(Y) with coefficients in 
K[X]. 

If/ is reduced, it follows that g is reduced. Hence if/ is additive, it follows 
that g is the zero polynomial. 

Example. Let K have characteristic p. Then in one variable, the map 


for a e K and m ^ 1 is additive, and given by the additive polynomial aX pm . 
We shall see later that this is a typical example. 

Theorem 12.1. (Artin). Let X Í9 ..., : A -► K be additive homomorph- 

isms of an additive group into a field. If these homomorphisms are alge- 
braicaìly dependent over K , then there exists an additive polynomial 

f(X l9 ...,XJ* 0 

in K[X] such that 

..., A„(x)) = 0 


for all x e A. 

Proof. Let f(X)—f(X ì9 ... 9 X r )eK[X'\ be a reduced polynomial of 
lowest possible degree such that / # 0 but for all xeA,f( A(x)) = 0, where 
A(x) is the vector (A/x),..., A„(x)). We shall prove that/is additive. 

Let g(X 9 Y) =f(X + 7) - f(X) - f(Y). Then 

g( A(x), A(y)) =f(A(x + y)) — f(A(x)) -f(A(y)) = 0 

for all x, y e A. We shall prove that g induces the zero function on K {n) x K {n) . 
Assume otherwise. We have two cases. 

Case 1. We have g(Ç 9 A(y)) = 0 for all ÇeK {n) and all yeA. By 
hypothesis, there exists Ç e K {n) such that #(£', Y) is not identically 0. Let 
P(Y) = g(Ç\ Y). Since the degree of g in(Y) is strictly smaller than the degree 
of /, we have a contradiction. 

Case 2. There exist Ç e K {n) and y' e A such that g(Ç\ A(y')) # 0. Let 
P(X) = g(X, A(y')). Then P is not the zero polynomial, but P(A(x)) = 0 for all 
x e A, again a contradiction. 
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We conclude that g induces the zero function on K {n) x K (n) , which proves 
what we wanted, namely that/is additive. 

We now consider additive polynomials more closely. 

Let/be an additive polynomial in n variables over K , and assume that/is 
reduced. Let 


fi(Xi) =/( 0 ,..., Xi ,..., 0 ) 

with X t in the i-th place, and zeros in the other components. By additivity, it 
follows that 


because the difference of the right-hand side and left-hand side is a reduced 
polynomial taking the value 0 on K (n) . Furthermore, each/ is an additive 
polynomial in one variable. We now study such polynomials. 

Let f(X) be a reduced polynomial in one variable, which induces a linear 
map of K into itself. Suppose that there occurs a monomial a r X r in /with 
coefficient a r ^ 0. Then the monomials of degree r in 

g(X 9 Y) =f(X + Y)-f(X)-f(Y) 


are given by 


a r (X + YJ — a r X r — a r Y r . 

We have already seen that g is identically 0. Hence the above expression is 
identically 0. Hence the polynomial 

(X + Y) r - X r - Y r 

is the zero polynomial. It contains the term rX r ~ 1 Y. Hence if r > 1, our field 
must have characteristic p and r is divisible by p. Write r = p m s where s is 
prime to p. Then 

o = (x + yj - x r - Y r = (x pm + Y pm y - (x pm y - (Y pm y. 

Arguing as before, we conclude that s = 1. 

Hence if/is an additive polynomial in one variable, we have 

m 

f(X)= Ia v ^ v , 

v = 0 

with a v e K. In characteristic 0, the only additive polynomials in one variable 
are of type aX with ae K. 

As expected, we define X l9 ..., A n to be algebraically independent if, whenever 
/is a reduced polynomial such that f(A(x)) = 0 for all xeK, then/is the zero 
polynomial. 
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We shall apply Theorem 12.1 to the case when À l9 ...,À n are automorphisms 
of a field, and combine Theorem 12.1 with the theorem on the linear indepen- 
dence of characters. 

Theorem 12.2. Let K be an infinite field , and let a l9 ..., a n be the distinct 
elements of a finite group of automorphisms of K. Then o l9 ..., o n are alge- 
braically independent over K. 

Proof. (Artin). In characteristic 0, Theorem 12.1 and the linear inde- 
pendence of characters show that our assertion is true. Let the characteristic 
be p > 0, and assume that c í9 ... 9 o n are algebraically dependent. 

There exists an additive polynomial /( X l9 ... 9 X n ) in K[X ] which is 
reduced, f # 0, and such that 

/(ffiW, • • •, = 0 

for all xe K. By what we saw above, we can write this relation in the form 

Z Z a irVi( X ) Pr = 0 

i= 1 r = 1 

for all xe K, and with not all coefficients a ir equal to 0. Therefore by the linear 
independence of characters, the automorphisms 

{af} with i=l, ... 9 n and r = 1,..., m 

cannot be all distinct. Hence we have 


with either i ^ j or r # s. Say r ^ s. For all x e K we have 


Oi(xr = ofxY. 


Extracting p-th roots in characteristic p is unique. Hence 
Oi(x) = ofxY^ = Vj(x ps -) 
for all x e K. Let o = ojfi^. Then 

o(x) = x pS 

for all xe K. Taking o n = id shows that 

x = x pMs ~ r) 

for all x e K. Since K is infinite, this can hold only if s = r. But in that case, 
o t = o j9 contradicting the fact that we started with distinct automorphisms. 
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§13. THE NORMAL BASIS THEOREM 

Theorem 13.1. Let K/k be a finite Galois extension oj degree n. Let o u ...,o n 
be the elements of the Galois group G. Then there exists an element w e K 
such that o ... 9 o n w form a basis of K over k. 

Proof We prove this here only when k is infinite. The case when k is 
finite can be proved later by methods of linear algebra, as an exercise. 

For each oeG , let X a be a variable, and let t ax = X a -i x . Let X t = X ar Let 

f(X u ..., X n ) = det(í a . ^). 

Then/is not identically 0, as one sees by substituting 1 for X id and 0 for X a if 
o # id. Since k is infinite,/is reduced. Hence the determinant will not be 0 for 
all x e K if we substitute ofx) for X ( in/. Hence there exists we K such that 

àet(or l Oj{w)) ^ 0 . 

Suppose a ì9 ...,a n ek are such that 


tfi<Ti(w) -f • • ■ + a n o n (w) = 0. 


Apply o/~ 1 to this relation for each i = 1,...,«. Since a f e k we get a system of 
linear equations, regarding the a f as unknowns. Since the determinant of the 
coefficients is / 0, it follows that 

aj = 0 for j = 1,..., n 
and hence that w is the desired element. 

Remark. In terms of representations as in Chapters III and XVIII, the 
normal basis theorem says that the representation of the Galois group on the 
additive group of the field is the regular representation. One may also say that 
K is free of dimension 1 over the group ring k[G]. Such a result may be viewed 
as the first step in much more subtle investigations having to do with algebraic 
number theory. Let K be a number field (finite extension of Q) and let o* be 
its ring of algebraic integers, which will be defined in Chapter VII, §1. Then 
one may ask for a description of as a Z[G] module, which is a much more 
difficult problem. For fundamental work about this problem, see A. Frôhlich, 
Galois Module Structures of Algebraic Integers , Ergebnisse der Math. 3 Folge 
Vol. 1, Springer Verlag (1983). See also the reference [CCFT 91] given at the 
end of Chapter III, §1. 
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§14. INFINITE GALOIS EXTENSIONS 

Although we have already given some of the basic theorems of Galois theory 
already for possibly infinite extensions, the non-finiteness did not really appear 
in a substantial way. We now want to discuss its role more extensively. 

Let K/k be a Galois extension with group G. For each finite Galois subex- 
tension F, we have the Galois groups G K/F and G F/k . Put H = G K/F . 
Then H has finite index, equal to #(G F/ÌC ) = [F : k\. This just comes as a special 
case of the general Galois theory. We have a canonical homomorphism 

G G/H = G f/k . 

Therefore by the universal property of the inverse limit, we obtain a 
homomorphism 

G lim G/H, 

where the limit is taken for H in the family of Galois groups G K/F as above. 

Theorem 14.1. The homomorphism G —> lim G/H is an isomorphism. 

Proof. First the kernel is trivial, because if o-is in the kernel, then orestricted 
to every finite subextension of K is trivial, and so is trivial on K. Recall that an 
element of the inverse limit is a family {a H } with cr H e G/H , satisfying a certain 
compatibility condition. This compatibility condition means that we may define 
an element a of G as follows. Let a e K. Then a is contained in some finite 
Galois extension F C K. Let H = Gal (K/F). Let aa = (r H a. The compatibility 
condition means that cr H a is independent of the choice of F. Then it is immediately 
verified that cr is an automorphism of K over k , which maps to each a H in the 
canonical map of G into G/H. Hence the map G —> lim G/H is surjective, thereby 
proving the theorem. 

Remark. For the topological interpretation, see Chapter I, Theorem 10.1, 
and Exercise 43. 

Example. Let \k[p*] be the union of all groups of roots of unity | \i[p n ], 
where p is a prime and n = 1,2,... ranges over the positive integers. Let 
K = Qíixtp 00 ]). Then K is an abelian infinite extension of Q. Let Z p be the ring 
of p-adic integers, and Z* the group of units. From §3, we know that (Z/p^Z)* 
is isomorphic to GaHQíixIp^l/Q)). These isomorphisms are compatible in the 
tower of p- th roots of unity, so we obtain an isomorphism 


Z*-^ Gal(Q(p,[p 0C ]/Q)). 
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Towers of cyclotomic fields have been extensively studied by Iwasawa. Cf. 
a systematic exposition and bibliography in [La 90]. 

For other types of representations in a group GL 2 (Z p ), see Serre [Se 68], 
[Se 72], Shimura [Shi 71], and Lang-Trotter [LaT 75]. One general framework 
in which the representation of Galois groups on roots of unity can be seen has 
to do with commutative algebraic groups, starting with elliptic curves. Specif- 
ically, consider an equation 

y 2 = 4x 3 - g 2 x - g 3 

with g 2 , g 3 e Q and non-zero discriminant: A = g 2 - 21g\ + 0. The set of 
solutions together with a point at infinity is denoted by E. From complex analysis 
(or by purely algebraic means), one sees that if K is an extension of Q, then the 
set of solutions E(K) with x, y e K and 00 form a group, called the group of 
rational points of E in K. One is interested in the torsion group, say E( Q a ) tor of 
points in the algebraic closure, or for a given prime p , in the group £(Q a )[p r ] 
and E^Q^lp 00 ]. As an abelian group, there is an isomorphism 

E(Q*)[p r ] « (Z/p r Z) x (Z/p r Z), 

so the Galois group operates on the points of order p r via a representation in 
GL 2 (Z/p r Z), rather than GL { (Z/p r Z) = (Z/p r Z )* in the case of roots of unity. 
Passing to the inverse limit, one obtains a representation of Gal(Q a /Q) = Gq 
in GL 2 (Z p ). One of Serre’s theorems is that the image of Gq in GL 2 (Z p ) is a 
subgroup of finite index, equal to GL 2 (Z p ) for all but a finite number of primes 
p , if End C (E) = Z. 

More generally, using freely the language of algebraic geometry, when A is 
a commutative algebraic group, say with coefficients in Q, then one may consider 
its group of points A(Q a ) tor , and the representation of Gq in a similar way. 
Developing the notions to deal with these situations leads into algebraic geometry. 

Instead of considering cyclotomic extensions of a ground field, one may also 
consider extensions of cyclotomic fields. The following conjecture is due to 
Shafarevich. See the references at the end of §7. 

Conjecture 14.2. Let k 0 = Q(pi) be the compositum ofall cyclotomic exten - 
sions of Q in a given algebraic closure Q a . Let k be a finite extension of k 0 . 
Let G k = Gal(Q a /k). Then G k is isomorphic to the completion ofa free group 
on countably many generators. 

If G is the free group, then we recall that the completion is the inverse limit 
lim G/H , taken over all normal subgroups H of finite index. Readers should 
view this conjecture as being in analogy to the situation with Riemann surfaces, 
as mentioned in Example 9 of §2. It would be interesting to investigate the extent 
to which the conjecture remains valid if Q(|Ji) is replaced by Q(A(Q a ) tor ), where 
A is an elliptic curve. For some results about free groups occurring as Galois 
groups, see also Wingberg [Wi 91]. 
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§15. THE MODULAR CONNECTION 

This final section gives a major connection between Galois theory and the 
theory of modular forms, which has arisen since the 1960s. 

One fundamental question is whether given a finite group G, there exists a 
Galois extension K of Q whose Galois group is G. In Exercise 23 you will prove 
this when G is abelian. 

Already in the nineteenth century, number theorists realized the big difference 
between abelian and non-abelian extensions, and started understanding abelian 
extensions. Kronecker stated and gave what are today considered incomplete 
arguments that every finite abelian extension of Q is contained in some extension 
Q(£), where £ is a root of unity. The difficulty lay in the peculiarities of the 
prime 2. The trouble was fixed by Weber at the end of the nineteenth century. 
Note that the trouble with 2 has been systematic since then. It arose in Artin’s 
conjecture about densities of primitive roots as mentioned in the remarks after 
Theorem 9.4. It arose in the Grunwald theorem of class field theory (corrected 
by Wang, cf. Artin-Tate [ArT 68], Chapter 10). It arose in Shafarevich’s proof 
that given a solvable group, there exists a Galois extension of Q having that 
group as Galois group, mentioned at the end of §7. 

Abelian extensions of a number field F are harder to describe than over the 
rationals, and the fundamental theory giving a description of such extensions is 
called class field theory (see the above reference). I shall give one significant 
example exhibiting the flavor. Let R F be the ring of algebraic integers in F. It 
can be shown that R F is a Dedekind ring. (C/. [La 70], Chapter I, §6, Theorem 
2.) Let P be a prime ideal of R F . Then P fl Z = (p) for some prime number p. 
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Furthermore, R F /P is a finite field with q elements. Let K be a finite Galois 
extension of F. It will be shown in Chapter VII that there exists a prime Q of 
R k such that Q fl R F = P. Furthermore, there exists an element 

¥r Q eG = Ga \(K/F) 
such that Fr q(Q) = Q and for all a e R K we have 

¥r Q a = ofl mod Q. 

We call Fr^ a Frobenius element in the Galois group G associated with Q. (See 
Chapter VII, Theorem 2.9.) Furthermore, for all but a fìnite number of Q, two 
such elements are conjugate to each other in G. We denote any of them by Fr P . 
If G is abelian, then there is only one element Fr^ in the Galois group. 

Theorem 15.1. There exists a unique finite abelian extension K ofF having 
the folloxving property. If P x , P 2 are prime ideals of R F , then 
Fr^j = Fr P2 if and only if there is an element a of K such that aP x = P 2 . 

In a similar but more complicated manner, one can characterize all abelian 
extensions of F. This theory is known as class field theory, developed by Kro- 
necker, Weber, Hilbert, Takagi, and Artin. The main statement concerning the 
Frobenius automorphism as above is Artin’s Reciprocity Law. Artin-Tate’s notes 
give a cohomological account of class field theory. My Algebraic Number Theory 
gives an account following Artin’s first proof dating back to 1927, with later 
simplifications by Artin himself. Both techniques are valuable to know. 

Cyclotomicextensions should be viewed in the lightof Theorem 15.1. Indeed, 
let K = Q(£), where £ is a primitive n-th root of unity. For a prime p\n, we 
have the Frobenius automorphism ¥r p , whose effect on £ is Fr^) = Then 

Fr pi = Fr^ if and only if p x = p 2 mod n. 

To encompass both Theorem 15.1 and the cyclotomic case in one framework, 
one has to formulate the result of class field theory for generalized ideal classes, 
not just the ordinary ones when two ideals are equivalent if and only if they 
differ multiplicatively by a non-zero field element. See my Algebraic Number 
Theory for a description of these generalized ideal classes. 

The non-abelian case is much more difficult. I shall indicate briefly a special 
case which gives some of the flavor of what goes on. The problem is to do for 
non-abelian extensions what Artin did for abelian extensions. Artin went as far 
as saying that the problem was not to give proofs but to formulate what was to 
be proved. The insight of Langlands and others in the sixties shows that actually 
Artin was mistaken. The problem lies in both. Shimura made several computations 
in this direction involving ‘ l modular forms” [Sh 66]. Langlands gave a number 
of conjectures relating Galois groups with “automorphic forms”, which showed 
that the answer lay in deeper theories, whose formulations, let alone their proofs, 
were difficult. Great progress was made in the seventies by Serre and Deligne, 
who proved a first case of Langland’s conjecture [DeS 74]. 
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The study of non-abelian Galois groups occurs via their linear “representa- 
tions”. For instance, let / be a prime number. We can ask whether GL n ( F z ), or 
GL 2 (F/), or PGL 2 (F/) occurs as a Galois group over Q, and “how”. The problem 
is to find natural objects on which the Galois group operates as a linear map, 
such that we get in a natural way an isomorphism of this Galois group with one 
of the above linear groups. The theories which indicate in which direction to 
find such objects are much beyond the level of this course, and lie in the theory 
of modular functions, involving both analysis and algebra, which form a back- 
ground for the number theoretic applications. Again I pick a special case to give 
the flavor. 

Let K be a finite Galois extension of Q, with Galois group 

G = Gal(ÂVQ). 


Let 

p\ G GL 2 (F/) 

be a homomorphism of G into the group of 2 x 2 matrices over the finite field 
F/ for some prime /. Such a homomorphism is called a representation of G. 
From elementary linear algebra, if 



is a 2 x 2 matrix, we have its trace and determinant defined by 

tr(M) = a + d and det M = ad — bc. 

Thus we can take the trace and determinant tr p(&) and det p(cr) for cr e G. 
Consider the infinite product with a variable q : 

oc oc 

A (q) = ?ri(l - q n ) 24 = S a n q*. 

«=1 n= 1 

The coefficients a n are integers, and a x = 1. 

Theorem 15.2. For each prime l there exists a unique Galois extension K of 
Q, xvith Galois group G, and an injective homomorphism 

p: G -» GL 2 ( F/) 

having the folloxving property. For all but a finite number ofprimes /?, if a p is 
the coefficient of q p in A (q), then we have 

tr p(Fv p ) = a p mod / and det p(Fv p ) = p n mod /. 

Furthermore, for all primes 1^2,3, 5, 7, 23, 691, the image p(G) in GL 2 (F/) 
consists of those matrices M e GL 2 (F/) such that det M is an eleventh power 
in Ff. 
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The above theorem was conjectured by Serre in 1968 [Se 68]. A proof of 
the existence as in the first statement was given by Deligne [De 68]. The second 
statement, describing how big the Galois group actually is in the group of matrices 
GL 2 (F/) is due to Serre and Swinnerton-Dyer [Se 72], [SwD 73]. 

The point of A(g) is that if we put q = e 2mz , where z is a variable in the 
upper half-plane, then A is a modular form of weight 12. For defìnitions and an 
introduction, see the last chapter of [Se 73], [La 73], [La 76], and the following 
comments. The general result behind Theorem 15.2 for modular forms of weight 
^ 2 was given by Deligne [De 73]. For weight 1, it is due to Deligne-Serre 
[DeS 74]. We summarize the situation as follows. 

Let iVbea positive integer. To N we associate the subgroups 

m) c rm c r 0 (N) 

of SL 2 ( Z) defined by the conditions for a matrix a = ^ e SL 2 ( Z): 

a e T (N) if and only if a = d = 1 mod N and b = c = 0 mod N\ 
a e r x (N) if and only if a = d = 1 mod N and c = 0 mod N\ 
a e T 0 (N) if and only if c = 0 mod N. 

Let/be a function on the upper half-plane $ = {z e C, Im(z) > 0}. Let k 
be an integer. For 

r = (“ ‘) e SLrfR), 

define/° [y] k (an operation on the right) by 

/° [y]*(z) = (CZ + dy k f(yz) where yz = — / - 

cz -r a 

Let T be a subgroup of SL 2 ( Z) containing T(N). We defìne /to be modular of 
weight k on T if: 

M k 1. /is holomorphic on íp; 

M k l.f is holomorphic at the cusps, meaning that for all a e SL 2 ( Z), the 
function /° [a] k has a power series expansion 

00 

/° [a]*(z) = 2 

n =0 

M k 3. We have/° [y]^ = /for all y e T. 

One says that/is cuspidal if in M k 2 the power series has a zero; that is, the 
power starts with n ^ 1. 
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Suppose that/is modular of weight k on T(N). Then/is modular on T X (N) 
if and only if f(z + 1) = /(z), or equivalently /has an expansion of the form 

oc 

/(z) = Mq z ) = 2 a n q n where q = q 2 = e 2mz . 

n=0 

This power series is called the <?-expansion of /. 

Suppose / has weight k on r,(A0. If y e T 0 (N) and y is the above written 
matrix, then /° [y] k depends only on the image of d in (Z/NZ)*, and we then 
denote/° [y] k by /° [d] k . Let 

(Z/NZ)* C* 

be a homomorphism (also called a Dirichlet character). One says that e is odd 
if e(— 1) = —1, and even if e(— 1) = 1. One says that/is modular of type 
(k, e) on r o (A0 if/has weight k on T X (N), and 

fo [d] k = e(d)f for all d e (Z/NZ)*. 

It is possible to define an algebra of operators on the space of modular forms 
of given type. This requires more extensive background, and I refer the reader 
to [La 76] for a systematic exposition. Among all such forms, it is then possible 
to distinguish some of them which are eigenvectors for this Hecke algebra, or, 
as one says, eigenfunctions for this algebra. One may then state the Deligne- 
Serre theorem as follows. 

Let f =£ 0 be a modular form oftype (1, e) on T 0 (N ), so fhas weight 1. Assume 
that e is odd. Assume that f is an eigenfunction of the Hecke algebra , with q- 
expansion f x = 2 ti n q n , normalized so that a x = 1. Then there exists a unique 
finite Galois extension K of Q with Galois group G, and a representation 
p: G —> GL 2 (C) (actually an injective homomorphism ), such that for all 
primes pXN the characteristic polynomial of p(Fv p ) is 

X 2 ~ a p X + e(p). 

The representation p is irreducible if and only if f is cuspidal. 

Note that the representation p has values in GL 2 (C). For extensive work of Serre 
and his conjectures concerning representations of Galois groups in GL 2 (F) when 
F is a finite field, see [Se 87]. Roughly speaking, the general philosophy started 
by a conjecture of Taniyama-Shimura and the Langlands conjectures is that 
everything in sight is “modular”. Theorem 15.2 and the Deligne-Serre theorem 
are prototypes of results in this direction. For “modular” representations in GL 2 (F), 
when F is a finite field, Serre’s conjectures have been proved, mostly by Ribet 
[Ri 90]. As a result, following an idea of Frey, Ribet also showed how the 
Taniyama-Shimura conjecture implies Fermat’s last theorem [Ri 90b]. Note that 
Serre’s conjectures that certain representations in GL 2 (F) are modular imply the 
Taniyama-Shimura conjecture. 
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EXERCISES 

1. What is the Galois group of the following polynomials? 

(a) X 3 - X - 1 over Q. 

(b) X 3 - 10 over Q. 

(c) X 3 - lOoverQíyï). 

(d) X 3 - 10 over Q(y^3). 

(e) A -3 — X - 1 over Q( v / - 23). 

(f) X 4 - 5 over Q, Q( v '5), Q( V ^5), Q(0- 

(g) X 4 — a where a ìs any integer ^ 0, # + 1 and is square free. Over Q. 
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(h) X 3 — a where a is any square-free integer ^ 2. Over Q. 

(i) X 4 -h 2 over Q, Q(i). 

(j) (X 2 - 2)(X 2 - 3)(X 2 - 5)(X 2 - 1) over Q. 

(k) Let p l9 ..., p n be distinct prime numbers. What is the Galois group of 

(* 2 - Pi) -(X 2 - P»)overQ? _ 

(l) (X 3 - 2)(X 3 - 3)(X 2 - 2) over Q(J-3). 

(m) X n - t , where t is transcendental over the complex numbers C and n is a 
positive integer. Over C(í). 

(n) X 4 - r, where t is as before. Over R (t). 

2. Find the Galois groups over Q of the following polynomials. 

(a) X 3 + X + 1 (b) X 3 - X + ì (g) X 3 + X 2 - 2X - \ 

(c) X 3 + 2X + ì (d) X 3 -2X + \ 

(e) X 3 - X - 1 (f) X 3 - 12X + 8 

3. Let k = C(t) be the field of rational functions in one variable. Find the Galois group 
over k of the following polynomials: 

(a) X 3 + X + t (b) X 3 - X + t 

(c) X 3 + tX + \ (d) X 3 -2 tX + t 

(e) X 3 - X - t (f) X 3 + r 2 X - t 3 

4. Let k be a field of characteristic + 2. Let c E £, c £ k 2 . Let F = k(Vc). Let 
a = a + b Vc with a, b E k and not both a, b = 0. Let E = F(\íâ). Prove that 
the following conditions are equivalent. 

(1) E is Galois over k. 

(2) E = F(\fâ'), where a' = a - b\íc. 

(3) Either aa' = a 2 - cb 2 E k 2 or caa' E k 2 . 

Show that when these conditions are satisfied, then E is cyclic over k of degree 4 if 
and only if caa' E k 2 . 

5. Let k be a field of characteristic + 2, 3. Let /(X), g(X) = X 2 - c be irreducible 
polynomials over k , of degree 3 and 2 respectively. Let D be the discriminant of/. 
Assume that 

[k(D 112 ) : k\ = 2 and k(D m ) + k(c m ). 

Let abea root of /and /3 a root of g in an algebraic closure. Prove: 

(a) The splitting field of fg over k has degree 12. 

(b) Let y = a + /3. Then [k(y) : k] = 6. 

6. (a) Let K be cyclic over k of degree 4, and of characteristic + 2. Let G K/k = (o). 

Let E be the unique subfield of K of degree 2 over k. Since [K : E] = 2, there 
exists a E K such that a 2 = y E E and K = E(a). Prove that there exists 
z E E such that 

zcrz = — 1, cra = za, z 2 = cry/y. 

(b) Conversely, let E be a quadratic extension of k and let G E/k = (t). Let z E E 
be an element such that zîz = - 1. Prove that there exists y E E such that 
z 2 = ry/y. Then E = k(y). Let a 2 = y, and let K = k(a). Show that K is 
Galois, cyclic of degree 4 over k. Let cr be an extension of r to K. Show that 
cr is an automorphism of K which generates G Klk , satisfying a 2 a = -a and 
aa = ±za. Replacing z by -z originally if necessary, one can then have 
aa = za. 
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7. (a) Let K = Q(Vâ) where a E Z, a < 0. Show that K cannot be embedded in a 
cyclic extension whose degree over Q is divisible by 4. 

(b) Let/(X) = X 4 4- 30X 2 + 45. Let abe a root of F. Prove that Q(ct) is cyclic of 
degree 4 over Q. 

(c) Let/(X) = X 4 + 4jc 2 + 2. Prove that/ is irreducible over Q and that the Galois 
group is cyclic. 

Let/(X) = X 4 + aX 2 + b be an irreducible polynomial over Q, with roots + a, ± p, 
and splitting field K. 

(a) Show that Gal(X/Q) is isomorphic to a subgroup of D 8 (the non-abelian group 
of order 8 other than the quaternion group), and thus is isomorphic to one of the 
following: 

(i) Z/4Z (ii) Z/2Z x Z/2Z (iii) D 8 . 

(b) Show that the first case happens if and only if 


Case (ii) happens if and only if E Q or a 2 — fi 2 e Q. Case (iii) happens 
othenvise. (Actually, in (ii), the case a 2 — fi 2 E Q cannot occur. It corresponds 
to a subgroup D s C S 4 which is isomorphic to Z/2Z x Z/2Z, but is not 
transitive on {1, 2, 3, 4}). 

(c) Find the splitting field K in C of the polynomial 

X 4 - 4X 2 - 1. 

Determine the Galois group of this splitting field over Q, and describe fully 
the lattices of subfields and of subgroups of the Galois group. 

9. Let K be a finite separable extension of a field k, of prime degree p. Let 0 e K be 
such that K = k(9), and let 0 U ...,O P be the conjugates of 6 over k in some algebraic 
closure. Let 6 = 0 X . If 0 2 e k(0), show that K is Galois and in fact cyclic over k. 

10. Let/(X) 6 Q[X] be a polynomial of degree n , and let K be a splitting field of/ over Q. 
Suppose that Gal(X/Q) is the symmetric group S n with n > 2. 

(a) Show that/is irreducible over Q. 

(b) If a is a root of f show that the only automorphism of Q(a) is the identity. 

(c) If n > 4, show that a” £ Q. 

11. A polynomial f(X) is said to be reciprocal if whenever a is a root, then 1/a is also a 
root. We suppose that / has coefficients in a real subfield k of the complex numbers. If 
j is irreducible over k, and has a nonreal root of absolute value 1, show that j is 
reciprocal of even degree. 

12. What is the Galois group over the rationals of X 5 — 4X + 2? 

13. What is the Galois group over the rationals of the following polynomials: 

(a) X 4 + 2X 2 + X + 3 

(b) X 4 + 3X 3 - 3X - 2 

(c) X 6 + 22X 5 - 9X 4 + 12X 3 - 37X 2 - 29X - 15 
[Hint: Reduce mod 2, 3, 5.] 

14. Prove that given a symmetric group S n , there exists a polynomial /(X) e Z[X] with 
leading coefficient 1 whose Galois group over Q is S n . [ Hint: Reducing mod 2, 3, 5, 
show that there exists a polynomial whose reductions are such that the Galois group 
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contams enough cycles to generate S n . Use the Chinese remainder theorem, also to 
be able to apply Eisenstein’s criterion.] 

15. Let K/k be a Galois extension, and let F be an intermediate field between k and K. 
Let H be the subgroup of GdX{K/k) mapping F into itself. Show that H is the normal- 
izer of Gd\(K/F) in Gal (K/k). 

16. Let K/k be a finite Galois extension with group G. Let a e K be such that 
{cra } ffeG is a normal basis. For each subset S of G let S(a) = 2 o. e5 o-a. Let H be a 
subgroup of G and let F be the fixed field of H. Show that there exists a basis of F 
over k consisting of elements of the form S(a). 

Cyclotomic fields 

17. (a) Let k be a field of characteristic Jf2n, for some odd integer n ^ 1, and let £ be 

a primitive n-th root of unity, in k. Show that k also contains a primitive 2n-th 
root of unity. 

(b) Let k be a finite extension of the rationals. Show that there is only a finite number 
of roots of unity in k. 

18. (a) Determine which roots of unity lie in the following fields: Q(/), Q(V^2), 

Q(V2), Q(V—3), Q(V3), Q(V—5). 

(b) For which integers m does a primitive m-th root of unity have degree 2 over Q? 

19. Let f be a primitive n-th root of unity. Let K = Q(f). 

(a) If n = p r (r ^ 1) is a prime power, show that N ki q( 1 - £) = p. 

(b) If n is composite (divisible by at least two primes) then N K/Q (\ - £) = 1. 

20. Let f(X) E Z[X] be a non-constant polynomial with integer coefficients. Show that 
the values f(a) with a E Z + are divisible by infinitely many primes. 

[Note : This is trivial. A much deeper question is whether there are infinitely many 
a such that f(a) is prime. There are three necessary conditions: 

The leading coefficient of /is positive. 

The polynomial is irreducible. 

The set of values f(Z + ) has no common divisor > 1. 

A conjecture of Bouniakowski [Bo 1854] states that these conditions are sufficient. 
The conjecture was rediscovered later and generalized to several polynomials by 
Schinzel [Sch 58]. A special case is the conjecture that X 2 + 1 represents infinitely 
many primes. For a discussion of the general conjecture and a quantitative version 
giving a conjectured asymptotic estimate, see Bateman and Horn [BaH 62]. Also see 
the comments in [HaR 74]. More precisely, let/ 1 ,... ,/ r be polynomials with integer 
coefficients satisfying the first two conditions (positive leading coefficient, irre- 
ducible). Let 


be their product, and assume that/satisfies the third condition. Define: 

tt^x) = number of positive integers n ^ x such that//«),... ,/ r (n) are all primes. 

(We ignore the finite number of values of n for which some/(n) is negative.) The 
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Bateman-Hom conjecture is that 

X 

7T (/) W ~<d, ■ ■■ d r )-'C(f) f dt, 

0 

where 

the product being taken over all primes p, and N f {p) is the number of solutions of 
the congruence 

f(n) = 0 mod p. 

Bateman and Hom show that the product converges absolutely. When r = 1 and 
f(n) = an + b with a, b relatively prime integers, a > 0, then one gets Dirichlet’s 
theorem that there are infinitely many primes in an arithmetic progression, together 
with the Dirichlet density of such primes. 

[BaH 62] P. T. Bateman and R. Horn, A heuristic asymptotic formula conceming 
the distribution of prime numbers, Math. Comp. 16 (1962) pp. 363-367 

[Bo 1854] V. Bouniakowsky, Sur les diviseurs numériques invariables des fonc- 
tions rationnelles entières, Mémoires sc. math. et phys. T. VI (1854- 
1855) pp. 307-329 

[HaR 74] H. Halberstam and H.-E. Richert, Sieve methods , Academic Press, 
1974 

[Sch 58] A. Schinzel and W. Sierpinsri, Sur certaines hypothèses concernant 
les nombres premiers, Acta Arith. 4 (1958) pp. 185-208 

21. (a) Let a be a non-zero integer, p a prime, n a positive integer, and p X n. Prove 

that p | <!>„(<z) if and only if a has period n in (Z/pZ)*. 

(b) Again assume pX n Prove that p | 4>„(a) for some a E Z if and only if p = 1 
mod n. Deduce from this that there are infinitely many primes = 1 mod n, a 
special case of Dirichlet’s theorem for the existence of primes in an arithmetic 
progression. 

22. Let F = F p be the prime field of characteristic p. Let K be the field obtained from 
F by adjoining all primitive /-th roots of unity, for all prime numbers l ± p. Prove 
that K is algebraically closed. [Hint: Show that if q is a prime number, and r an 
integer ^ 1, there exists a prime / such that the period of p mod / is q r , by using 
the following old trick of Van der Waerden: Let / be a prime dividing the number 

b = fr = (p^ ' - ìr' +áp qr ~' - 1 y- 2 + - + q . 

P q ~ 1 

If / does not divide p qr ~ ' — 1, we are done. Otherwise, / = q. But in that case q 2 does 
not divide b, and hence there exists a prime / # q such that / divides b. Then the degree 
of F(C Z ) over F is q r , so K contains subfields of arbitrary degree over F.] 

23. (a) Let G be a finite abelian group. Prove that there exists an abelian extension of 

Q whose Galois group is G. 
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(b) Let k be a fìnite extension of Q, and let G be a finite abelian group. Prove that 
there exist infinitely many abelian extensions of k whose Galois group is G. 

24. Prove that there are infinitely many non-zero integers a, b 0 such that 
— 4 a 3 — 27 b 2 is a square in Z. 

25. Let k be a field such that every finite extension is cyclic. Show that there exists an 
automorphism cr of k a over k such that k is the fixed field of cr. 

26. Let Q a be a fixed algebraic closure of Q. Let E be a maximal subfield of Q a not 
containing V2 (such a subfield exists by Zom’s lemma). Show that every finite 
extension of E is cyclic. (Your proof should work taking any algebraic irrational 
number instead of V2.) 

27. Let k be a field, k a an algebraic closure, and cr an automorphism of k a leaving k 
fixed. Let F be the fixed field of cr. Show that every finite extension of F is cyclic. 
(The above two problems are examples of Artin, showing how to dig holes in an 
algebraically closed field.) 

28. Let E be an algebraic extension of k such that every non-constant polynomial f(X) 
in k[Y] has at least one root in E. Provethat E is algebraically closed. [Hint: Discuss 
the separable and purely inseparable cases separately, and use the primitive element 
theorem.] 

29. (a) Let /C be a cyclic extension of a field F, with Galois group G generated by o. Assume 

that the characteristic is p , and that \_K: F] = p m ~ l for some integer m ^ 2. 
Let f be an element of K such that Tr%(f) = 1. Show that there exists an element 
a in K such that 

OCL — OL = P P — f. 

(b) Prove that the polynomial X p — X — a is irreducible in K[X~\. 

(c) If 9 is a root of this polynomial, prove that F(9) is a Galois, cyclic extension of 
degree p m of F, and that its Galois group is generated by an extension o* of o 
such that 

a*(9) = 9 + p. 

30. Let A be an abelian group and let G be a finite cyclic group operating on A [by means 
of a homomorphism G —► Aut(4)]. Let o be a generator of G. We define the trace 
Tr G = Tr on A by Tr(x) = £ zx. Let A Tr denote the kernel of the trace, and let 

zeG 

(1 — o)A denote the subgroup of A consisting of all elements of type y — oy. Show that 

H l (G,A)*A Tr /(l -°)A. 

31. Let F be a finite field and K a finite extension of F. Show that the norm N* and the 
trace Tr£ are surjective (as maps from K into F). 

32. LetFbeafìniteseparableextensionof/c,ofdegreen. Let W = (vv,,...,w„)beelements 

of E. Let o n be the distinct embeddings of E m k a over k. Define the dis- 

criminant of W to be 

D m (W) = det(c t^j) 2 . 

Prove: 

(a) If V = (v lf ..., v n ) is another set of elements of E and C = (c,-,) is a matrix 
of elements of k such that vv, = then 

D m (W) = det (C) 2 D m (V). 
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(b) The discriminant is an element of k. 

(c) Let E = k( a) and let f(X) = Irr(a, k, X). Let otj,..., a„ be the roots of/and 
say a = aj. Then 

/'(«) = fl (a ~ «,)■ 

j= 2 

Show that 

D m ( 1, a"" 1 ) = (- 1)" ( "- ,)/2 Nf(/'(«)). 

(d) Let the notation be as in (a). Show that det(Tr(w,W;)) = (det((7, w,)) 2 . [Hint: 
Let A be the matrix (cr, Wj). Show that 'AA is the matrix (Tr(w, vv ; )).] 


Rational functions 

33. Let K = C(x) where x is transcendental over C, and let Ç be a primitive cube root of 
unity in C. Let o be the automorphism of K over C such that ox = Çx. Let r be the 
automorphism of K over C such that tx = x -1 . Show that 

o 3 = 1 = t 2 and T(j = ct - 1 t . 

Show that the group of automorphisms G generated by o and r has order 6 and the 
subfield F of K fixed by G is the field C(y) where y = x 3 + x“ 3 . 

34. Give an example of a field K which is of degree 2 over two distinct subfields E and F 
respectively, but such that K is not algebraic over E n F. 

35. Let k be a field and X a variable over k. Let 


cp(X) = 


f(X) 

g(X) 


be a rational function in k(X ), expressed as a quotient of two polynomials f g which 
are relatively prime. Define the degree of (p to be max(deg /, deg g). Let Y = (p(X). 
(a) Show that the degree of (p is equal to the degree of the field extension k(X) over k(Y) 
(assuming Y $ k). (b) Show that every automorphism of k(X) over k can be represented 
by a rational function (p of degree 1, and is therefore induced by a map 


Xv 


aX + b 
cX + d 


with a y b,c,dek and ad — bc # 0. (c) Let G be the group of automorphisms of k(X) 
over k. Show that G is generated by the following automorphisms: 

z b :X\-*X + b, o a :Xi->aX (a # 0), IhF 1 

with a,bek. 

36. Let k be a finite field with q elements. LetX = k(X) be the rational field in one variable. 
Let G be the group of automorphisms of K obtained by the mappings 


Xv 


aX + b 


cX + d 
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with a, b, c, d in k and ad - bc ^ 0. Prove the following statements: 

(a) The order of G is q 3 — q. 

(b) The fixed field of G is equal to k(Y) where 

v _ (X q2 - X ) q+1 
- Jx q - X) q2+Ì ' 

(c) Let H x be the subgroup of G consisting of the mappings X t—>aX + b with 
a # 0. The fixed field of H { is k(T) where T = (X q - X) q ~ l . 

(d) Let H 2 be the subgroup of H x consisting of the mappings X -> X + b with 
bek. The fixed field of H 2 is equal to k(Z) where Z = X q - X. 


Some aspects of Kummer theory 

37. Let k be a field of characteristic 0. Assume that for each finite extension E of k , the 
index (E* : E* n ) is finite for every positive integer n. Show that for each positive integer 
n, there exists only a finite number of abelian extensions of k of degree n. 

38. Let a # 0, # ± 1 be a square-free integer. For each prime number p , let K p be 
the splitting field of the polynomial X p - a over Q. Show that [K p : Q] = p(p - 1). 
For each square-free integer m > 0, let 

^ = n 

p\m 

be the compositum of all fields K p for p\m. Let d m = [ K m : Q] be the degree of K m 
over Q. Show that if m is odd then d m = Y[d p , and if m is even, m = 2n then d 2n = d n 

P I m 

or 2 d n according as Vâ is or is not in the field of m-th roots of unity Q(f m ). 

39. Let K be a field of characteristic 0 for simplicity. Let T be a finitely generated subgroup 
of K*. Let N be an odd positive integer. Assume that for each prime p\N we have 

r = r llp nK, 

and also that Gai\(K(p N )/K) « Z (N)*. Prove the following. 

(a) T/T N ~ T/(T n K* N ) « TK* N /K* N . 

(b) Let K n = K(h n ). Then 


r n K% n = T N . 

[Hint: If these two groups are not equal, then for some prime p \ N there exists 
an element aeT such that 

a = b p with beK N but b$K. 

In other words, a is not a p-th power in K but becomes a p-th power in K N . The 
equation x p — a is irreducible over K. Show that b has degree p over K(n p ), 
and that K(n p ,a ílp ) is not abelian over K , so a ìlp has degree p over K()i p ). 
Finish the proof yourself.] 
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(c) Conclude that the natural Kummer map 

r/T*' - Hom (HAN), |i„) 


is an isomorphism. 

(d) Let G r (N) = Gal(K(r 1/iV , \i n )/K). Then the commutator subgroup of G r (N) 
is Hr(N ), and in particular G3l\(K n /K) is the maximal abelian quotient of 

GAN)- 

40. Let K be a field and p a prime number not equal to the characteristic of K. Let T be a 
finitely generated subgroup of K*, and assume that T is equal to its own p-division 
group in K, that is if z e K and z p e T, then z e T. If p is odd, assume that \i p cz K, and 
if p = 2, assume that ji 4 c - K. Let 


(T : r p ) = p r+l . 


Show that r llp is its own p-division group in K(r 1/P ), and 

[ K(r llpm ):K ] = p m(r+1) 


for all positive integers m. 

41. Relative invariants (Sato). Let k be a field and K an extension of k. Let G be a group 
of automorphisms of K over k, and assume that k is the fixed field of G. (We do not 
assume that K is algebraic over k.) By a relative invariant of G in K we shall mean an 
element PeK, P ^ 0, such that for each oeG there exists an element X(o)ek for 
which P a = ^(cr)P. Since o is an automorphism, we have x(o) e k*. We say that the 
map x-G -* k* belongs to P, and call it a character. Prove the following statements: 

(a) The map x above is a homomorphism. 

(b) If the same character x belongs to relative invariants P and Q then there 
exists cek* such that P = cQ. 

(c) The relative invariants form a multiplicative group, which we denote by /. 
Elements PP m of / are called multiplicatively independent mod k* if 
their images in the factor group I/k* are multiplicatively independent, i.e. if 
given integers v^,..., v m such that 

P\' • • • = cek*, 

then v^ = • • • = v m = 0. 

(d) If P x , ..., P m are multiplicatively independent mod k* prove that they are 
algebraically independent over k. [Hint: Use Artin’s theorem on characters.] 

(e) Assume that K = k(X í9 ..., X n ) is the quotient field of the polynomial ring 
k[X x ,..., X n ] = k[X ], and assume that G induces an automorphism of the 
polynomial ring. Prove: If F j (X) and F 2 (X) are relative invariant polynomials, 
then their g.c.d. is relative invariant. If P(X) = F { (X)/F 2 (X) is a relative 
invariant, and is the quotient of two relatively prime polynomials, then F t (X) 
and F 2 (X) are relative invariants. Prove that the relative invariant poly- 
nomials generate I/k*. Let S be the set of relative invariant polynomials which 
cannot be factored into a product of two relative invariant polynomials of 
degrees ^ 1. Show that the elements of S/k* are multiplicatively independent, 
and hence that I/k* is a free abelian group. [If you know about transcendence 
degree, then using (d) you can conclude that this group is finitely generated.] 
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42. Let f(z) be a rational function with coefficients in a finite extension of the rationals. 
Assume that there are infinitely many roots of unity £ such that/(0 is a root of unity. 
Show that there exists an integer n such that/(z) = cz" for some constant c (which is in 
fact a root of unity). 

This exercise can be generalized as follows: Let T 0 be a finitely generated multi- 
plicative group of complex numbers. Let T be the group of all complex numbers y 
such that y m lies in T 0 for some integer m # 0. Let /(z) be a rational function with 
complex coefficients such that there exist infinitely many y e T for which f(y) lies in T. 
Then again,/(z) = cz n for some c and n. (Cf. Fundamentals of Diophantine Geometry.) 

43. Let K/k be a Galois extension. We define the Krull topology on the group 
G(K/k) = G by defining a base for open sets to consist of all sets aH where a E G 
and H — G(K/F) for some finite extension F of k contained in K. 

(a) Show that if one takes only those sets crH for which F is finite Galois over 
k then one obtains another base for the same topology. 

(b) The projective limit lim G/H is embedded in the direct product 

lim G/H -* n G/H. 

H H 

Give the direct product the product topology. By Tychonoff’s theorem in 
elementary point set topology, the direct product is compact because it is a 
direct product of finite groups, which are compact (and of course also discrete). 
Show that the inverse limit Hm_ G/H is closed in the product, and is therefore 
compact. 

(c) Conclude that G(K/k) is compact. 

(d) Show that every closed subgroup of finite index in G(K/k) is open. 

(e) Show that the closed subgroups of G(K/k) are precisely those subgroups 
which are of the form G(K/F) for some extension F of k contained in K. 

(f) Let H be an arbitrary subgroup of G and let F be the fixed field of H. Show 
that G(K/F) is the closure of H in G. 

44. Let k be a field such that every finite extension is cyclic, and having one extension of 
degree n for each integer n. Show that the Galois group G = G(k*/k) is the inverse limit 
lim Z/mZ, as mZ ranges over all ideals of Z, ordered by inclusion. Show that this limit 
is isomorphic to the direct product of the limits 

n lim Z/p"Z = ílz. 

p n ^°° p 

taken over all prime numbers p, in other words, it is isomorphic to the product of all 
p-adic integers. 

45. Let k be a perfect field and k a its algebraic closure. Let cr e G(k a /k) be an element 
of infinite order, and suppose k is the fixed field of cr. For each prime /?, let K p be 
the composite of all cyclic extensions of k of degree a power of p. 

(a) Prove that k a is the composite of all extensions K p . 

(b) Prove that either K p = k , or K p is infinite cyclic over k. In other words, K p 
cannot be finite cyclic over k and =é k. 

(c) Suppose k a = K p for some prime /?, so k a is an infinite cyclic tower of 
/?-extensions. Let wbea /?-adic unit, u E Z* such that u does not represent 
a rational number. Define cr“, and prove that cr, cr u are linearly independent 
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over Z, i.e. the group generated by <r and a u is free abelian of rank 2. In 
particular {cr} and {cr, cr“} have the same fixed field k. 

Witt vectors 

46. Let jCj, jc 2 , . .. be a sequence of algebraically independent elements over the integers 
Z. For each integer n ^ 1 define 


x (n) = X dx n J d . 

d | n 

Show that x n can be expressed in terms of x (d) for d \ n, with rational coefificients. 

Using vector notation, we call (xj, x 2 ,...) the Witt components of the vector x, 
and call (x (1) , x (2) ,...) its ghost components. We call x a Witt vector. 

Define the power series 


/x(o= no -x„n 

nè 1 

Show that 

-tylogf x (t)= I 

dt i 

d 

[By — log / (í) we mean f'(t)/f(t) if f (t) is a power series, and the derivative f(t) is taken 
dt 

formally.] 

If x, y are two Witt vectors, define their sum and product componentwise mth 
respect to the qhost components , i.e. 

(x + y) (n) = x (n) + y n) . 

What is (x + y)„? Well, show that 

fÁOfyit) = n ( l + (X + y)„n =f x+y (t). 

Hence (x + y) n is a polynomial with integer coefificients in x,, y u . .., x„, y n . Also show 
that 


4(í) = II (! - xT d y7 le n ielm 

d,e^ 1 

where m is the least common multiple of d, e and d, e range over all integers ^ 1. Thus 
(xy) n is also a polynomial in x x , y x ..., x„, y n with integer coefificients. The above 
arguments are due to Witt (oral communication) and dififer from those of his original 
paper. 

If A is a commutative ring, then taking a homomorphic image of the polynomial 
ring over Z into A, we see that we can define addition and multiplication of Witt 
vectors with components in A, and that these Witt vectors form a ring W(A). Show 
that W is a functor, i.e. that any ring homomorphism (p of A into a commutative ring A' 
induces a homomorphism W((p ): W(A) -► W(A'). 
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47. Let p be a prime number, and consider the projection of W(A) on vectors whose 
components are indexed by a power of p. Now use the log to the base p to index 
these components, so that we write x n instead of x p n. For instance, x 0 now denotes 
what was Jtj previously. For a Witt vector x = (jc 0 , jcj, ... , x nì .. .) define 

Vx = (0, x 0 , x u ...) and Fx = (x g,xÇ,...). 

Thus V is a shifting operator. We have V ° F = F ° V. Show that 
(Vx) (n) = px (n ~ l) and x in) = (Fx) {n ~ l) + p n x n . 

Also from the definition, we have 

jc ( w) = jcg" + px^ 1 + • • • + p'tt n . 

48. Let k be a field of characteristic p, and consider W(k). Then V is an additive endomorph- 
ism o{W(k\ and F is a ring homomorphism of VF(/c) into itself. Furthermore, if x e W(k) 
then 


px = VFx. 

If x, y e W(k), then (V l x)(V j y) = V i+j (F pj x • F pi y). Foraek denote by {a} the Witt 
vector (a, 0, 0,...). Then we can write symbolically 

* = X »"{*«>• 

» = o 

Show that if x e W(k) and x 0 ^ 0 then x is a unit in W(k). Hint : One has 


1 -x{xc>>} = Fy 


and then 


x{x 0 - ‘} I (VyY = (1 - Vy) £ (Vy)‘ = 1. 

0 0 

49. Let n be an integer ^ 1 and P a prime number again. Let k be a field of characteristic p. 
Let W„(k) be the ring of truncated Witt vectors (x 0 ,... ,x„_ x ) with components in k. 
We view W„(k) as an additive group. If x e W„(k), define #?(x) = Fx — x. Then p is a 
homomorphism. If K is a Galois extension of k , and cr E G(K/k ), and x E W n (K) we 
can define ox to have component (crx 0 ,..., crx„_ ^). Prove the analogue of Hilbert’s 
Theorem 90 for Witt vectors, and prove that the first cohomology group is trivial. (One 
takes a vector whose trace is not 0, and finds a coboundary the same way as in the proof 
ofTheorem 10.1). 

50. If x e W„(k), show that there exists Ç e W„(k) such that p(£) = x. Do this inductively, 
solving first for the first component, and then showing that a vector (0, a,,..., a„_,) is 
in the image of p if and only if (aj,..., a„_ t ) is in the image of p. Prove inductively 
that if £, Ç' e W„(k’) for some extension k ' of k and if = pÇ' then ^ — í' is a vector 
with components in the prime field. Hence the solutions of fpÇ = x for given x e W„(k) 
all difler by the vectors with components in the prime field, and there are p n such 
vectors. We define 
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or symbolically, 

k(p-'x). 

Prove that it is a Galois extension of /c, and show that the cyclic extensions of /c, of 
degree p", are precisely those of type k(p~ l x) with a vector x such that x 0 $ pk. 

51. Develop the Kummer theory for abelian extensions of k of exponent p n by using W n (k). 
In other words, show that there is a bijection between subgroups B of W n (k) containing 
pW n (k) and abelian extensions as above, given by 


B^K b 

where K B = k(p~ l B). All of this is due to Witt, cf. the references at the end of §8, 
especially [Wi 37]. The proofs are the same, mutatis mutandis , as those given for 
the Kummer theory in the text. 


Further Progress and directions 

Major progress was made in the 90s concerning some problems mentioned in the 
chapter. Foremost was Wiles’s proof of enough of the Shimura-Taniyama conjecture to 
imply Fermat’s Last Theorem [Wil 95], [TaW 95]. 

[TaW 95] R. Taylor and A. Wiles, Ring-theoretic properties or certain Hecke alge- 
bras, Annals of Math. 141 (1995) pp. 553-572 

[Wil 95] A. Wiles, Modular elliptic curves and Fermat’s last theorem, Annals. of 
Math. 141 (1995) pp. 443-551 

Then a proof of the complete Shimura-Taniyama conjecture was given in [BrCDT 01]. 

[BrCDT 01] C. Breuil, B. Conrad, F. Diamond, R. Taylor, On the modularity of el- 
liptic curves over Q: Wild 3-adic exercises, J. Amer. Math. Soc. 14 (2001) 
pp. 843-839 

In a quite different direction, Neukirch started the characterization of number fields 
by their absolute Galois groups [Ne 68], [Ne 69a], [Ne 69b], and proved it for Galois 
extensions of Q. His results were extended and his subsequent conjectures were proved 
by Ikeda and Uchida [Ik 77], [Uch 77], [Uch 79], [Uch 81]. These results were extended 
to finitely generated extensions of Q (function fields) by Pop [Pop 94], who has a more 
extensive bibliography on these and related questions of algebraic geometry. For these 
references, see the bibliography at the end of the book. 



CHAPTER VII_ 

Extensions of Rings 


It is not always desirable to deal only with field extensions. Sometimes one 
wants to obtain a field extension by reducing a ring extension modulo a prime 
ideal. This procedure occurs in several contexts, and so we are led to give the 
basic theory of Galois automorphisms over rings, looking especially at how the 
Galois automorphisms operate on prime ideals or the residue class fields. The 
two examples given after Theorem 2.9 show the importance of working over 
rings, to get families of extensions in two very different contexts. 

Throughout this chapter, A, B, C will denote commutative rings. 


§1. INTEGRAL RING EXTENSIONS 

In Chapters V and VI we have studied algebraic extensions of fields. For a 
number of reasons, it is desirable to study algebraic extensions of rings. 
For instance, given a polynomial with integer coefïìcients, say X 5 — X — 1, 
one can reduce this polynomial mod p for any prime p, and thus get a poly- 
nomial with coefïicients in a finite field. As another example, consider the 
polynomial 

X n + s„_ i X n 1 + • • • + s 0 

where s II _ 1 ,...,s 0 are algebraically independent over a field k. This poly- 
nomial has coefficients in /c[s 0 ,.. ., s„_ j] and by substituting elements of k for 
s 0 ,..., s„_ j one obtains a polynomial with coefficients in k. One can then get 
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information about polynomials by taking a homomorphism of the ring in 
which they have their coeíficients. This chapter is devoted to a brief description 
of the basic facts concerning polynomials over rings. 

Let M be an A-module. We say that M is faithful if, whenever a e A is such 
that aM = 0, then a = 0. We note that A is a faithful module over itself since 
A contains a unit element. Furthermore, if A + 0, then a faithful module over 
A cannot be the 0-module. 

Let A be a subring of B. Let a e B. The following conditions are equivalent: 

INT 1. The element a is a root of a polynomial 

X” Hh a n . { X n - x + ... + «o 

with coefficients Ai e /1, and degree n ^ 1. (The essential thing here 
is that the leading coefficient is equal to 1.) 

INT 2. The subring A[ot\ is a finitely generated /4-module. 

INT 3. There exists a faithful module over A[ol] which is a finitely gener- 
ated /1-module. 

We prove the equivalence. Assume INT 1. Let g(X) be a polynomial 
in A[X~\ of degree ^ 1 with leading coefficient 1 such that g(oì) = 0. If 
f(X)eA[X\ then 

f(X) = q(X)g(X) + r(X) 

with q, r e A[X\ and deg r < deg g. Hence /(a) = r(a), and we see that if 
deg g = n, then 1, a,..., a"“ 1 are generators of A[<x ] as a module over A. 

An equation g(X) = 0 with g as above, such that g(oi) = 0 is called an 
integral equation for a over A. 

Assume INT 2. We let the module be A[<x\ itself. 

Assume INT 3, and let M be the faithful module over A[ot\ which is finitely 
generated over A , say by elements w í9 ..., w n . Since otM c= M there exist ele- 
ments a tj e A such that 


aw, = , j 

+ • 

•• + a ln w„ 

aw„ = a nl Wi 

+ • * 

■ + a nn w n . 


Transposing otw u ..., aw„ to the right-hand side of these equations, we con- 
clude that the determinant 


a - a n 


d = 


a - a 22 


— a, 


— a, 


nn 


ot — a. 
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is such that dM = 0. (This will be proved in the chapter when we deal with 
determinants.) Since M is faithful, we must have d = 0. Hence a is a root of 
the polynomial 

dettfr^. - a u \ 

which gives an integral equation for a over A. 

An element x satisfying the three conditions INT 1, 2, 3 is called integral 
over A. 

Proposition 1.1. Let A be an enîire ring and K its quotient field. Let a be 
algebraic over K. Then there exists an element c # 0 in A such that ca is 
integral over A. 

Proof. There exists an equation 

a n <x n -f fl M _ 1 a' I_1 -f • • • + a 0 = 0 
with cii e A and a n ^ 0. Multiply it by a n ~ l . Then 

( a n oí) n + ••• + a 0 a n n ~ l = 0 

is an integral equation for a n a over A. This proves the proposition. 

Let A C B be subrings of a commutative ring C, and let a e C. If a is integral 
over A then a is a fortiori integral over B. Thus integrality is preserved under 
lifting. In particular, a is integral over any ring which is intermediate between 
A and B. 

Let B contain A as a subring. We shall say that B is integral over A if every 
element of B is integral over A. 

Proposition 1.2. IfB is integral over A andfinitely generated as an A-algebra , 
then B is finitely generated as an A-module. 

Proof. We may prove this by induction on the number of ring generators, 
and thus we may assume that B = A[a] for some element a integral over A, by 
considering a tower 

A c= A\_xf\ c= A\ot l9 a 2 ] c= • • c= A[x í9 ... , aj = B. 

But we have already seen that our assertion is true in that case, this being part 
of the definition of integrality. 

Just as we did for extension fields, one may define a class (3 of extension 
rings A c= B to be distinguished if it satisfies the analogous properties, namely: 

(1) Let A c= B c= C be a tower of rings. The extension A c= C is in G if 
and only if A c= B is in C and B c= C is in G. 

(2) If A c= B is in C, if C is any extension ring of A, and if B, C are both 
subrings of some ring, then C c= £[C] is in G. (We note that 
B[C~\ = C[B\ is the smallest ring containing both B and C.) 
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As with fields, we find formally as a consequence of (1) and (2) that (3) holds, 
namely: 

(3) If A c= B and A c= C are in C, and B, C are subrings of some ring, 
then A a £[C] is in C. 

Proposition 1.3. Integral ring extensions form a distinguished class. 

Proof. Let A C B C C be a tower of rings. If C is integral over A, then it 
is clear that B is integral over A and C is integral over B. Conversely, assume 
that each step in the tower is integral. Let a e C. Then a satisfies an integral 
equation 

a n + b n .. x a"" 1 + --- + fc 0 = 0 

with bi e B. Let = v4[fc 0 ,. .., Then is a finitely generated A- 

module by Proposition 1.2, and is obviously faithful. Then Bj[a] is finite over 
B ì9 hence over A, and hence a is integral over A. Hence C is integral over A. 
Finally let B, C be extension rings of A and assume B integral over A. Assume 
that B, C are subrings of some ring. Then C[£] is generated by elements of 
B over C, and each element of B is integral over C. That C[£] is integral over 
C will follow immediately from our next proposition. 

Proposition 1.4. Let A be a subring ofC. Then the elements of C which are 

integral over A form a subring of C. 

Proof. Let a, /? e C be integral over A. Let M = A\_ol] and N = /4[/?]. 
Then MN contains 1, and is therefore faithful as an ,4-module. Furthermore, 
aM c= M and fN a N. Hence MN is mapped into itself by multiplication 
with a + /} and ajS. Furthermore MN is finitely generated over A (if {vvj are 
generators of M and {vj} are generators of N then {vv,^} are generators of 
MN). This proves our proposition. 

In Proposition 1.4, the set of elements of C which are integral over A is 
called the integral closure of A in C. 

Example. Consider the integers Z. Let K be a fìnite extension of Q. We 
call K a number field. The integral closure of Z in K is called the ring of 
algebraic integers of K. This is the most classical example. 

In algebraic geometry, one considers a finitely generated entire ring R over 
Z or over a field k. Let F be the quotient field of R. One then considers the 
integral closure of R in F, which is proved to be finite over R. If K is a finite 
extension of F, one also considers the integral closure of R in K. 

Proposition 1.5. Let A cz B be an extension ring, and let B be integral 

over A. Let o be a homomorphism of B. Then o(B) is integral over o(A). 

Proof. Let a e B, and let 

ot n + a n - ^ a” 1 + • • • + a 0 = 0 
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be an integral equation for a over A. Applying o yields 

a(oc) n + o(a n - , )o(cc) n ~ 1 + • • • + cr(a 0 ) = 0, 
thereby proving our assertion. 

Corollary 1.6. Let A be an entire ring, k its quotient field, and E a finite 

extension of k. Let a e E be integral over A. Then the norm and trace of a 

(from E to k) are integral over A, and so are the coefficients of the irreducible 

polynomial satisfied by a over k. 

Proof. For each embedding cr of E over k, cra is integral over A. Since the 
norm is the product of oot over all such o (raised to a power of the characteristic), 
it follows that the norm is integral over A. Similarly for the trace, and similarly 
for the coefficients of Irr(a, k, X), which are elementary symmetric functions of 
the roots. 

Let A be an entire ring and k its quotient fìeld. We say that A is integrally 
closed if it is equal to its integral closure in k. 

Proposition 1.7. Let A be entire and factorial. Then A is integrally closed. 

Proof. Suppose that there exists a quotient a/b with a, b e A which is 
integral over A , and a prime element p in A which divides b but not a. We have, 
for some integer n ^ 1, and cz, e A, 

(a/b) n + a n - i(a/b) n 1 + • • • + a 0 = 0 

whence 

a n + a n _ x ba n ~ x + • • • + a 0 b n = 0. 

Since p divides b, it must divide a n , and hence must divide a, contradiction. 

Let /: A -* B be a ring-homomorphism (A, B being commutative rings). 
We recall that such a homomorphism is also called an A- algebra. We may 
view B as an ,4-module. We say that B is integral over A (for this ring-homo- 
morphism /) if B is integral over f(A). This extension of our definition of 
integrality is useful because there are applications when certain collapsings take 
place, and we still wish to speak of integrality. Strictly speaking we should 
not say that B is integral over A, but that /is an integral ring-homomorphism, 
or simply that / is integral. We shall use this terminology frequently. 

Some of our preceding propositions have immediate consequences for 
integral ring-homomorphisms; for instance, if f.A^B and g :B^>C are 
integral, then g o /: A -> C is integral. However, it is not necessarily true that 
if g o f is integral, so is /. 

Let /: A -> B be integral, and let S be a multiplicative subset of A. Then 
we get a homomorphism 

S~ l f: S~ X A -+ S~ l B, 

where strictly speaking, S~ X B = (f(S))~ X B, and S~ l f is defined by 

(S- l f)(x/s) = f(x)/f(s). 
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It is trivially verified that this is a homomorphism. 
diagram 


B - >S~ l B 


We have a commutative 


/ 


s- l f 


A -► S~ l A 


the horizontal maps being the canonical ones: x -* x/ì. 


Proposition 1.8. Let f\A->B be integral , and let S be a multiplicative 
subset of A. Then S~ l f : S~ l A -> S~ ^B is integral. 

Proof. If a e B is integral over f(A\ then writing ocf instead of f(a)fi>i or 
a e A and fi e B we have 

cc n 4- a n - t oc n ~ l -f • • • 4- a 0 = 0 

with a { e A. Taking the canonical image in S~ X A and S~ l B respectively, we 
see that this relation proves the integrality of a/1 over S~ l A , the coefficients 
being now aj 1. 


Propositíon 1.9. Let A be entire and integrally closed. Let Sbe a multipli- 
cative subset of A,0 $ S. Then S~ l A is integrally closed. 

Proof. Let a be an element of the quotient field, integral over We 

have an equation 

o" + — a"- 1 + ••• + — = 0, 

5 n-l S 0 

a { e A and e S. Let s be the product s w _ x • • • s 0 . Then it is clear that sa is 
integral over A , whence in A. Hence a lies in S _1 /l, and S _1 /l is integrally 
closed. 

Let p be a prime ideal of a ring A and let S be the complement of p in A. 
We write S = A — p. If/: A -> B is an /1-algebra (i.e. a ring-homomorphism), 
we shall write B v instead of S' l B. We can view B v as an A v = S~ ^/1-module. 

Let A be a subring of B. Let p be a prime ideal of A and let ^ be a prime 
ideal of B. We say that ^ lies above p if ^ n /1 = p. If that is the case, then 
the injection A B induces an injection of the factor rings 

A/p -> B/% 

and in fact we have a commutative diagram: 

B - >B/S$ 


A -► A/p 
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the horizontal arrows being the canonical homomorphisms, and the vertical 
arrows being injections. 

If B is integral over A, then B/ty is integral over A/p by Proposition 1.5. 

Propositíon 1.10. Let A be a subring of B , let p be a prime ideal of A , and 
assume B integral over A. Then pB # B and there exists a prime ideal ^ of 
B lying above p. 

Proof. We know that B p is integral over A p and that A p is a local ring 
with maximal ideal m p = 5" *p, where S = A — p. Since we obviously have 

pB p = pA p B p = m p B p , 

it will suffice to prove our first assertion when A is a local ring. (Note that the 
existence of a prime ideal p implies that 1 # 0, and pB = B if and only if 1 e pB.) 
In that case, if pB = B , then 1 has an expression as a finite linear combination 
of elements of B with coefficients in p, 

1 = fljhj -f • • • + a n b n 

with a { e p and ò, e B. We shall now use notation as if A p a B p . We leave it 
to the reader as an exercise to verify that our arguments are valid when we 
deal only with a canonical homomorphism A p ^> B p . Let B 0 = A[b u ..., b n ~\. 
Then pB 0 = B 0 and B 0 is a finite /1-module by Proposition 1.2. Hence B 0 = 0 
by Nakayama’s lemma, contradiction. (See Lemma 4.1 of Chapter X.) 

To prove our second assertion, note the following commutative diagram: 

B - >B P 


A - >A P 

We have just proved m p J5 p # B p . Hence m p B p is contained in a maximal ideal 
of B p . Taking inverse images, we see that the inverse image of 50Ì in A p is an 
ideal containing m p (in the case of an inclusion A p a B p the inverse image is 
9JÎ n A p ). Since m p is maximal, we have 9W n A p = m p . Let ^ be the inverse 
image of 9M in B (in the case of inclusion, = 9M n B). Then ^ is a prime 
ideal of B. The inverse image of m p in A is simply p. Taking the inverse image 
of 9W going around both ways in the diagram, we find that 

n A = p, 

as was to be shown. 

Propositíon 1.11. Let A be a subring of B , and assume that B is integral 
over A. Let tybe a prime ideal of B lying over a prime ideal p of A. Then ^ 
is maximal if and only if p is maximal. 
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Proof. Assume p maximal in A. Then A/p is a field, and B/S$ is an entire 
ring, integral over A/p. If a e B/% then a is algebraic over A/p, and we know 
that A/p\jx\ is a field. Hence every non-zero element of B/S$ is invertible in 
B/ty, which is therefore a field. Conversely, assume that ^3 is maximal in B. 
Then B/S$ is a field, which is integral over the entire ring A/p. If A/p is not a 
field, it has a non-zero maximal ideal m. By Proposition 1.10, there exists a 
prime ideal of B/S$ lying above m, 9W # 0, contradiction. 


§2. INTEGRAL GALOIS EXTENSIONS 

We shall now investigate the relationship between the Galois theory of a 
polynomial, and the Galois theory of this same polynomial reduced modulo a 
prime ideal. 

Proposition 2.1. Let A be an entire ring , integrally closed in its quotient 
fieid K. Let L be a fìnite Galois extension of K mth group G. Let p be a 
maximaì ideal of A, and let 93, Q be prime ideals of the integral closure B of 
A in L lying above p. Then there exists a e G such that aty = Q. 

Proof. Suppose that Q # cr93 for any o e G. Then tQ # <t 93 for any pair 

of elements <r, t g G. There exists an element xe B such that 

x = 0 (mod <t 93), all <r e G 

x = 1 (mod crQ), all <r e G 

(use the Chinese remainder theorem). The norm 

Nfa) = n ° x 

a e G 

lies in B n K = A (because A is integrally closed), and lies in 93 n A = p. 
But x ^ <tQ for all <r e G, so that <rx ^ Q for all o e G. This contradicts the fact 
that the norm of x lies in p = Q n A. 

If one localizes, one can eliminate the hypothesis that p is maximal; just 
assume that p is prime. 

Corollary 2.2 Let A be integrally closed in its quotient field K. Let E be a 
finite separable extension of K, and B the integral closure of A in E. Let p be 
a maximal ideal of A. Then there exists only a finite number ofprime ideals of 
B lying above p. 

Proof. Let L be the smallest Galois extension of K containing E. If Qj, 
Q 2 are two distinct prime ideals of B lying above p, and 93i, 93 2 are two pnme 
ideals of the integral closure of A in L lying above Cj and Q 2 respectively, then 
93i # 93 2 * This argument reduces our assertion to the case that E is Galois 
over K , and it then becomes an immediate consequence of the proposition. 
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Let A be integrally closed in its quotient field K, and let B be its integral 
closure in a finite Galois extension L, with group G. Then oB = B for every 
c g G. Let p be a maximal ideal of A, and a maximal ideal of B lying above p. 
We denote by G<p the subgroup of G consisting of those automorphisms such 
that cr^ = Then G v operates in a natural way on the residue class field 
B/ty, and leaves A/p fixed. To each c e Gy we can associate an automorphism 
c of B/ì P over A/p, and the map given by 

c i—► c 

induces a homomorphism of G<p into the group of automorphisms of B/ty 
over A/p. 

The group G<p will be called the decomposition group of Its fixed field 
will be denoted by L dec , and will be called the decomposition field of Let 
B dec be the integral closure of A in L dec , and 0 = ^în B dtc . By Proposition 2.1, 
we know that ^ is the only prime of B lying above Q. 

Let G = IJo-jGsp be a coset decomposition of G v in G. Then the prime 
ideals Ojty are precisely the distinct primes of B lying above p. Indeed, for two 
elements c, t e G we have if and only if t~ ^cr^ = i.e. t ~ l c lies in 

G v . Thus t, c lie in the same coset mod G v . 

It is then immediately clear that the decomposition group of a prime cr^P 
is cGyO' 1 . 

Proposition 2.3. The field L dec is the smallest subfield E of L containing 

K such that ^ is the only prime of B lying above n E (which is prime in 

B n E). 

Proof . Let E be as above, and let H be the Galois group of L over E. Let 
q = n E. By Proposition 2.1, all primes of B lying above q are conjugate by 
elements of H. Since there is only one prime, namely it means that H leaves 
^P invariant. Hence G c= G v and E => L dec . We have already observed that 
L dec has the required property. 

Proposition 2.4. Notation being as above, we have A/p = B dec /Q (under 

the canonical injection A/p -> B dec /Q). 

Proof. If c is an element of G, not in G v , then cr<P # ^ and cr -1 ^ # ty. 
Let 


= c~ l y n B dec . 

Then Q a ^ Q. Let x be an element of B dec . There exists an element y of B dec 
such that 


y = x (mod Q) 
y = 1 (mod Q a ) 
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for each o in G, but not in G*p. Hence in particular, 

y = x (mod ^B) 
y = 1 (mod o~ 1 ^B) 

for each cr not in G<p. This second congruence yields 

oy = 1 (mod ^B) 

for all g The norm of y from L dec to K is a product of y and other factors 
oy with o $ G<p. Thus we obtain 

N? c \y) = x (mod^B). 

But the norm lies in K , and even in A, since it is a product of elements integral 
over A. This last congruence holds mod D, since both x and the norm lie in 
B dec . This is precisely the meaning of the assertion in our proposition. 

If x is an element of B, we shall denote by x its image under the homo- 
morphism B B/ty. Then 5 is the automorphism of B/S\ B satisfying the relation 


GX = (ox). 


If f(X) is a polynomial with coefficients in B, we denote by f(X) its natural 
image under the above homomorphism. Thus, if 


/(*) = fc M X" + ... + òo 


then 


f{X) = B m X' + - + 5 0 . 

Proposition 2.5. Let A be integrally closed in its quotient field K, and let 
B be its integral closure in a finite Galois extension L of K, mth group G. 
Let p be a maximal ideal of A, and ^ a maximal ideal of B lying above p. 
Then B/^B is a normal extension of A/p, and the map o\-+â induces a homo- 
morphism of G<p onto the Galois group of B/ty over A/p. 

Proof. Let B = B/ty and À = A/p. Any element of B can be written as 
x for some xe B. Let x generate a separable subextension of B over Â, and let 
/ be the irreducible polynomial for x over K. The coefficients of / lie in A 
because x is integral over A, and all the roots of/are integral over A. Thus 

m 

nx) = n (x - xd 

i= 1 
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splits into linear factors in B. Since 

f(X) =t(X- X,) 

i = 1 

and all the x, lie in B, it follows that/splits into linear factors in B. We observe 
that f(x) = 0 implies f(x) = 0. Hence B is normal over Â , and 

[Â(x): Â\ ^ [K(x) :K~]^[L: K~\. 

This implies that the maximal separable subextension of Â in B is of finite 
degree over Â (using the primitive element theorem of elementary field theory). 
This degree is in fact bounded by [L : K\. 

There remains to prove that the map gives a surjective homo- 

morphism of G v onto the Galois group of B over Â. To do this, we shall give 
an argument which reduces our problem to the case when ^ is the only prime 
ideal of B lying above p. Indeed, by Proposition 2.4, the residue class fields of 
the ground ring and the ring B dec in the decomposition field are the same. 
This means that to prove our surjectivity, we may take L dec as ground field. 
This is the desired reduction, and we can assume K = L dec , G = G v . 

This being the case, take a generator of the maximal separable subextension 
of B over À, and let it be x, for some element x in B. Let /be the irreducible 
polynomial of x over K. Any automorphism of B is determined by its effect 
on x, and maps x on some root of/. Suppose that x = x P Given any root x f 
of/, there exists an element o of G = G v such that ox = x,. Hence ôx = x,. 
Hence the automorphisms of B over À induced by elements of G operate 
transitively on the roots of /. Hence they give us all automorphisms of the 
residue class field, as was to be shown. 

Corollary 2.6. Let A be integrally closed in its quotient field K. Let L be a 
finite Galois extension of K f and B the integral closure of A in L. Let p be a 
maximal ideal ofA. Let <p: A —» A/p be the canonical homomorphism, and let 
\f/ x , tfe be two homomorphisms of B extending <p in a given algebraic closure 
ofA/p. Then there exists an automorphism cr of L over K such that 

^1=^2° cr. 

Proof. The kernels of ì// í9 \j/ 2 are prime ideals of B which are conjugate 
by Proposition 2.1. Hence there exists an element t of the Galois group G 
such that i/^i, ìf /2 ° t have the same kernel. Without loss of generality, we may 
therefore assume that i// l9 \\f 2 have the same kernel ^3. Hence there exists an 
automorphism co of i//^(B) onto \I/ 2 (B) such that ^ = \j/ 2 . There exists an 
element o of G<p such that a> ° ^ =^o(j, by the preceding proposition. This 
proves what we wanted. 
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Remark. In all the above propositions, we could assume p prime instead 
of maximal. In that case, one has to localize at p to be able to apply our proofs. 

In the above discussions, the kernel of the map 

G<p Gy 

is called the inertia group of ty. It consists of those automorphisms of G v 
which induce the trivial automorphism on the residue class field. Its fixed field 
is called the inertia field, and is denoted by L in . 

Corollary 2.7. Let the assumptions be as in Corollary 2.6 and assume that 
^ is the only prime of B lying above p. Let f(X)bea polynomial in A\_X ] 
mth leading coeffcient 1. Assume that f is irreducible in X[A"], and has a 
root a in B. Then the reduced polynomial f is a power of an irreducible poly- 
nomial in Â\Xf 

Proof. By Corollary 2.6, we know that any two roots of / are conjugate 
under some isomorphism of B over À, and hence that/cannot split into relative 
prime polynomials. Therefore, / is a power of an irreducible polynomial. 

Proposition 2.8. Let A be an entire ring , integrally closed in its quotient 
field K. Let L be a finite Galois extension of K. Let L = K( a), where a is 
integral over A , and let 

f(X) = X n + a n - x X n -' + ••• + a 0 

be the irreducible polynomial of a over k, with a t e A. Let p be a maximal 
ideal in A , let be a prime ideal of the integral closure B of A in L,S$ lying 
above p. Let f(X) be the reduced polynomial with coejficients in A/p. Let 
G<p be the decomposition group. If f has no multiple roots , then the map 
o i—► <r has trivial kernel , and is an isomorphism of Gy on the Galois group of 
f over A/p. 

Proof. Let 

nx) = n (* - *) 

be the factorization of / in L. We know that all x^eB. If oeG v , then we 
denote by ô the homomorphic image of o in the group G v , as before. We 
have 


j(X) =n (x - x,). 

Suppose that Jx, = x, for all i. Since (ôxj = ôx h and since / has no multiple 
roots, it follows that o is also the identity. Hence our map is injective, the in- 
ertia group is trivial. The field Â[x h ..., xJ is a subfield of B and any auto- 
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morphism of B over À which restricts to the identity on this subfield must be 
the identity, because the map G v -> G v is onto the Galois group of B over Â. 
Hence B is purely inseparable over Â[x l9 ...,x n ] and therefore G v is iso- 
morphic to the Galois group of / over À. 

Proposition 2.8 is only a special case of the more-general situation when 
the root of a polynomial does not necessarily generate a Galois extension. We 
state a version useful to compute Galois groups. 

Theorem 2.9. Let A be an entire ring , integrally closed in its quotient field 

K. Let f(X)eA[X ] have leading coejficient 1 and be irreducible over K 
(or A 9 it's the same thing). Let p be a maximal ideal of A and let f = / mod p. 
Suppose that f has no multiple roots in an aìgebraic closure of A/p. Let 
L be a splitting field for f over K , and let B be the integral closure of A in 

L. Let ^ be any prime of B above p and let a bar denote reduction mod p. 
Then the map 

G v —> G v 

is an isomorphism of G v mth the Galois group of f over À. 

Proof. Let (<x u ..., a n ) be the roots of / in B and let (â l5 ..., â„) be their 
reductions mod Since 

f(X) = lî (X - 

i = 1 

it follows that 

fm = íi (x - á ; ). 

i= 1 

Any element of G is determined by its effect as a permutation of the roots, and 
for o e G v , we have 


O = OXj. 

Hence if ô = id then o = id, so the map G v -+ G v is injective. It is surjective 
by Proposition 2.5, so the theorem is proved. 

This theorem justifies the statement used to compute Galois groups in Chapter 
VI, §2. 

Theorem 2.9 gives a very efficient tool for analyzing polynomials over a 
ring. 

Example. Consider the “generic” polynomial 


f w (X) = X" + w n _ í X n - 1 + ••• + w 0 
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where w 0 ,. . ., w n _ x are algebraically independent over a field k. We know that 
the Galois group of this polynomial over the field K = k(w 0 ,. . . , vv w _!) is the 
symmetric group. Let t x ,..., t n be the roots. Let a be a generator of the splitting 
field L; that is, L = K(a). Without loss of generality, we can select a to be 
integral over the ring k[w 0 , . . ., w„_,](multiply any given generator by a suitably 
chosen polynomial and use Proposition 1.1). Let g w (X) be the irreducible poly- 
nomial of a over k(w 0 ,. . . , w„_!). The coefficients of g are polynomials in (w). 
If we can substitute values (a) for (w) with a 0 ,. .. , a n _ x e k such that g a remains 
irreducible, then by Proposition 2.8 we conclude at once that the Galois group 
of g a is the symmetric group also. Similarly, if a finite Galois extension of 
k(w 0 ,. . ., w n ~ x ) has Galois group G, then we can do a similar substitution to 
get a Galois extension of k having Galois group G , provided the special polynomial 
g a remains irreducible. 

Example. Let K be a number field; that is, a finite extension of Q. Let o 
be the ring of algebraic integers. Let L be a finite Galois extension of À'andO 
the algebraic integers in L. Let p be a prime of o and ^ a prime of 0 lying above 
p. Then o/p is a finite field, say with q elements. Then 0/^is a finite extension 
of o/p, and by the theory of finite fields, there is a unique element in G<p, called 
the Frobenius element Fr^, such that Fr^(T) = x q for x e 0/^p. The conditions 
of Theorem 2.9 are satisfied for all but a finite number of primes p, and for such 
primes, there is a unique element Fr^ e G^ such that Fr^(jc) = x q mod for all 
x e 0. We call Fr<p the Frobenius element in G^. Cf. Chapter VI, §15, where 
some of the significance of the Frobenius element is explained. 


§3. EXTENSION OF HOMOMORPHISMS 

When we first discussed the process of localization, we considered very 
briefly the extension of a homomorphism to a local ring. In our discussion of 
field theory, we also described an extension theorem for embeddings of one 
field into another. We shall now treat the extension question in full generality. 

First we recall the case of a local ring. Let A be a commutative ring and p 
a prime ideal. We know that the local ring A p is the set of all fractions x/y , with 
x, y e A and y £ p. Its maximal ideal consists of those fractions with jc e p. Let 
L be a field and let <p: A —» L be a homomorphism whose kernel is p. Then we 
can extend <p to a homomorphism of A p into L by letting 

<P(x/y) = (p(x)/cp(y) 
if x/y is an element of A p as above. 

Second, we have integral ring extensions. Let o be a local ring with maximal 
ideal m, let B be integral over o, and let cp : o -► L be a homomorphism of o 
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into an algebraically closed field L. We assume that the kernel of cp is m. By 
Proposition 1.10, we know that there exists a maximal ideal 9M of B lying above 
m, i.e. such that 9JÎ n o = m. Then B/9JÎ is a field, which is an algebraic exten- 
sion of o/m, and o/m is isomorphic to the subfield cp( o) of L because the kernel 
of cp is m. 

We can find an isomorphism of o/m onto cp( o) such that the composite 
homomorphism 


o -► o/m -> L 

is equal to cp. We now embed B/9M into L so as to make the following diagram 
commutative: 


B - >B/m 


o-► o/m-> L 

and in this way get a homomorphism of B into L which extends cp. 

Proposition 3.1. Let Abe a subring of B and assume that B is integral over 
A. Let cp : A -► L be a homomorphism into a field L which is algebraically 
closed. Then cp has an extension to a homomorphism of B into L. 

Proof. Let p be the kernel of cp and let S be the complement of p in A. 
Then we have a commutative diagram 

B - >S~ l B 


A - >S~ l A = A p 

and cp can be factored through the canonical homomorphism of A into S _Ì A. 
Furthermore, S~ l B is integral over S~ l A. This reduces the question to the 
case when we deal with a local ring, which has just been discussed above. 

Theorem 3.2. Let A be a subring of a field K and let x e K, x # 0. Let 
cp : A -+ L be a homomorphism of A into an algebraically closed field L. 
Then cp has an extension to a homomorphism of A[x\ or A[x~ l \ into L. 

Proof. We may first extend cp to a homomorphism of the local ring A p , 
where p is the kernel of cp. Thus without loss of generality, we may assume that 
A is a local ring with maximal ideal m. Suppose that 

m/l[x _1 ] = A [x — 1 ] • 
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Then we can write 


1 = a 0 -f úíjX - 1 -f • • • + a n x~ n 
with a { e xxx. Multiplying by x n we obtain 

(1 - a 0 )x n -f 1 -f • • • + b 0 = 0 

with suitable elements e A. Since a 0 e m, it follows that 1 — a 0 $ m and 
hence 1 - a 0 is a unit in A because A is assumed to be a local ring. Dividing 
by 1 — <a 0 we see that x is integral over A, and hence that our homomorphism 
has an extension to A[x] by Proposition 3.1. 

If on the other hand we have 

xx\A[x~ *] # A[x~ *] 

then xx\A[x~ 1 ] is contained in some maximal ideal ^ of A[x~ 1 ] and n A 
contains m. Since m is maximal, we must have ^ n A = m. Since <p and the 
canonical map A -> A/xxx have the same kernel, namely m, we can find an 
embedding ij/ of A/xx\ into L such that the composite map 

A -> A/xxx L 

is equal to cp. We note that A/xxx is canonically embedded in B/ty where 
B = A[x~'l and extend \j/ to a homomorphism of B/ty into L, which we can 
do whether the image of x“ 1 in B/ty is transcendental or algebraic over A/xx\. 
The composite B -► B/S$ -► L gives us what we want. 

CoroIIary 3.3. Let Abe a subring of a field K and let L be an algebraically 
closed field. Let (p : A-* Lbe a homomorphism. Let Bbe a maximal subring 
of K to which (p has an extension homomorphism into L. Then B is a local 
ring and if x e K, x # 0, then x e B or x~ l e B. 

Proof. Let 5 be the set of pairs (C, \jj) where C is a subring of K and 
\j/: C -> L is a homomorphism extending cp. Then S is not empty (containing 
(A 9 cp) ], and is partially ordered by ascending inclusion and restriction. In 
other words, (C, \j/) ^ (C', \j/') if C <= C' and the restriction of \p' to C is equal 
to \j/. It is clear that 5 is inductively ordered, and by Zorn’s lemma there exists 
a maximal element, say (£, \j/ 0 ). Then first B is a local ring, otherwise \j/ 0 extends 
to the local ring arising from the kernel, and second, B has the desired property 
according to Theorem 3.2. 

Let B be a subring of a field K having the property that given x e K, x ± 0, 
then x e. B oï x~ 1 e B. Then we call B a valuation ring in K. We shall study 
such rings in greater detail in Chapter XII. However, we shall also give some 
applications in the next chapter, so we make some more comments here. 
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Let Fbea field. We let the symbol oo satisfy the usual algebraic rules. If 
aeF, we define 

a±oo = oo, a • oo = oo if a ± 0, 

oo • oo = oo, - = °o and — = 0. 

0 oo 

The expressions oo ± oo, 0 • oo, 0/0, and oo/oo are not defined. 

A place (p of a field X into a field F is a mapping 

(p : K -* {F, oo} 

of K into the set consisting of F and oo satisfying the usual rules for a homo- 
morphism, namely 


(p(a ± b) = (p(a) ± (p(b\ 

(p(ab) = (p(a)(p(b) 

whenever the expressions on the right-hand side of these formulas are defined, 
and such that (p( 1) = 1. We shall also say that the place is F-valued. The 
elements of K which are not mapped into oo will be called finite under the place, 
and the others will be called infìnite. 

The reader will verify at once that the set o of elements of K which are 
fìnite under a place is a valuation ring of K. The maximal ideal consists of those 
elements x such that (p(x ) = 0. Conversely, if o is a valuation ring of K with 
maximal ideal m, we let (p : o -> o/m be the canonical homomorphism, and 
define (p(x) = oo for x e K, x $ o. Then it is trivialíy verifìed that (p is a place. 

If (p x :K-*{F u oo } and (p 2 ' K -+ { F 2 , oo} are places of X, we take their 
restrictions to their images. We may therefore assume that they are surjective. 
We shall say that they are equivalent if there exists an isomorphism À: Fj -► F 2 
such that (p 2 = (pi ° A. (We put A(oo) = oo.) One sees that two places are 
equivalent if and only if they have the same valuation ring. It is clear that there 
is a bijection between equivalence classes of places of X, and valuation rings of 
X. A place is called trivial if it is injective. The valuation ring of the trivial place 
is simply X itself. 

As with homomorphisms, we observe that the composite of two places is also 
a place (trivial verification). 

It is often convenient to deal with places instead of valuation rings, just as it is 
convenient to deal with homomorphisms and not always with canonical homo- 
morphisms or a ring modulo an ideal. 

The general theory of valuations and valuation rings is due to Krull, All- 
gemeine Bewertungstheorie, J. reine angew. Math. 167 (1932), pp. 169-196. 
However, the extension theory of homomorphisms as above was realized only 
around 1945 by Chevalley and Zariski. 
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We shall now give some examples of places and valuation rings. 

Example 1. Let p be a prime number. Let be the ring of all rational 
numbers whose denominator is not divisible by p. Then is a valuation ring. 
The maximal ideal consists of those rational numbers whose numerator is divisible 
by p- 

Example 2. Let k be a field and R = k[X] the polynomial ring in one 
variable. Let p = p(X) be an irreducible polynomial. Let o be the ring of rational 
functions whose denominator is not divisible by p. Then o is a valuation ring, 
similar to that of Example 1. 

Example 3. Let R be the ring of power series k[[X]] in one variable. Then 
R is a valuation ring, whose maximal ideal consists of those power series divisible 
by X. The residue class field is k itself. 

Example 4. Let R = k[[X x ,. . . , X n ]] be the ring of power series in several 
variables. Then R is not a valuation ring, but R is imbedded in the field of repeated 
power series /:((X!))((X 2 )) * * * ((X n )) = K n . By Example 3, there is a place of 
K n which is X^pvalued. By induction and composition, we can define a 
/:-valued place of K n . Since the field of rational functions k(X ì9 . .. , X n ) is 
contained in K nf the restriction of this place to k(X u . . . , X n ) gives a k-\ alued 
place of the field of rational functions in n variables. 

Example 5. In Chapter XI we shall consider the notion of ordered field. 
Let k be an ordered subfield of an ordered field K. Let o be the subset of elements 
of K which are not infinitely large with respect to k. Let m be the subset of 
elements of o which are infinitely small with respect to k. Then o is a valuation 
ring in K and m is its maximal ideal. 

The fol!owing property of places will be used in connection with projective 
space in the next chapter. 

Proposition 3.4. Let <p: K —> {L, oo} be an L-valued place of K. Given a 

finite number of non-zero elements jc lf ..., x n e K there exists an index j such 

that <p is finite on xjx y for i = 1,. . . , n. 

Proof. Let B be the valuation ring of the place. Define x t ^ Xj to mean that 
xjxj e B. Then the relation ^ is transitive, that is if x t ^ x y and xj % x r then 

^ x r . Furthermore, by the property of a valuation ring, we always have 
x t ^ Xj or Xj ^ Xi for all pairs of indices /, j. Hence we may order our ele- 
ments, and we select the index j such that x t ^ Xj for all i. This index j 
satisfies the requirement of the proposition. 

We can obtain a characterization of integral elements by means of val- 
uation rings. We shall use the following terminology. If o, O are local 
rings with maximal ideals m, 9W respectively, we shall say that O lies above o 
if o c= O and 9JÎ n o = m. We then have a canonical injection o/m -> 0/9JÎ. 
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Proposition 3.5. Let obe a local ring contained in a field L. An element x of 
L is integral over o if and only ij x lies in every valuation ring O of L lying 
above o. 

Proof Assume that x is not integral over o. Let m be the maximal ideal of o. 
Then the ideal (m, 1/x) of o[l/x] cannot be the entire ring, otherwise we can 
write 


-1 = a„( l/x) n + ■■■ + a,( 1/x) + y 
with y e m and a { e o. From this we get 


(1 + y)x n + • • • + a n = 0. 


But 1 + y is not in m, hence is a unit of o. We divide the equation by 1 + y to 
conclude that x is integral over o, contrary to our hypothesis. Thus (m, 1/x) is 
not the entire ring, and is contained in a maximal ideal ^, whose intersection 
with o contains m and hence must be equal to m. Extending the canonical homo- 
morphism o[l/x] -► o[l/x]/^$ to a homomorphism of a valuation ring O of L, 
we see that the image of 1/x is 0 and hence that x cannot be in this valuation ring. 
Conversely, assume that x is integral over o, and let 

x n + a n - x x n ~ x + • • • + a 0 = 0 

be an integral equation forx with coefficients in o. Let D be any valuation ring 
of L lying above o. Suppose x £ O. Let <p be the place given by the canonical 
homomorphism of O modulo its maximal ideal. Then <p(x) = oo so <p(ì/x) = 0.- 
Divide the above equation by x n , and apply <p. Then each term except the first 
maps to 0 under <p , so we get <p(l) = 0, a contradiction which proves the 
proposition.' 

Proposition 3.6. Let A be a ring contained in a field L. An element x of L 
is integral over A ifand only ifx lies in every valuation ring O ofL containing 
A. In terms of places, x is integral over A ifand only if every place ofL finite 
on A is finite on x. 

Proof. Assume that every place finite on A is finite on x. We may assume 
x + 0. If 1/x is a unit in A[l/x] then we can write 

x = c 0 + Cj(l/x) + • • • + C n — ! (1 / x) n ~ 1 

with t'i e A and some n. Multiplying by x n ~ l we conclude thatx is integral over 
A. If 1/x is not a unit in A[l/x], then 1/x generates a proper principal ideal. 
By Zorn’s lemma this ideal is contained in a maximal ideal 33Î. The homomorphism 
A[l/x] —> A[ì/x]/W can be extended to a place which is a finite on A but maps 
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1/jc on 0, so jc on which contradicts the possibility that 1 /jc is not a unit in 
A[1 /jc] and proves that jc is integral over A. The converse implication is proved 
just as in the second part of Proposition 3.5. 

Remark. Let K be a subfield of L and let jc e L. Then jc is integral over 
K if and only if jc is algebraic over K. So if a place <p of L is finite on K , and jc 
is algebraic over K , then <p is finite on K(x). Of course this is a trivial case of 
the integrality criterion which can be seen directly. Let 

jc" -I- a n - x x n ~ x + • • • + a 0 = 0 

be the irreducible equation for jc over K. Suppose jc + 0. Then a 0 + 0. Hence 
<p(x) + 0 immediately from the equation, so <p is an isomorphism of K(x) on its 
image. 

The next result is a generalization whose technique of proof can also be used 
in Exercise 1 of Chapter IX (the Hilbert-Zariski theorem). 

Theorem 3.7. General Integrality Criterion. Let A be an entire ring. 

Let z x . z m be elements ofsome extension fìeld ofits quotient field K. Assume 

that each z s (s = 1 ,.. . , m) satisfies a polynomial relation 

4 S + 9s( z 1 . z m) = 0 

where g s (Z ìt ..., Z m ) e A[Z lt ... f Z m ] is a polynomial of total degree < d s> 
and that any pure power ofZ s occuring with non-zero coejficient in g s occurs 
with a power strictly less than d s . Then z lt ..., z m are integral over A. 

Proof. We apply Proposition 3.6. Suppose some z s is not integral overA. 
There exists a place <p of X, finite on A , such that <p(z s ) = °° f 0 r some 5. By 
Proposition 3.4 we can pick an index 5 such that <p(zj/z s ) + °° for all j. We 
divide the polynomial relation of the hypothesis in the lemma by zf s and apply 
the place. By the hypothesis on g st it follows that <p(g s (z) / zf s ) = 0, whence we 
get 1 = 0, a contradiction which proves the theorem. 


EXERCISES 

1. Let K be a Galois extension of the rationals Q, with group G. Let B be the mtegral 
closure of Z in K , and let a e B be such that K = Q(a). Let f(X) = Irr(a, Q, X). Let 
p be a prime number, and assume that / remains irreducible mod p over Z/pZ. What 
can you say about the Galois group G? (Artm asked this question to Tate on his qualify- 
ing exam.) 

2. Let A be an entire ring and K its quotient field. Let t be transcendental over K. If A 
is integrally closed, show that 4[í] is integrally closed. 
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For the folloxving exercises, you can use §1 of Chapter X. 

3. Let A be an entire ring, ìntegrally closed in its quotient field K. Let L be a finite separable 
extension of K, and let B be the integral closure of A in L. If A is Noetherian, show that 
B ìs a finite /1-module. [Hint: Let {co,,..., a>„} be a basis of L over K. Multiplying 
all elements of this basis by a suitable element of A , we may assume without loss of 
generality that all co, are integral over A. Let {co,,..., a>'„ } be the dual basis relative to 
the trace, so that Tr(co f co') = <5 fj . Write an element a of L integral over A in the form 

a = b x a>\ + • • • + b„(o„ 

with bj E K. Taking the trace Tríaco,), for / = 1,...,«, conclude that B is contained 
in the finite module A(o\ + • • • + Aaj' n .] Hence B is Noetherian. 

4. The preceding exercise applies to the case when A = Z and k = Q. Let L be a finite 
extension of Q and let o L be the ring of algebraic integers in L. Let a x ,. . . , a n be 
the distinct embeddings of L into the complex numbers. Embedded o L into a Euclidean 
space by the map 

a ( a x a ,. .. , a n a). 

Show that in any bounded region of space, there is only a finite number of elements 
of o L . [Hint: The coefficients in an integral equation for a are elementary symmetric 
functions of the conjugates of a and thus are bounded integers.] Use Exercise 5 of 
Chapter III to conclude that o L is a free Z-module of dimension ^ n. In fact, show 
that the dimension is n, a basis of o L over Z also being a basis of L over Q. 

5. Let E be a finite extension of Q, and let o L be the ring of algebraic integers of E. Let 
U be the group of units of o £ . Let a x ,.. ., a n be the distinct embeddings of E into 
C. Map U into a Euclidean space, by the map 

/ ; ai—► (log |crja|,..., log \ c„oc\). 

Show that l(U) ìs a free abelian group, finitely generated, by showing that in any fimte 
region of space, there is only a finite number of elements of l(U). Show that the kernel 
of / ìs a finite group, and is therefore the group of roots of unity in E. Thus U itself is a 
finitely generated abelian group. 

6. Generalize the results of §2 to infinite Galois extensions, especially Propositions 2.1 
and 2.5, using Zorn’s lemma. 

7. Dedekind rings. Let o be an entire ring which is Noetherian, integrally closed, and 
, such that every non-zero prime ideal is maximal. Define a fractional ideal û to be an 

o -submodule ¥= 0 of the quotient field K such that there exists c E o, c + 0 for which 
ca C o . Prove that the fractional ideals form a group under multiplication. Hint 
following van der Waerden: Prove the following statements in order: 

(a) Given an ideal û + 0 in o, there exists a product of prime ideals 
Pi •••P r Co. 

(b) Every maximal ideal p is invertible, i.e. if we let p -1 be the set of elements 
x E K such that x p C o , then p " 1 p = o. 

(c) Every non-zero ideal is invertible, by a fractional ideal. (Use the Noetherian 
property that if this is not true, there exists a maximal non-invertible ideal 
a, and get a contradiction.) 
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8. Using prime ideals instead of prime numbers for a Dedekind ring A, define the notion 
of content as in the Gauss lemma, and prove that if /(X), g(X) E A[X] are polynomials 
of degree ^ 0 with coefficients in A, then cont (fg) = cont(/)cont(p). Also if K is 
the quotient field of A, prove the same statement for/, g E K[X]. 

9. Let A be an entire ring, integrally closed. Let B be entire, integral over A. Let Q x , 
Q 2 be prime ideals of B with Q x D Q 2 but Q x =É Q 2 . Let P, = Q^ C\ A. Show that 
P x * P 2 . 

10. Let n be a positive integer and let (, (' be primitive n- th roots of unity. 

(a) Show that (1 - £)/(l - (') is an algebraic integer. 

(b) If n ^ 6 is divisible by at least two primes, show that 1 — (is a unit in the 
ring Z[(]. 

11. Let p be a prime and ( a primitive p- th root of unity. Show that there is a principal 
ideal J in Z[(] such that J p ~ l = (p) (the principal ideal generated by p). 


Symmetric Polynomials 

12. Let F be a field of characteristic 0. Let t\ ,..., t n be algebraically independent over F. 
Let s \,..., s n be the elementary symmetric functions. Then R = F[t\ is an 

integral extension of S = F[s\ ,... ,j„], and actually is its integral closure in the 
rational field F(t \,..., t n ). Let W be the group of permutation of the variables 

t\ 1 ‘ • • 5 t n . 

(a) Show that S = R w is the fixed subring of R under W. 

(b) Show that the elements /[' with 0 ^ n - i form a basis of R over 
S, so in particular, R is free over S. 

I am told that the above basis is due to Kronecker. There is a much more interesting 
basis, which can be defined as follows. 

Let dj,..., d n be the partial derivatives with respect to so d, = d/dti. Let 

P e F[t\ = F[t \,..., t n \. Substituting d,- for r, (i — 1,..., n) gives a partial differential 
operator P(d) = P(d \,..., d n ) on R. An element of S can also be viewed as an element of 
R. Let Q e R. We say that Q is fL-hannonic if P(d)Q = 0 for all symmetric polynomials 
P e S with 0 constant term. It can be shown that the IF-harmonic polynomials form a 
finite dimensional space. Furthermore, if {H \,..., H N } is a basis for this space over F , 
then it is also a basis for R over S. This is a special case of a general theorem of Che- 
valley. See [La 99b], where the special case is worked out in detail. 



CHAPTER VIII 


Transcendental Extensions 


Both for their own sake and for applications to the case of finite exten- 
sions of the rational numbers, one is led to deal with ground fields which are 
function fields, i.e. finitely generated over some field /c, possibly by elements 
which are not algebraic. This chapter gives some basic properties of such 
fields. 


§1. TRANSCENDENCE BASES 

Let K be an extension field of a field k. Let S be a subset of K. We 
recall that S (or the elements of S) is said to be algebraically independent 
over fc, if whenever we have a relation 

o = Z a (v) M (v) (S) = I a (v) f[ * vW 

xeS 

with coefficients a (v) e /c, almost all a (v) = 0, then we must necessarily have all 

«(v) 

We can introduce an ordering among algebraically independent subsets of 
X, by ascending inclusion. These subsets are obviously inductively ordered, 
and thus there exist maximal elements. If S is a subset of K which is 
algebraically independent over k 9 and if the cardinality of S is greatest among 
all such subsets, then we call this cardinality the transcendence degree or 
dimension of K over k. Actually, we shall need to distinguish only between 
finite transcendence degree or infinite transcendence degree. We observe that 
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the notion of transcendence degree bears to the notion of algebraic indepen- 
dence the same relation as the notion of dimension bears to the notion of 
linear independence. 

We frequently deal with families of elements of K, say a family {x f } ie/ , 
and say that such a family is algebraically independent over k if its elements 
are distinct (in other words, x t # Xj if i # j) and if the set consisting of the 
elements in this family is algebraically independent over k. 

A subset 5 of K which is algebraically independent over k and is maximal 
with respect to the inclusion ordering will be called a transcendence base of 
K over k. From the maximality, it is clear that if S is a transcendence base 
of K over /c, then K is algebraic over k(S). 

Theorem 1.1. Let K be an extension of a field k. Any two transcendence 
bases of K over k have the same cardinality. If T is a subset of K such that 
K is algebraic over k(T), and S is a subset of T which is algebraically indepen- 
dent over k , then there exists a transcendence base of K over k such that 

S C (B C r. 


Proof. We shall prove that if there exists one finite transcendence base, say 
{jcj , . . . , jt w }, m ^ 1, m minimal, then any other transcendence base must also 
have m elements. For this it will suffice to prove: If w Ì9 ... 9 w n are elements 
of K which are algebraically independent over k then n ^ m (for we can then 
use symmetry). By assumption, there exists a non-zero irreducible polynomial 
f x in m + 1 variables with coefficients in k such that 

/ 1 (^ 1 , , xj = 0. 

After renumbering x Ì9 ... , x m we may write f x = X gj(w Ì9 x 2 > . .. , xj x\ with 
some g N =/= 0 with some N ^ 1. No irreducible factor of g N vanishes on 
(vv^, x 2 ,. .. , x n ), otherwise w x would be a root of two distinct irreducible polyno- 
mials over k(x x , ... , xj. Hence x x is algebraic over k(w Ì9 x 2 , ... , xj and 
vv^, jc 2 , ... , x m are algebraically independent over k , otherwise the minimal- 
ity of m would be contradicted. Suppose inductively that after a suitable re- 
numbering of x 2 , ... , x m we have found Wj, ... , w r (r < n) such that K is 
algebraic over k(w {9 ... , w r , x r+Ì9 ... , xj. Then there exists a non-zero 
polynomial /in m + 1 variables with coefficients in k such that 

/(w r+1 , w x , . .. , w r , x r+ì9 ... 9 xJ = 0. 

Since the vv’s are algebraically independent over k 9 it follows by the same argument 
as in the first step that some x j9 say x r+1 , is algebraic over k(w Ì9 . . . , vv r+1 , 
jc r+2 , . . . , xj. Since a tower of algebraic extensions is algebraic, it follows 
that K is algebraic over k(w Ì9 . . . , vv r+1 , jc r+2 , ... , xj. We can repeat the 
procedure, and if n ^ m we can replace all the jc’s by vv’s, to see that K is 
algebraic over k(w x , . . . , vv m ). This shows that n^ m implies n = m 9 as desired. 
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We have now proved: Either the transcendence degree is finite, and is 
equal to the cardinality of any transcendence base, or it is infinite, and every 
transcendence base is infinite. The cardinality statement in the infinite case 
will be left as an exercise. We shall also leave as an exercise the statement 
that a set of algebraically independent elements can be completed to a 
transcendence base, selected from a given set l such that K is algebraic over 
k(T). (The reader will note the complete analogy of our statements with those 
concerning linear bases.) 

Note. The preceding section is the only one used in the next chapter. The 
remaining sections are more technical , especially §3 and §4 which will not be 
used in the rest of the book. Even §2 and §5 will only be mentioned a 
couple of times , and so the reader may omit them until they are referred to 
again. 


§2. NOETHER NORMALIZATION THEOREM 

Theorem 2.1. Let fe[xj,..., x„] = fe[x] be a finitely generated entire ring 
over a field fe, and assume that fe(x) has transcendence degree r. Then there 
exist elements y ì9 ..., y r in fe[x] such that fe[x] is integral over 

kLyì = fe[yi,...,y r ]. 

Proof. If (xj,...,x w ) are already algebraically independent over fc, we 
are done. If not, there is a non-trivial relation 

Z^{ 1 ••• X j n n = 0 

with each coelficient a U) e fe and a U) # 0. The sum is taken over a finite 
number of distinct n-tuples of integers (j ì9 j v è 0. Let m 2 , be 

positive integers, and put 

y 2 = *2 - •••» y n = x n - xi n . 

Substitute x t = y t + xî” 1 (i = 2, ...,n) in the above equation. Using vector 
notation, we put (m) = (1, m 2 ,..., m n ) and use the dot product (j)'(m) to 
denote j\ + m 2 j 2 + ” + m n j n - If we expand the relation after making the 

above substitution, we get 

£ c o)* < i ÍHm) + f(xi, y 2 > • • •. y») = o 

where / is a polynomial in which no pure power of x t appears. We now 
select d to be a large integer [say greater than any component of a vector (j) 
such that c U) # 0] and take 

0 m) = (í,d,d 2 ,...,d "). 




358 TRANSCENDENTAL EXTENSIONS 


VIII, §2 


Then all (j)-(m) are distinct for those (j) such that c U) # 0. In this way we 
obtain an integral equation for over k\_y 2 ,..., y n \. Since each x f (i > 1) 
is integral over k\x ì9 y 2 ,yj, it follows that /c[x] is integral over 
k\y 2 > •••» y n l- We can now proceed inductively, using the transitivity of 
integral extensions to shrink the number of /s until we reach an alge- 
braically independent set of /s. 

The advantage of the proof of Theorem 2.1 is that it is applicable when k 
is a finite field. The disadvantage is that it is not linear in x í9 ..., x n . We 
now deal with another technique which leads into certain aspects of algebraic 
geometry on which we shall comment after the next theorem. 

We start again with k[x ì9 ... 9 x n ~\ finitely generated over k and entire. 
Let (Uij) (i 9 j = 1,..., n) be algebraically independent elements over k(x), and 
let k u = k(u) = /c(u 0 ) alli-J . Put 

n 

yi = Z u ìj x j- 

j =l 

This amounts to a generic linear change of coordinates in n-space, to use 
geometric terminology. Again we let r be the transcendence degree of k(x) 
over k. 

Theorem 2.2. With the above notation , k M [x] is integral over 

Proof. Suppose some x t is not integral over k u \y ì9 ..., y r \. Then there 
exists a place (p of k u (y) finite on k u \_y í9 ..., y r \ but taking the value oo on 
some x t . Using Proposition 3.4 of Chapter VII, and renumbering the indices 
if necessary, say (p(xj/x n ) is finite for all i. Let zj = (p(Xj/x n ) for j = 1,..., n. 
Then dividing the equations y f = Ya u ij x j by x n (for i = 1,..., r) and applying 
the place, we get 

0 = u n z[ +Mi 2^2 + * ,, + u ín9 


0 — u rí z\ + u r2 z' 2 + • • • + u rn . 

The transcendence degree of k(z') over k cannot be r, for otherwise, the place 
(p would be an isomorphism of k(x) on its image. [Indeed, if, say, z \ 9 ..., z' r 
are algebraically independent and z { , = x f /x n , then z í9 ..., z r are also alge- 
braically independent, and so form a transcendence base for k(x) over k. 
Then the place is an isomorphism from k(z ì9 ... 9 z r ) to k(z\ 9 ... 9 z r ) 9 and 
hence is an isomorphism from k(x) to its image.] We then conclude that 

u ln9 ..., u rn €k(u ij9 z') with i = 1, ...,r; ; = 1, ...,n-l. 

Hence the transcendence degree of k(u) over k would be ^rn — 1, which is a 
contradiction, proving the theorem. 
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Corollary 2.3. Let k be a field, and let k(x) be a finitely generated 
extension of transcendence degree r. There exists a polynomial P(u) = 
P(Uìj) e fc[w] such that if (c) = (c 0 ) is a family of elements c 0 e fc satisfying 
P(c) # 0, and we let y[ = £c 0 Xj, then fc[x] is integral over fcfyj,y']. 

Proof By Theorem 2.2, each x f is integral over k u ly í9 ..., y r ]. The 
coefficients of an integral equation are rational functions in fc M . We let P(m) 
be a common denominator for these rational functions. If P(c) # 0, then 
there is a homomorphism 

(p:k{x)[u, PÍm) -1 ] -*■ k(x) 

such that cp(u ) = (c), and such that (p is the identity on fc(x). We can apply (p 
to an integral equation for x t over fc M [y] to get an integral equation for x f 
over fc[y'], thus concluding the proof. 

Remark. After Corollary 2.3, there remains the problem of finding ex- 
plicitly integral equations for x!,...,x n (or y r+í ,..., y„) over k u [y l9 ..., yj. 
This is an elimination problem, and I have decided to refrain from further 
involvement in algebraic geometry at this point. But it may be useful to 
describe the geometric language used to interpret Theorem 2.2 and further 
results in that line. After the generic change of coordinates, the map 

(j^l , * *, ^ (y 1. - - -. ^r) 

is the generic projection of the variety whose coordinate ring is fc[x] on 
affine r-space. This projection is finite, and in particular, the inverse image of 
a point on affine r-space is finite. Furthermore, if fc(x) is separable over fc (a 
notion which will be defined in §4), then the extension fc M (y) is finite separable 
over fc„(yi, •••, y r ) (in the sense of Chapter V). To determine the degree of 
this finite extension is essentially Bezout’s theorem. Cf. [La 58], Chapter 
VIII, §6. 

The above techniques were created by van der Waerden and Zariski, cf., 
for instance, also Exercises 5 and 6. These techniques have unfortunately not 
been completely absorbed in some more recent expositions of algebraic 
geometry. To give a concrete example: When Hartshorne considers the 
intersection of a variety and a sufficiently general hyperplane, he does not 
discuss the “generic” hyperplane (that is, with algebraically independent 
coefficients over a given ground field), and he assumes that the variety is 
non-singular from the start (see his Theorem 8.18 of Chapter 8, [Ha 77]). 
But the description of the intersection can be done without simplicity as- 
sumptions, as in Theorem 7 of [La 58], Chapter VII, §6, and the corre- 
sponding lemma. Something was lost in discarding the technique of the 
algebraically independent (m 0 ). 

After two decades when the methods illustrated in Chapter X have been 
prevalent, there is a return to the more explicit methods of generic construc- 
tions using the algebraically independent (m 0 ) and similar ones for some 



360 TRANSCENDENTAL EXTENSIONS 


VIII, §3 


applications because part of algebraic geometry and number theory are 
retuming to some problems asking for explicit or effective constructions, with 
bounds on the degrees of solutions of algebraic equations. See, for instance, 
[Ph 91-95], [So 90], and the bibliography at the end of Chapter X, §6. Return- 
ing to some techniques, however, does not mean abandoning others; it 
means only expanding available tools. 
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§3. LINEARLY DISJOINT EXTENSIONS 

In this section we discuss the way in which two extensions K and L of a 
field k behave with respect to each other. We assume that all the fields 
involved are contained in one field í), assumed algebraically closed. 

K is said to be linearly disjoint from L over k if every finite set of 
elements of K that is linearly independent over k is still such over L. 

The definition is unsymmetric, but we prove right away that the property 
of being linearly disjoint is actually symmetric for K and L. Assume K 
linearly disjoint from L over k. Let yi,... 9 y n be elements of L linearly 
independent over k. Suppose there is a non-trivial relation of linear depen- 
dence over X, 

(1) *1 yi + x 2 y 2 + • • • + X„y„ = 0. 

Say x x r are linearly independent over k, and x r+1 , x„ are linear 

r 

combinations x t = £ i = r 4- 1,..., n. We can write the relation (1) as 

fi=i 

follows: 

fi = 1 i=r+l 

and collecting terms, after inverting the second sum, we get 

Z (>v+ Z («.„)'.•)) = °- 

M = 1 \ i=r +1 / 


£ + z 


1=1 


Vi = o 
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The ys are linearly independent over fc, so the coefficients of x M are # 0. 
This contradicts the linear disjointness of K and L over fc. 

We now give two criteria for linear disjointness. 

Criterion 1. Suppose that K is the quotient field of a ring R and L the 
quotient field of a ring S. To test whether L and K are linearly disjoint, it 
suffices to show that if elements y í9 ..., y„ of S are linearly independent over 
fc, then there is no linear relation among the y 9 s with coefficients in R. 
Indeed, if elements y i9 ... 9 y n of L are linearly independent over fc, and if 
there is a relation x Y y x + • • • + x n y n = 0 with e X, then we can select y in 
S and x in R such that xy # 0, yy^ e S for all i, and xx f e R for all i. 
Multiplying the relation by xy gives a linear dependence between elements of 
R and S. However, the yy^ are obviously linearly independent over fc, and 
this proves our criterion. 

Criterion 2. Again let R be a subring of K such that K is its quotient 
field and R is a vector space over fc. Let {u a } be a basis of R considered as a 
vector space over fc. To prove K and L linearly disjoint over fc, it suffices to 
show that the elements {u a } of this basis remain linearly independent over L. 
Indeed, suppose this is the case. Let x ì9 ... 9 x m be elements of R linearly 
independent over'fc. They lie in a finite dimension vector space generated by 
some of the u a , say u l9 ... 9 u n . They can be completed to a basis for this 
space over fc. Lifting this vector space of dimension n over L, it must 
conserve its dimension because the u’s remain linearly independent by hy- 
pothesis, and hence the x’s must also remain linearly independent. 


Proposition 3.1. Let K be a field containing another field fc, and let 
L E be two other extensions of k. Then K and L are linearly disjoint 
over k if and only if K and E are linearly disjoint over k and KE, L are 
linearly disjoint over E. 


KL 



KE 




K 


E 
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Proof. Assume first that K , E are linearly disjoint over fc, and KE , L are 
linearly disjoint over E. Let {/c} be a basis of K as vector space over fc (we 
use the elements of this basis as their own indexing set), and let {a} be a 
basis of E over fc. Let {X} be a basis of L over E. Then {ocX} is a basis of L 
over fc. If K and L are not linearly disjoint over fc, then there exists a 
relation 

Z (l Aa = 0 with some c kAí[ # 0, c K>lai e k. 

Changing the order of summation gives 

z(e = 0 

A \k,X J 

contradicting the linear disjointness of L and KE over E. 

Conversely, assume that K and L are linearly disjoint over fc. Then a 
fortiori , K and E are also linearly disjoint over fc, and the field KE is the 
quotient field of the ring L[K] generated over E by all elements of K. This 
ring is a vector space over L, and a basis for K over fc is also a basis for this 
ring L[/C] over L. With this remark, and the criteria for linear disjointness, 
we see that it suffices to prove that the elements of such a basis remain 
linearly independent over L. At this point we see that the arguments given 
in the first part of the proof are reversible. We leave the formalism to the 
reader. 

We introduce another notion concerning two extensions K and L of a 
field fc. We shall say that K is free from L over k if every finite set of 
elements of K algebraically independent over fc remains such over L. If (x) 
and (y) are two sets of elements in fì, we say that they are free over k (or 
independent over k) if fc(x) and k(y) are free over fc. 

Just as with linear disjointness, our definition is unsymmetric, and we 
prove that the relationship expressed therein is actually symmetric. Assume 
therefore that K is free from L over fc. Let y ì9 -.. 9 y tt be elements of L, 
algebraically independent over fc. Suppose they become dependent over K. 
They become so in a subfield F of K finitely generated over fc, say of 
transcendence degree r over fc. Computing the transcendence degree of F(y) 
over fc in two ways gives a contradiction (cf. Exercise 5). 


F(y) 
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Proposition 3.2. If K and L are linearly disjoint over fc, then they are free 
over fc. 

Proof Let x i9 ..., x n be elements of K algebraically independent over fc. 
Suppose they become algebraically dependent over L. We get a relation 

Z )'aM x (x) = 0 

between monomials M a (x) with coeflScients y a in L. This gives a linear 
relation among the M a (x). But these are linearly independent over fc because 
the x’s are assumed algebraically independent over fc. This is a contradiction. 

Proposition 3.3. Let L be an extension of fc, and let (u) = (u u ..., u r ) be a 
set of quantities algebraically independent over L. Then the fìeld k(u) is 
linearly disjoint from L over fc. 

Proof According to the criteria for linear disjointness, it suffices to 
prove that the elements of a basis for the ring fc[u] that are linearly indepen- 
dent over fc remain so over L. In fact the monomials M(u) give a basis of 
k[u] over fc. They must remain linearly independent over L, because as 
we have seen, a linear relation gives an algebraic relation. This proves our 
proposition. 

Note finally that the property that two extensions K and L of a field fc 
are linearly disjoint or free is of finite type. To prove that they have either 
property, it suffices to do it for all subfields K 0 and L 0 of K and L 
respectively which are finitely generated over fc. This comes from the fact 
that the definitions involve only a finite number of quantities at a time. 


§4. SEPARABLE AND REGULAR EXTENSIONS 

Let K be a finitely generated extension of fc, K = k(x). We shall say that 
it is separably generated if we can find a transcendence basis (í 1 ,...,í r ) of 
K/k such that K is separably algebraic over k(t). Such a transcendence base 
is said to be a separating transcendence base for K over fc. 

We always denote by p the characteristic if it is not 0. The field obtained 
from fc by adjoining all p m -th roots of all elements of fc will be denoted by 
fc 1/pm . The compositum of all such fields for m = 1, 2,..., is denoted by fc 1/p °°. 

Proposition 4.1. The following conditions concerning an extension field K 

of k are equivalent: 

(i) K is linearly disjoint from fc 1/p °°. 

(ii) K is linearly disjoint from fc 1/pm for some m. 
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(iii) Every subfield of K containing k and finitely generated over k is 
separably generated. 

Proofi It is obvious that (i) implies (ii). In order to prove that (ii) 
implies (iii), we may clearly assume that K is finitely generated over fc, say 

K = k(x) = k(x í ,...,x n ). 

Let the transcendence degree of this extension be r. If r = n, the proof is 
complete. Otherwise, say x l9 ...,x r is a transcendence base. Then x r+1 is 
algebraic over k(x ì9 ..., x r ). Let f(X l9 ... 9 X r+l ) be a polynomial of lowest 
degree such that 

fi(x ^,..., x r+ ^) 0. 

Then / is irreducible. We contend that not all x t (i = 1,..., r -f 1) appear to 
the p-th power throughout. If they did, we could write f(X) = Y,c a M a (X) p 
where M a (X) are monomials in X ì9 ..., X r+Í and c a ek. This would imply 
that the M a (x) are linearly dependent over k l/p (taking the p-th root of the 
equation £c a M a (x) p = 0). However, the M a (x) are linearly independent over 
fc (otherwise we would get an equation for x l5 ..., x r+1 of lower degree) and 
we thus get a contradiction to the linear disjointness of fc(x) and k 1/p . Say 
X x does not appear to the p-th power throughout, but actually appears in 
f(X). We know that f(X) is irreducible in k[X i9 ...,A r r+1 ] and hence /(x) = 0 
is an irreducible equation for x t over fc(x 2 ,..., x r+1 ). Since X t does not 
appear to the p-th power throughout, this equation is a separable equation 
for x x over fc(x 2 ,...., x r+1 ), in other words, x x is separable algebraic over 
fc(x 2 ,..., x r+1 ). From this it follows that it is separable algebraic over 
fc(x 2 ,..., x„). If (x 2 ,..., x„) is a transcendence base, the proof is complete. If 
not, say that x 2 is separable over fc(x 3 ,..., x n ). Then fc(x) is separable over 
fc(x 3 ,..., x n ). Proceeding inductively, we see that the procedure can be 
continued until we get down to a transcendence base. This proves that (ii) 
implies (iii). It also proves that a separating transcendence base for fc(x) over 
fc can be selected from the given set of generators (x). 

To prove that (iii) implies (i) we may assume that K is finitely generated 
over fc. Let (u) be a transcendence base for K over fc. Then K is separably 
algebraic over k(u). By Proposition 3.3, k(u) and k ì/p °° are linearly disjoint. 
Let L = fc 1/p0 °. Then k(u)L is purely inseparable over fc(u), and hence is 
linearly disjoint from K over fc(u) by the elementary theory of finite algebraic 
extensions. Using Proposition 3.1, we conclude that K is linearly disjoint 
from L over fc, thereby proving our theorem. 

An extension K of fc satisfying the conditions of Proposition 4.1 is called 
separable. This definition is compatible with the use of the word for alge- 
braic extensions. 

The first condition of our theorem is known as MacLane’s criterion. It 
has the following immediate corollaries. 
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Corollary 4.2. If K is separable over k, and E is a subfield of K contain- 
ing k, then E is separable over k. 

Corollary 4.3. Let E be a separable extension of k, and K a separable 
extension of E. Then K is a separable extension of k. 

Proof Apply Proposition 3.1 and the definition of separability. 

Corollary 4.4. If k is perfect, every extension of k is separable. 

Corollary 4.5. Let K be a separable extension of k, and free from an 
extension L of k. Then KL is a separable extension of L. 

Proof. An element of KL has an expression in terms of a finite number 
of elements of K and L. Hence any finitely generated subfield of KL 
containing L is contained in a composite field FL, where F is a subfield of K 
finitely generated over k. By Corollary 4.2, we may assume that K is finitely 
generated over k. Let (í) be a transcendence base of K over k, so K is 
separable algebraic over k(t). By hypothesis, (í) is a transcendence base of 
KL over L, and since every element of K is separable algebraic over k(t), it 
is also separable over L(t). Hence KL is separably generated over L. This 
proves the corollary. 

Corollary 4.6. Let K and L be two separable extensions of k, free from 
each other over k. Then KL is separable over k. 

Proof. Use Corollaries 4.5 and 4.3. 

Corollary 4.7. Let K, L be two extensions of k, linearly disjoint over k. 
Then K is separable over k if and only if KL is separable over L. 

Proof. If K is not separable over k, it is not linearly disjoint from k ì/p 
over k, and hence a fortiori it is not linearly disjoint from Lk l/P over k. By 
Proposition 4.1, this implies that KL is not linearly disjoint from Lk i/P over 
L, and hence that KL is not separable over L. The converse is a special case 
of Corollary 4.5, taking into account that linearly disjoint fields are free. 

We conclude our discussion of separability with two results. The first one 
has already been proved in the first part of Proposition 4.1, but we state it 
here explicitly. 

Proposition 4.8. If K is a separable extension of k, and is finitely gener- 
ated, then a separating transcendence base can be selected from a given set 
of generators. 

To state the second result we denote by K pm the field obtained from K 
by raising all elements of K to the p m -th power. 
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Proposition 4.9. Let K be a finitely generated extension of a field k. If 

K pm k = K for some m , then K is separably algebraic over k. Conversely , if 

K is separably algebraic over k , then K pm k = K for all m. 

Proof. If K/k is separably algebraic, then the conclusion follows from 
the elementary theory of finite algebraic extensions. Conversely, if K/k is 
finite algebraic but not separable, then the maximal separable extension of k 
in K cannot be all of K, and hence K p k cannot be equal to K. Finally, if 
there exists an element t of K transcendental over k , then k(t ì/pm ) has degree 
p m over fc(í), and hence there exists a t such that t ì,pm does not lie in K. This 
proves our proposition. 

There is a class of extensions which behave particularly well from the 
point of view of changing the ground field, and are especially useful in 
algebraic geometry. We put some results together to deal with such exten- 
sions. Let K be an extension of a field fc, with algebraic closure K a . We 
claim that the following two conditions are equivalent: 

REG 1. fc is algebraically closed in K (i.e. every element of K algebraic 
over fc lies in fc), and K is separable over fc. 

REG 2. K is linearly disjoint from fc a over fc. 

We show the equivalence. Assume REG 2. By Proposition 4.1, we know that 
K is separably generated over fc. It is obvious that fc must be algebraically 
closed in K. Hence REG 2 implies REG 1. To prove the converse we need 
a lemma. 

Lemma 4.10. Let k be algebraically closed in extension K. Let x be 

some element of an extension of K , but algebraic over k. Then k(x) and K 

are linearly disjoint over fc, and \_k(x): fc] = [ K(x ): K]. 

Proof. Let f(X) be the irreducible polynomial for x over fc. Then / 
remains irreducible over K\ otherwise, its factors would have coefficients 
algebraic over fc, hence in fc. Powers of x form a basis of fc(x) over fc, hence 
the same powers form a basis of K(x) over K. This proves the lemma. 

To prove REG 2 from REG 1, we may assume without loss of generality 
that K is finitely generated over fc, and it suffices to prove that K is linearly 
disjoint from an arbitrary finite algebraic extension L of fc. If L is separable 
algebraic over fc, then it can be generated by one primitive element, and we 
can apply Lemma 4.10. 

More generally, let E be the maximal separable subfield of L containing 
fc. By Proposition 3.1, we see that it suffices to prove that KE and L are 
linearly disjoint over E. Let (t) be a separating transcendence base for K 
over fc. Then K is separably algebraic over fc(í). Furthermore, (t) is also a 
separating transcendence base for KE over L, and KE is separable algebraic 
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over E(t). Thus KE is separable over E, and by definition KE is linearly 
disjoint from L over K because L is purely inseparable over E. This proves 
that REG 1 implies REG 2. 

Thus we can define an extension K of fc to be regular if it satisfies either 
one of the equivalent conditions REG 1 or REG 2. 

Proposition 4.11. 

(a) Let K be a regular extension of k, and let E be a subfield of K containing 
k. Then E is regular over k. 

(b) Let E be a regular extension of k, and K a regular extension of E. 
Then K is a regular extension of k. 

(c) If k is algebraically closed, then every extension of k is regular. 

Proof. Each assertion is immediate from the definition conditions REG 
1 and REG 2. 

Theorem 4.12. Let K be a regular extension of k, let L be an arbitrary 

extension of fc, both contained in some larger field , and assume that K , L 

are free over k. Then K, L are linearly disjoint over k. 

Proof (Artin). Without loss of generality, we may assume that K is 
finitely generated over fc. Let x j,..., x n be elements of K linearly indepen- 
dent over fc. Suppose we have a relation of linear dependence 

Xiyj + • • • + x n y„ = 0 

with y t € L. Let cp be a fc a -valued place of L over fc. Let (í) be a transcen- 
dence base of K over fc. By hypothesis, the elements of (í) remain alge- 
braically independent over L, and hence (p can be extended to a place of KL 
which is identity on k(t). This place must then be an isomorphism of K on 
its image, because K is a finite algebraic extension of k(t) (remark at the 
end of Chapter VII, §3). After a suitable isomorphism, we may take a place 
equivalent to q> which is the identity on K. Say (p(yi/y n ) is finite for all i (use 
Proposition 3.4 of Chapter VII). We divide the relation of linear dependence 
by y n and apply (p to get Y, x i ( P(yi/yn) = 0» which gives a linear relation 
among the x f with coefficients in fc a , contradicting the linear disjointness. 
This proves the theorem. 

Theorem 4.13. Let K be a regular extension of k, free from an extension 

L of k over k. Then KL is a regular extension of L. 

Proof. From the hypothesis, we deduce that K is free from the algebraic 
closure L a of L over fc. By Theorem 4.12, K is linearly disjoint from L a over 
fc. By Proposition 3.1, KL is linearly disjoint from L a over L, and hence KL 
is regular over L. 
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Corollary 4.14. Let K , L be regular extensions of fc, free from each other 
over fc. Then KL is a regular extension of fc. 

Proof Use Corollary 4.13 and Proposition 4.11 (b). 

Theorem 4.13 is one of the main reasons for emphasizing the class of 
regular extensions: they remain regular under arbitrary base change of the 
ground field fc. Furthermore, Theorem 4.12 in the background is important 
in the study of polynomial ideals as in the next section, and we add 
some remarks here on its implications. We now assume that the reader is 
acquainted with the most basic properties of the tensor product (Chapter 
XVI, §1 and §2). 


Corollary 4.15. Let K = fc(x) be a finitely generated regular extension , 
free from an extension L of fc, and both contained in some larger field. 
Then the natural k-algebra homomorphism 

L ® k fc[x] -> L[x] 

is an isomorphism. 

Proof. By Theorem 4.12 the homomorphism is injective, and it is obvi- 
ously surjective, whence the corollary follows. 

Corollary 4.16. Let k(x) be a finitely generated regular extension , and let 
p be the prime ideal in fc[X] vanishing on (x), that is , consisting of all 
polynomials f(X)ek[X ] such that f(x) = 0. Let L be an extension of fc, 
free from k(x) over k. Let p L be the prime ideal in L[X] vanishing on (x). 
Then p L = pL[X], that is p L is the ideal generated by p in L[X], and in 
particular, this ideal is prime. 

Proof. Consider the exact sequence 

0->p-fc[X]-fc[x]-*0. 

Since we are dealing with vector spaces over a field, the sequence remains 
exact when tensored with any fc-space, so we get an exact sequence 

0 —► L ($$) k p — > L[X~\ — L ® k fc[x] -► 0. 

By Corollary 4.15, we know that L ® k k[x] ~ L[x], and the image of L ® k p 
in L[X\ is pL[X\ so the lemma is proved. 

Corollary 4.16 shows another aspect whereby regular extensions behave 
well under extension of the base field, namely the way the prime ideal p 
remains prime under such extensions. 
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§5. DERIVATIONS 


A derivation D of a ring R is a mapping D: R -+ R oî R into itself which is 
linear and satisfies the ordinary rule for derivatives, i.e., 


D(x -f y) = Dx -f Dy and D(xy) = xDy -f yDx. 

As an example of derivations, consider the polynomial ring k\_X~\ over a field 
k. For each variable X h the partial derivative d/dX t taken in the usual 
manner is a derivation of k[X\. 

Let R be an entire ring and let K be its quotient field. Let D: R -* R be a 
derivation. Then D extends uniquely to a derivation of K , by defining 


D(u/v) = 


vDu — uDv 


It is immediately verified that the expression on the right-hand side is 
independent of the way we represent an element of K as u/v (m, v e R), and 
satisfies the conditions defining a derivation. 

Note. In this section, we shall discuss derivations of fields. For deriva- 
tions in the context of rings and modules, see Chapter XIX, §3. 

A derivation of a field K is trlvial if Dx = 0 for all x e K. It is trivial over 
a subfìeld k of K if Dx = 0 for all xek. A derivation is always trivial over 
the prime field: One sees that 

D(l) = D(ì • 1) = 2D(1), 

whence D(\) = 0. 

We now consider the problem of extending derivations. Let 
L = K(x) = K(x í9 ...,x n ) 


be a finitely generated extension. If f e K[X\, we denote by df/dx^ the 
polynomials df/dX^ evaluated at (x). Given a derivation D on K , does there 
exist a derivation D* on L coinciding with D on K1 If f(X)eK[X\ is a 
polynomial vanishing on (x), then any such D* must satisfy 

(1) 0 = D*f(x) = f D (x) + X (ôf/dx^Xb 

where f D denotes the polynomial obtained by applying D to all coelficients 
of /. Note that if relation (1) is satisfied for every element in a finite set of 
generators of the ideal in K\X\ vanishing on (x), then (1) is satisfied by every 
polynomial of this ideal. This is an immediate consequence of the rules for 
derivations. The preceding ideal will also be called the ideal determined by 
(x) in K\X\. 
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The above necessary condition for the existence of a D* turns out to be 
sufficient. 


Theorem 5.1. Let D be a derivation of a field K. Let 


(x) = {x u ...,x n ) 

be a finite family of elements in an extension of K. Let {f a (X)} be a set of 
generators for the ideal determined by (x) in K[X~\. Then , if (u) is any set 
of elements of K(x) satisfying the equations 

o = f°(x) + Y t (dfjdx t )u i9 

there is one and only one derivation D* of K(x) coinciding with D on K , 
and such that D*x * = for every i. 

Proof. The necessity has been shown above. Conversely, if g(x\ h(x) are 
in K\_x\ , and h(x) # 0, one verifies immediately that the mapping D* defined 
by the formulas 

D*g(x) = g D (x) + 


D*(g/h) = 


hD*g — gD*h 
h 2 ’ 


is well defined and is a derivation of K(x). 

Consider the special case where (x) consists of one element x. Let Dbea 
given derivation on K. 

Case 1. x is separable algebraic over K. Let f(X) be the irreducible 
polynomial satisfied by x over K. Then f'(x) # 0. We have 

0 = / D (x)+/'(x)u, 

whence u = —f D (x)/f'(x). Hence D extends to K(x) uniquely. If D is trivial 
on K , then D is trivial on K(x). 

Case 2. x is transcendental over K. Then D extends, and u can be 
selected arbitrarily in K(x). 

Case 3. x is purely inseparable over K, so x p — a = 0, with ae K. Then 
D extends to K(x) if and only if Da = 0. In particular if D is trivial on K , 
then u can be selected arbitrarily. 


Proposition 5.2. A finitely generated extension K(x) over K is separable 
algebraic if and only if every derivation D of K(x) which is trivial on K is 
trivial on K(x). 

Proof. If K(x) is separable algebraic over K , this is Case 1. Conversely, 
if it is not, we can make a tower of extensions between K and K(x% such 
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that each step is covered by one of the three above cases. At least one step 
will be covered by Case 2 or 3. Taking the uppermost step of this latter 
type, one sees immediately how to construct a derivation trivial on the 
bottom and nontrivial on top of the tower. 

Proposition 5.3. Given X and elements (x) = (x í9 ..., x n ) in some extension 

field , assume that there exist n polynomials fi e K[X~\ such that: 

(i) fi(x) = 0, and 

(ii) det (dfi/ôXj) # 0. 

Then (x) is separably algebraic over K. 

Proofi Let D be a derivation on K(x ), trivial on K. Having /*(x) = 0 we 
must have Dfi(x) = 0, whence the Dx t satisfy n linear equations such that the 
coefficient matrix has non-zero determinant. Hence Dx £ = 0, so D is trivial 
on K(x). Hence X(x) is separable algebraic over K by Proposition 5.2. 

The following proposition will follow directly from Cases 1 and 2. 

Proposition 5.4. Let K = k(x) be a finitely generated extension of k. An 

element z of K is in K p k if and only if every derivation D of K over k is 

such that Dz = 0. 

Proof. If z is in K p k , then it is obvious that every derivation D of K 
over k vanishes on z. Conversely, if z £ K p k , then z is purely inseparable 
over K p k , and by Case 3 of the extension theorem, we can find a derivation 
D trivial on K p k such that Dz = 1. This derivation is at first defined on the 
field K p k(z). One can extend it to K as follows. Suppose there is an element 
vv e K such that vv £ K p k(z). Then w p e K p k , and D vanishes on vv p . We can 
then again apply Case 3 to extend D from K p k(z) to K p k(z , vv). Proceeding 
stepwise, we finally reach X, thus proving our proposition. 

The derivations D of a field K form a vector space over K if we define zD 
for z e K by (zD)(x) = zDx. 

Let K be a finitely generated extension of /c, of dimension r over k. We 
denote by £) the X-vector space of derivations D of K over k (derivations of 
K which are trivial on k). For each z e X, we have a pairing 

(D, z) h-► Dz 

of (£), X) into X. Each element z of X gives therefore a X-linear functional 
of $). This functional is denoted by dz. We have 

d(yz) = y dz + z dy, 
d(y -f z) = dy -F dz. 

These linear functionals form a subspace $ of the dual space of 3), if we 
define y dz by (Z>, y dz) i—► yDz. 



372 TRANSCENDENTAL EXTENSIONS 


VIII, §5 


Proposition 5.5. Assume that K is a separably generated and finitely 
generated extension of k of transcendence degree r. Then the vector space 
D (over K) of derivations of K over k has dimension r. Elements t l9 ..., t r 
of K from a separating transcendence base of K over k if and only if 
dt u ..., dt r form a basis of the dual space of D over K. 

Proof. If tt r is a separating transcendence base for K over fc, then 
we can find derivations D l9 ..., D r of K over fc such that D,íy = ô ij9 by Cases 
1 and 2 of the extension theorem. Given D g£), let vv, = Dt^. Then clearly 
D = Yj vv, A, an d so the D t form a basis for £) over K , and the dt { form the 
dual basis. Conversely, if dt ì9 ..., dt r is a basis for 2F over K , and if K is not 
separably generated over fc(í), then by Cases 2 and 3 we can find a derivation 
D which is trivial on k(t) but nontrivial on K. If D x , ..., D r is the dual basis 
of dt ì9 ..., dt r (so Ditj = ôij) then D, D í9 ..., D r would be linearly independent 
over K , contradicting the first part of the theorem. 

Corollary 5.6. Let K be a finitely generated and separably generated 
extension of k. Let z be an element of K transcendental over k. Then K is 
separable over k(z) if and only if there exists a derivation D of K over k 
such that Dz # 0. 

Proof. If K is separable over fc(z), then z can be completed to a separat- 
ing base of K over fc and we can apply the proposition. If Dz ^ 0, then 
dz # 0, and we can complete dz to a basis of ÍF over K. Again from the 
proposition, it follows that K will be separable over fc(z). 

Note. Here we have discussed derivations of fields. For derivations in 
the context of rings and modules, see Chapter XVI. 

As an application, we prove: 

Theorem 5.7. (Zariski-Matsusaka). Let K be a finitely generated sepa- 
rable extension of a field k. Let y, z e K and z $ K p k if the characteristic 
is p > 0. Let u be transcendental over K , and put k u = fc(u), K u = K(u). 

(a) For all except possibly one value of c e fc, K is a separable extension of 
k(y + cz). Furthermore , K u is separable over k u (y + uz). 

(b) Assume that K is regular over fc, and that its transcendence degree is at 
least 2. Then for all but a finite number of elements c e fc, K is 
a regular extension of k(y + cz). Furthermore , K u is regular over 
k u (y + uz). 

Proofi We shall use throughout the fact that a subfield of a finitely 
generated extension is also finitely generated (see Exercise 4). 

If vv is an element of K , and if there exists a derivation D of K over 
fc such that Dvv # 0, then K is separable over fc(vv), by Corollary 5.6. Also 
by Corollary 5.6, there exists D such that Dz # 0. Then for all elements 
c e fc, except possibly one, we have D(y + cz) = Dy + cDz # 0. Also we 
may extend D to K u over k u by putting Du = 0, and then one sees that 
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D(y + uz)= Dy + uDz # 0, so K is separable over k(y + cz) except possibly 
for one value of c, and K u is separable over k u (y + uz). In what follows, 
we assume that the constants c í9 c 2 , ... are diíferent from the exceptional 
constant, and hence that K is separable over k(y + c t z) for i = 1, 2. 

Assume next that K is regular over k and that the transcendence degree 
is at least 2. Let £ f = k(y -f c f z) (i = 1, 2) and let E[ be the algebraic closure 
of Ei in K. We must show that E[ = L, for all but a finite number of 
constants. Note that k(y 9 z) = E^E^ is the compositum of E { and £ 2 , and 
that k(y , z) has transcendence degree 2 over k. Hence E[ and E' 2 are free 
over k. Being subfields of a regular extension of fc, they are regular over fc, 
and are therefore linearly disjoint by Theorem 4.12. 


K 




By construction, E[ and E 2 are finite separable algebraic extensions of 
and E 2 respectively. Let L be the separable algebraic closure of fc(y, z) in K. 
There is only a finite number of intermediate fields between fc(y, z) and L. 
Furthermore, by Proposition 3.1 the fields E[(y, z) and E 2 (y 9 z) are linearly 
disjoint over k(y 9 z). Let Cj range over the finite number of constants which 
will exhaust the intermediate extensions between L and k(y 9 z) obtainable by 
lifting over k(y 9 z) a field of type E[. If c 2 is now chosen different from any 
one of these constants c í9 then the only way in which the condition of linear 
disjointness mentioned above can be compatible with our choice of c 2 is that 
E 2 (y 9 z) = k(y 9 z) 9 i.e. that E 2 = k(y + c 2 z). This means that k(y -f c 2 z) is 
algebraically closed in K 9 and hence that K is regular over k(y -f c 2 z). 

As for K U9 let u ì9 u l9 ... be infinitely many elements algebraically indepen- 
dent over K. Let fc' = k(u í9 u 2 ,...) and K' = K(u ì9 u 2 , ...) be the fields 
obtained by adjoining these elements to fc and K respectively. By what has 
already been proved, we know that K' is regular over k'(u 4- u { z) for all 
but a finite number of integers i 9 say for i = 1. Our assertion (a) is then 
a consequence of Corollary 4.14. This concludes the proof of Theorem 5.7. 
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Theorem 5.8. Let K = fc(x l5 .x n ) = k(x) be a finitely generated regular 
extension of a field fc. Let u í9 ... 9 u n be algebraically independent over 
k(x). Let 

1 = “iXi + *•* + u n x nì 

and let k u = k(u í9 ... 9 u n , u n+1 ). Then k u (x) is separable over k u9 and if the 
transcendence degree of k(x) over k is ^ 2, then k u (x) is regular over k u . 

Proofi By the separability of k(x) over fc, some x t does not lie in K p k, 
say x n $ K p k. Then we take 

y = MjX^ + ••• + and z = x n , 

so that u n+1 = y + u n z, and we apply Theorem 5.7 to conclude the proof. 

Remark. In the geometric language of the next chapter, Theorem 5.8 
asserts that the intersection of a fc-variety with a generic hyperplane 

Ui X t H-+ u n X n — u n+1 = 0 

is a fc u -variety, if the dimension of the fc-variety is ^2. In any case, the 
extension fc u (x) is separable over k u . 


EXERCISES 

1. Prove that the complex numbers have infìnitely many automorphisms. [ Hint: 
Use transcendence bases.] Describe all automorphisms and their cardinality. 

2. A subfield fc of a field K is said to be algebraically closed in K if every element of 
K which is algebraic over k is contained in fc. Prove: If fc is algebraically closed 
in K , and K, L are free over fc, and L is separable over fc or K is separable over 
fc, then L is algebraically closed in KL. 

3. Let fc c: E c K be extension fields. Show that 

tr. deg. (K/k) = tr. deg. (K/E) + tr. deg. (L/fc). 

If {xj is a transcendence base of E/k, and {yj} is a transcendence base of K/E, 
then {x f , yfi is a transcendence base of K/k. 

4. Let K/k be a finitely generated extension, and let K => E => fc be a subextension. 
Show that E/k is finitely generated. 

5. Let fc be a field and fc(Xi,..., x n ) = fc(x) a finite separable extension. Let 
tíi,...,u H be algebraically independent over fc. Let 

vv = u^Xi + •• • + u n x n . 

Let fc u = k(u l9 ..., u n ). Show that fc u (w) = fc u (x). 
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6. Let k(x) = l c(x j,..., x„) be a separable extension of transcendence degree r ^ 1. 
Let Uìj ; ( i = 1,..., r ; j = 1,..., n) be algebraically independent over k(x). Let 

n 

y- = X u y x y- 

j = l 

Let k u = /c(M 0 ) allitJ - 

(a) Show that /c tí (x) is separable algebraic over k(y l9 ...,y r ). 

(b) Show that there exists a polynomial P(u)6k[u] having the following prop- 
erty. Let (c) = (c 0 ) be elements of k such that P(c) # 0. Let 

n 

y'i = X c a x i- 

7=1 

Then k(x) i^s separable algebraic over k(y'). 

1. Let k be a field and k[x í9 ..., x M ] = R a finitely generated entire ring over k with 
quotient field k(x). Let L be a finite extension of fc(x). Let I be the integral 
closure of R in L. Show that I is a finite jR-module. [Use Noether normalization, 
and deal with the inseparability problem and the separable case in two steps.] 

8. Let D be a derivation of a field K. Then D n : K -* K is a linear map. Let 
P„ = Ker D n , so P„ is an additive subgroup of K. An element x e K is called a 
logarithmic derivative (in K) if there exists y e K such that x = Dy/y. Prove: 

(a) An element xe K is the logarithmic derivative of an element y e P n but 
y í P n ~i (n> 0) if and only if 

(D + x)"(l) = 0 and (D -h x)" -1 (l) ^ 0. 

(b) Assume that K = (J P n , i.e. given xe K then x e P n for some n > 0. Let F be 
a subfield of K such that DF cz F. Prove that x is a logarithmic derivative in 
F if and only if x is a logarithmic derivative in K. [ Hint: If x = Dy/y then 
(D ■f x) = y~ l D o y and conversely.] 

9. Let k be a field of characteristic 0, and let z í9 ..., z r be algebraically independent 
over k. Let (e^), i = 1, ..., m and j = 1, ..., r be a matrix of integers with r ^ m, 
and assume that this matrix has rank m. Let 

vv f = z\ il • • • z r ir for i = 1, ..., m. 

Show that Wj, ..., w m are algebraically independent over k. [Hint: Consider the 
K-homomorphism mapping the K -space of derivations of K/k into K {r) given by 

Dv-+(Dz x /z u ... 9 Dz r /z r \ 

and derive a linear condition for those D vanishing on k(w í9 ..., vv m ).] 

10. Let /c, (z) be as in Exercise 9. Show that if P is a rational function then 

d(P(z)) = grad P(z)-dz, 

using vector notation, i.e. dz = (dz l9 ..., dz r ) and grad P = (D X P 9 ..., D r P). Define 
d log P and express it in terms of coordinates. If P, Q are rational functions in 
k(z) show that 


d log (PQ) = d log P -I- d log Q. 




CHAPTER IX_ 

Algebraic Spaces 


This chapter gives the basic results concerning solutions of polynomial equa- 
tions in several variables over a field k. First it will be proved that if such 
equations have a common zero in some field, then they have a common zero in 
the algebraic closure of k , and such a zero can be obtained by the process known 
as specialization. However, it is useful to deal with transcendental extensions 
of k as well. Indeed, if p is a prime ideal in k[X] = k[X ,,. . . , X n ] f then 
k[X]/p is a finitely generated ring over k, and the images x t of X t in this ring 
may be transcendental over k, so we are led to consider such rings. 

Even if we want to deal only with polynomial equations over a field, we are 
led in a natural way to deal with equations over the integers Z. Indeed, if the 
equations are homogeneous in the variables, then we shall prove in §3 and §4 
that there are universal polynomials in their coefficients which determine whether 
these equations have a common zero or not. “Universal” means that the coef- 
ficients are integers, and any given special case comes from specializing these 
universal polynomials to the special case. 

Being led to consider polynomial equations over Z, we then consider ideals 
û in Z[X]. The zeros of such an ideal form what is called an algebraic space. If 
p is a prime ideal, the zeros of p form what is called an arithmetic variety. We 
shall meet the first example in the discussion of elimination theory, for which 
I follow van der Waerden’s treatment in the first two editions of his Moderne 
Algebra , Chapter XI. 

However, when taking the polynomial ring Z[X]/a for some ideal a, it usually 
happens that such a factor ring has divisors of zero, or even nilpotent elements. 
Thus it is also natural to consider arbitrary commutative rings, and to lay the 
foundations of algebraic geometry over arbitrary commutative rings as did Groth- 
endieck. We give some basic definitions for this purpose in §5. Whereas the 
present chapter gives the flavor of algebraic geometry dealing with specific 
polynomial ideals, the next chapter gives the flavor of geometry developing from 
commutative algebra, and its systematic application to the more general cases 
just mentioned. 


377 



378 ALGEBRAIC SPACES 


IX, §1 

The present chapter and the next will also serve the purpose of giving the 
reader an introduction to books on algebraic geometry, notably Hartshorne’s 
systematic basic account. For instance, I have included those results which are 
needed for Hartshorne’s Chapter I and II. 


§1. HILBERT'S NULLSTELLENSATZ 

The Nullstellensatz has to do with a special case of the extension theorem 
for homomorphisms, applied to finitely generated rings over fields. 

Theorem 1.1. Let k be a field , and let /c[x] = k\_x u ..., xJ be a finitely 
generated ring over k. Let (p: k -* L be an embedding of k into an alge- 
braically closed field L. Then there exists an extension of (p to a homo- 
morphism of /c[x] into L. 

Proof. Let 3JÎ be a maximal ideal of /c[x]. Let o be the canonical homo- 
morphism o : /c[x] -► /c[x]/9W. Then ok[ox Ì9 ..., ox „] is a fìeld, and is in fact 
an extension field of ok. If we can prove our theorem when the fìnitely generated 
ring is in fact a field, then we apply (p © o~ 1 on ok and extend this to a homo- 
morphism of ok[ox u ..., oxf\ into L to get what we want. 

Without loss of generality, we therefore assume that /c[x] is a fìeld. If it is 
algebraic over /c, we are done (by the known result for algebraic extensions). 
Otherwise, let t x ,...,t r be a transcendence basis, r ^ 1. Without loss of 
generality, we may assume that (p is the identity on /c. Each element x l9 ..., x n 
is algebraic over k(t l9 ..., t r ). If we multiply the irreducible polynomial 
Irr(x, , /c(í), X ) by a suitable non-zero element of k[f] 9 then we get a polynomial 
all of whose coefficients lie in k[t~]. Let a x (t\ ..., a n (t) be the set of the leading 
coefficients of these polynomials, and let a(t) be their product, 


a(t) = a t (t) ■ ■ ■ a„(t). 

Since a(t) 4 0, there exist elements t\ 9 ... , t' r e k a such that a(t') 4 0, and 
hence aft') 4 0 for any i. Each x { is integral over the ring 



Consider the homomorphism 

(p:k[t u ... 9 t r ] -► k a 

such that (p is the identity on /c, and (p(tj) = t). Let p be its kernel. Then a(t) $ p. 
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Our homomorphism cp extends uniquely to the local ring k[t\ and by the 
preceding remarks, it extends to a homomorphism of 

^EOpE-^l» • • • > 

into k a , using Proposition 3.1 of Chapter VII. This proves what we wanted. 

Corollary 1.2. Let k be a ýield and k[x í9 ..., x„] a finitely generated ex- 
tension ring of k. If /c[x] is a field, then /c[x] is algebraic over k. 

Proof. All homomorphisms of a field are isomorphisms (onto the image), 
and there exists a homomorphism of /c[x] over k into the algebraic closure of k. 

Corollary 1.3. Let k[x x , ..., xj be a finitely generated entire ring over a 
field k, and let y x ,..., y m be non-zero elements of this ring. Then there exists 
a homomorphism 


i/j : /c[x] /c a 

over k such that ^(y 7 ) # 0 for all j = 1, ..., m. 

Proof. Consider the ring k[x u ..., x„, yï *,..., y~ *] and apply the 
theorem to this ring. 

Let S be a set of polynomials in the polynomial ring k[X ..., Xf\ in n 
variables. Let L be an extension field of k. By a zero of S in L one means an 
n-tuple of elements (c l9 ..., c„) in L such that 

f(c í ,...,c n ) = 0 

for all f e S. If S consists of one polynomial/ then we also say that (c) is a zero 
of/ The set of all zeros of S is called an algebraic set in L (or more accurately 
in L (n) ). Let a be the ideal generated by all elements of S. Since S c= a it is clear 
that every zero of û is also a zero of S. However, the converse obviously holds, 
namely every zero of S is also a zero of a because every element of a is of type 

gÁX)MX) + • • • + g m (X)f m (X) 

with e S and g ( e /c[A"]. Thus when considering zeros of a set S, we may 
just consider zeros of an ideal. We note parenthetically that every ideal is 
finitely generated, and so every algebraic set is the set of zeros of a finite number 
of polynomials. As another corollary of Theorem 1.1, we get: 

Theorem 1.4. Let a be an ideal in /c[A"] = k[X ì ,...,X n ]. Then either 
a = k[ A"] or a has a zero in /c a . 
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Proof. Suppose û # k[X ]. Then û is contained in some maximal ideal 
m, and /c[A"]/m is a field, which is a finitely generated extension of /c, because 
it is generated by the images of X u ... , X n mod m. By Corollary 2.2, this 
field is algebraic over /c, and can therefore be embedded in the algebraic closure 
/c a . The homomorphism on k[X] obtained by the composition of the canonical 
map mod m, followed by this embedded gives the desired zero of a, and con- 
cludes the proof of the theorem. 

In §3 we shall consider conditions on a family of polynomials to have a 
common zero. Theorem 1.4 implies that if they have a common zero in some 
field, then they have a common zero in the algebraic closure of the field generated 
by their coefficients over the prime field. 

Theorem 1.5. (Hilbert’s Nullstellensatz). Let a be an ideal in k\X ]. Let 

fbeapolynomial in k[X~\ such thatf{c) = 0 for every zero ( c) = (c í9 ..., c n ) 

of a in k a . Then there exists an integer m > 0 such that f m e a. 

Proof. We may assume that / ^ 0. We use the Rabinowitsch trick of 
introducing a new variable T, and of considering the ideal a' generated by 
a and 1 — Yf in /c[ X, Y\. By Theorem 1.4, and the current assumption, the 
ideal a' must be the whole polynomial ring k[X, T], so there exist polynomials 
gi e k[X, Y\ and h ( e a such that 


1 - 0oO ~ Xf) + 01^1 + * * * + 9rK- 

We substitute / _1 for Y and multiply by an appropriate power / m of / to 
clear denominators on the right-hand side. This concludes the proof. 

For questions involving how effective the Nullstellensatz can be made, see 
the following references also related to the discussion of elimination theory 
discussed later in this chapter. 
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§2. ALGEBRAIC SETS, SPACES AND VARIETIES 

We shall make some very elementary remarks on algebraic sets. Let k be a 
field, and let A be an algebraic set of zeros in some fixed algebraically closed 
extension field of k. The set of all polynomials / e k[X í9 ..., Xf\ such that 
f(x) = 0 for all (x) e A is obviously an ideal û in /c[A"], and is determined by 
A. We shall call it the ideal belonging to /1, or say that it is associated with A. 
If A is the set of zeros of a set S of polynomials, then S c= a, but û may be bigger 
than S. On the other hand, we observe that A is also the set of zeros of û. 

Let A, B be algebraic sets, and û, b their associated ideals. Then it is clear 
that A c= B if and only if û => b. Hence A = B if and only if û = b. This has an 
important consequence. Since the polynomial ring /c[AJ is Noetherian, it 
follows that algebraic sets satisfy the dual property, namely every descending 
sequence of algebraic sets 


y4j A 2 => * * ‘ 


must be such that A m = A m+Í = • • • for some integer m , i.e. all A v are equal for 
v ^ m. Furthermore, dually to another property characterizing the Noetherian 
condition, we conclude that every non-empty set of algebraic sets contains a 
minimal element. 

Theorem 2.1. The finite union and the finite intersection of algebraic sets 
are algebraic sets. If A,B are the algebraic sets of zeros of ideals û, b, respec- 
tively , then A u B is the set of zeros of a nb and A n B is the set of zeros of 
(o, b). 

Proof. We first consider A u B. Let (x) e A kj B. Then (x) is a zero 
of û n b. Conversely, let (x) be a zero of û n b, and suppose (x) £ A. There 
exists a polynomial f e a such that /(x) / 0. But ab c= a n b and hence 
(fg)(x) = 0 for all g eb, whence g(x) = 0 for all g eb. Hence (x) lies in B , and 
A u B is an algebraic set of zeros of a n b. 

To prove that A n B is an algebraic set, let (x) e A n B. Then (x) is a zero 
of (a, b). Conversely, let (x) be a zero of (a, b). Then obviously (x) e A n B , as 
desired. This proves our theorem. 

An algebraic set V is called /c-irreducible if it cannot be expressed as a union 
V = A u B of algebraic sets A, B with A , B distinct from V. We also say ir- 
reducible instead of A:-irreducible. 

Theorem 2.2. Let A be an algebraic set. 

(i) Then A can be expressed as a finite union of irreducible algebraic sets 
A = Vj U ... U V r . 

(ii) Ifthere is no inclusion relation among the V it i.e. ifV^ Vj for i 4=- j, then 
the representation is unique. 
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(iii) Let W, V ìf , V r be irreducible algebraic sets such that 

W C V x U . . . U V r . 

Then W C V- for some i. 

Proof. We first show existence. Suppose the set of algebraic sets which 
cannot be represented as a finite union of irreducible ones is not empty. Let 

V be a minimal element in its. Then V cannot be irreducible, and we can write 

V — Akj B where A, B are algebraic sets, but A # V and B # V. Since each 
one of A, B is strictly smaller than V, we can express A , B as finite unions of 
irreducible algebraic sets, and thus get an expression for V, contradiction. 

The uniqueness will follow from (iii), which we prove next. Let W be con- 
tained in the union Vj U ... U V r . Then 

w = (w n v,) u ... u (w n v r ). 

Since each W D V t is an algebraic set, by the irreducibility of W we must have 
W = W fl V- for some i. Hence W C V t for some i, thus proving (iii). 

Now to prove (ii), apply (iii) to each Wj. Then for each j there is some i such 
that Wj C V t . Similarly for each i there exists v such that V { C W v . Since there 
is no inclusion relation among the Wf s, we must have Wj = V- = W v . This proves 
that each Wj appears among the V^’s and each V t appears among the Wf s, and 
proves the uniqueness of the representation. It also concludes the proof of Theo- 
rem 2.2. 

Theorem 2.3 An algebraic set is irreducible ifand only ifits associated ideal 
is prime. 

Proof. Let V be irreducible and let p be its associated ideal. If p is not 
prime, we can find two polynomials /, g e k[X] such that/^p , g £ p, but 
fg e p. Let a = (p,/) and b = (p, g). Let A be the algebraic set of zeros of û, 
and B the algebraic set of zeros of b. Then A C V, A =f= V and B C V, B i= V. 
Furthermore A U B = V. Indeed, A U B C V trivially. Conversely, let (jc) e V. 
Then (fg)(x) = 0 implies/(x) or g(x) = 0. Hence (jc) e A or (jc) e B, proving 

V = A U B, and V is not irreducible. Conversely, let V be the algebraic set 
of zeros of a prime ideal p. Suppose V = A U B with A V and B =/= V. 
Let û, b be the ideals associated with A and B respectively. There exist poly- 
nomials /e û,/^ p and g e b, g £ p. But fg vanishes on A U B and hence lies 
in p, contradiction which proves the theorem. 

Warning. Given a field k and a prime ideal p in k[X\, it may be that the 
ideal generated by p in k*[X] is not prime, and the algebraic set defined over k a 
by pk*[X] has more than one component, and so is not irreducible. Hence the 
prefix referring to k is really necessary. 

It is also useful to extend the terminology of algebraic sets as follows. Given 
an ideal a C k[X], to each field K containing k we can associate to û the set 
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^(À') consisting of the zeros of û in K. Thus 3£ a is an association 

3£ a : K 2£ a (ff) C & n \ 

We shall speak of 3? 0 itself as an algebraic space, so that 3 Í Q is not a set, but 
to each field K associates the set 3 í a (K ). Thus 3 Í Q is a functor from extensions 
K of k to sets (functorial with respect to field isomorphisms). By a k-variety we 
mean the algebraic space associated with a prime ideal p. 

The notion of associated ideal applies also to such 3Ç Q , and the associated 
ideal of 3 i Q is also rad(û). We shall omit the subscript a and write simply 3£ for 
this generalized notion of algebraic space. Of course we have 

Gf — Gf 

rad(û)* 

We say that 3E a (AT) is the set of points of 3£ a in K. By the Hilbert Nullstellensatz, 
Theorem 1.1, it follows that if K C K' are two algebraically closed fields 
containing k , then the ideals associated with ^ a {K) and 3Ê Û (A'') are equal to each 
other, and also equal to rad(a). Thus the smallest algebraically closed field k a 
containing k already determines these ideals. However, it is also useful toconsider 
larger fields which contain transcendental elements, as we shall see. 

As another example, consider the polynomial ring k[X ì9 . . . , X n ] = k[X]. 
Let \ n denote the algebraic space associated with the zero ideal. Then A n 
is called affìne n-space. Let ^ be a field containing k. For each n-tuple 
(Cj,. . . , c n ) E K (n) we get a homomorphism 

<p: k[X l9 ... 9 X n ]-*K 

such that <p{Xi) = c { for all i. Thus points in A n (£) correspond bijectively to 
homomorphisms of k{X) into K. 

More generally, let V be a k-variety with associated prime ideal p. Then 
k[X]/p is entire. Denote by £ z the image ofX, under the canonical homomorphism 
k[X] —» k[X]/p. We call (£) the generic point of V over k. On the other hand, 
let (jc) be a point of V in some field K. Then p vanishes on (at), so the homomor- 
phism <p : k[X] —» k[x] sending X z i-» jc z factors through k[X]/p = k[Ç], whence 
we obtain a natural homomorphism k[Ç\ —» k[x]. If this homomorphism is an 
isomorphism, then we call {x) a generic point of V in K. 

Given two points (x) e A"^) and {x') e A n {K'), we say that (jc') is a 
specialization of (jc) (over k) if the map jc z i-» x\ is induced by a homomorphism 
k[x] —» k[x']. From the definition of a generic point of a variety, it is then 
immediate that: 

A variety V is the set of specializations of its generic point, or of a generic 

point. 

In other words, V{K) is the set of specializations of (£) in K for every field K 
containing k. 

Let us look at the converse construction of algebraic sets. Let (x) = 
(jc^, . . . , x n ) be an n-tuple with coordinates jc z e K for some extension field 
K of k. Let p be the ideal in k[X] consisting of all polynomials f{X) such that 
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f(x ) = 0. We call p the ideal vanishing on (jc). Then p is prime, because if 
fg e p so f(x)g(x) = 0, then feporgep since K has no divisors of 0. Hence 
3£ p is a fc-variety V, and (jt) is a generic point of V over k because k[X]/p~ k[x]. 

For future use, we state the next result for the polynomial ring over a factorial 
ring rather than over a field. 

Theorem 2.4. LetR be a factorial ring, and letW l ,...,W m bem independent 
variables over its quotient field k. Let k(w x , ..., w m ) be an extension of tran- 
scendence degree m — 1. Then the ideal in R[W] vanishing on (vv) is principal. 

Proof By hypothesis there is some polynomial P(W) e /?[W] of degree 
^ 1 vanishing on (w), and after taking an irreducible factor we may assume 
that this polynomial is irreducible, and so is a prime element in the factorial ring 
R[W]. Let G(W) e R[W] vanish on (vv). To prove that P divides G, after selecting 
some irreducible factor of G vanishing on (vv) if necessary, we may assume 
without loss of generality that G is a prime element in R[W]. One of the variables 
Wj occurs in P(W ), say W m , so that vv m is algebraic over k(w u . . ., w m _,). Then 
(w 1? . . ., w m _ t ) are algebraically independent, and hence W m also occurs in 
G. Furthermore, P(w lf ..., w m _ 1? W m ) is irreducible as a polynomial in 
k(w x ,..., by the Gauss lemma as in Chapter IV, Theorem 2.3. 

Hence there exists a polynomial H(W m ) e k(w Ì9 ... , w m _j)[W m ] such that 

G(W) = H(WJP(W). 

Let R' = R[w ì9 ..., w m _J. Then P, G have content 1 as polynomials in 
R f [W m ]. By Chapter IV Corollary 2.2 we conclude that H e R'[WJ /?[W], 
which proves Theorem 2.4. 

Next we consider homogeneous ideals and projective space. A polynomial 
f(X) e k[X] can be written as a linear combination 

f(X) = 2 C {V) M W (X) 

with monomials M {v) (X) = X\ x • • • Xf n n and c (v) e k. We denote the degree of 
M (v) by 

| v\ = deg M (v) = 2 v r 

If in this expression for / the degrees of the monomials XS v) are all the same 
(whenever the coefficient c (v) is 0), then we say that/is a form, or also that 
/ is a homogeneous (of that degree). An arbitrary polynomial f(X) in K[X] can 
also be written 

f(X) = 2f w (X), 

where each f (d) is a form of degree d (which may be 0). We call f {d) the 
homogeneous part of /of degree d. 

An ideal û of k[X] is called homogeneous if whenever f e a then each 
homogeneous part/^ also lies in a. 
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Proposition 2.5. An ideal û is homogeneous if and only if û has a set of 
generators over k[X] consisting offorms. 

Proof Suppose a is homogeneous and that / r are generators. fiy 

hypothesis, for each integer d ^ 0 the homogeneous components / (í/) also lie in 
û, and the set of such ff^ (for all i, d) form a set of homogeneous generators. 
Conversely, let /be a homogeneous element in a and let g e K[X] be arbitrary. 
For each d , g (d) f lies in a, and g (d) f is homogeneous, so all the homogeneous 
components of gf also lie in a. Applying this remark to the case when/ ranges 
over a set of homogeneous generators for a shows that a is homogeneous, and 
concludes the proof of the proposition. 

An algebraic space is called homogeneous if for every point (jc) e 2Ê and 
t transcencental over k(x), the point (tx) also lies in . If t, u are transcendental 
over k(x), then there is an isomorphism 

k[x , t] ■—> k[x , u] 

which sends t on u and restricts to the identity on k[x], so to verify the above 
condition, it suffices to verify it for some transcendental t over k(x). 

Proposition 2.6. An algebraic space is homogeneous if and only if its 
associated ideal a is homogeneous. 

Proof. Suppose is homogeneous. Let f(X) e k[X ] vanish on 2E . For each 
(x) e 3 í and t transcendental over k(x) we have 

0 = f(x) = f(tx) = 2 t d f (d) (x). 

d 

Therefore f (d) (x) = 0 for all d, ^hence/^ e a for all d. Hence a is homogeneous. 
Conversely, suppose a homogeneous. By the Hilbert Nullstellensatz, we know 
that 2£ consists of the zeros of a, and hence consists of the zeros of a set of 
homogeneous generators for a. But if/is one of those homogeneous generators 
of degree d, and (x) is a point of , then for t transcendental over k(x) we have 

0 =f(x) = t d f(x) =f(tx), 

so (tx) is also a zero of a. Hence 2£ is homogeneous, thus proving the proposition. 

Proposition 2.7. Let be a homogeneous algebraic space. Then each irre- 
ducible component V of is also homogeneous. 

Proof. Let V = V x ,..., V r be the irreducible components of 3£, without 
inclusion relation. By Remark 3.3 we know that V x (£ V 2 U . . . U V r , so there 
is apoint (jc) e Vj such that (jt) ^ V, for i = 2,... , r. By hypothesis, for t transcen- 
dental over k(x) it follows that (tx) e so (tx) e V ( for some i. Specializing to 
t = 1, we conclude that (jt) e V i9 so i = 1, which proves that Vi is homoge- 
neous, as was to be shown. 

Let V be a variety defined over k by a prime ideal p in k[X]. Let (jt) be a 
generic point of V over k. We say that (jt) is homogeneous (over k) if for t 
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transcendental over the point ( tx) is also a point of V, or in other words, 
(tx) is a specialization of ( x). If this is the case, then we have an isomorphism 

k[x ì9 .. •, x n ] « k[tx ì9 ..., tx n ] 9 

which is the identity on k and sends jq on tx r It then follows from the preceding 
propositions that the following conditions are equivalent for a variety V over k: 

V is homogeneous. 

The prime ideal ofV in k[X] is homogeneous. 

A generic point of V over k is homogeneous. 

A homogeneous ideal always has a zero, namely the origin (0), which will 
be called the trivial zero. We shall want to know when a homogeneous algebraic 
set has a non-trivial zero (in some algebraically closed field). For this we introduce 
the terminology of projective space as follows. Let (jc) be some point in A n and 
À an element of some field containing k(x). Then we denote by (àjc) the point 
(àjcj, . . . , àjc„). Twopoints (jc), (y) e A n (K) for some field K are called equivalent 
if not all their coordinates are 0, and there exists some element À e K, à 0, 
such that (àjc) = (y). The equivalence classes of such points in A n (K) are called 
the points of projective space in K. We denote this projective space by P 71 " 1 , 
and the set of points of projective space in K by P /I_ 1 (AT). We define an algebraic 
space in projective space to be the non-trivial zeros of a homogeneous ideal, 
with two zeros identified if they differ by a common non-zero factor. 

Algebraic spaces over rings 

As we shall see in the next section, it is not sufficient to look only at ideals 
in k[X] for some field k. Sometimes, even often, one wants todeal withpolynomial 
equations over the integers Z, for several reasons. In the example of the next 
sections, we shall find universal conditions over Z on the coefficients of a system 
of forms so that these forms have a non-trivial common zero. Furthermore, in 
number theory—diophantine questions—one wants to consider systems of equa- 
tions with integer coefficients, and to determine solutions of these equations in 
the integers or in the rational numbers, or solutions obtained by reducing mod 
p for a prime p. Thus one is led to extend the notions of algebraic space and 
variety as follows. Even though the applications of the next section will be over 
Z, we shall now give general definitions over an arbitrary commutative ring R. 

Let f(X) e R[X] = AtXj,..., X n ] be a polynomial with coefficients in R. 
Let R A be an A-algebra, by which for the rest of this chapter we mean a 
homomorphism of commutative rings. We obtain a corresponding homomorphism 

R[X] A[X] 

on the polynomial rings, denoted by/ f A whereby the coefficients of f A are 
the images of the coefficients of /under the homomorphism R A. By a zero 
of/in A we mean a zero of f A in A. Similarly, let S be a set of polynomials in 
R[X]. By a zero of S in A we mean a common zero in A of all polynomials 
/ e S. Let a be the ideal generated by S in A[X]. Then a zero of S in A is also 
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a zero of û in A. We denote the set of zeros of S in A by 2£s(A), so that we have 

% S (A) = 3E a (A). 

We call 2 E q (ì 4) an algebraic set over R. Thus we have an association 

Z a (A) 

which to each R -algebra associates the set of zeros of û in that algebra. We note 
that /?-algebras form a category, whereby a morphism is a ring homomorphism 
(p:A-*A' making the following diagram commutative: 


R 



A 

<P 



A' 


Then it is immediately verified that 2£ fl is a functor from the category of R- 
algebras to the category of sets. Again we call 2£ a an algebraic space over R. 

If R is Noetherian, then R[X ] is also Noetherian (Chapter IV, Theorem 4.1), 
and so if a is an ideal, then there is always some finite set of polynomials S 
generating the ideal, so = 2 Í Q . 

The notion of radical of a is again defined as the set of polynomials 
h e /?[X] such that h N Ea for some positive integer N. Then the following state- 
ment is immediate: 

Suppose that R is entire. Then for every R-algebra R K with a field K, xve 

have 

2 î a (K) = 2E rad(û )W. 

We can define affìne space A" over R. Its points consist of all n-tuples 
(jtj,..., x n ) = (x) with Xj in some /?-algebra A. Thus A n is again an association 

A i—► A n (A) 

from /?-algebras to sets of points. Such points are in bijection with 
homormorphisms 

R[X] A 

from the polynomial ring over R into A. In the next section we shall limit ourselves 
to the case when A = K is a field, and we shall consider only the functor 
K i-» A n (K) for fields K. Furthermore, we shall deal especially with the case 
when R = Z, so Z has a unique homomorphism into a field K. Thus a field K 
can always be viewed as a Z-algebra. 

Suppose finally that R is entire (for simplicity). We can also consider projective 
space over R. Let a be an ideal in /?[X]. We define a to be homogeneous just as 
before. Then a homogeneous ideal in R[X] can be viewed as defining an algebraic 
subset in projective space P n (K) for each field K (as an /?-algebra). If R = Z, 
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then a defines an algebraic subset in P' I (Â r ) for every field K. Similarly, one can 
define the notion of a homogeneous algebraic space 3£ over R, and over the 
integers Z a fortiori. Propositions 2.6 and 2.7 and their proofs are also valid in 
this more general case, viewing 3£ = 3£ Q as a functor from fields K to sets P ,I (À r ). 

If a is a prime ideal p, then we call 3£ p an A-variety V. If R is Noetherian, 
so R[X] is Noetherian, it follows as before that an algebraic space 3£ over R is 
a finite union of A-varieties without inclusion relations. We shall carry this out 
in §5, in the very general context of commutative rings. Just as we did over a 
field, we may form the factor ring Z[X]/p and the image (jc) of (X) in this factor 
ring is called a generic point of V. 


§3. PROJECTIONS AND ELIMINATION 

Let (W) = (W u ... , W m ) and (X) = (X Ì9 ... , X n ) be two sets of independent 
variables. Then ideals in k[W, X] define algebraic spaces in the product space 
A m+n . Let a be an ideal in k[W, X]. Let a x = û fl k[W]. Let be the algebraic 
space of zeros of a and let 2t x be the algebraic space of zeros of a x . We have 
the projection 

pr \% m+n -^% m or pr: A m+n —> A m 

which maps a point (vv, jc) to its first set of coordinates (vv). It is clear that 
pr 3E C 3£,. In general it is not true that pr3 1= 3 t x . For example, the ideal pgen- 
erated by the single polynomial W\ - W 2 X x = 0 is prime. Its intersection with 
k[W x , W 2 ] is the zero ideal. But it is not true that every point in the affine 
(W x , W 2 )-space is the projection of a point in the variety 3£ p . For instance, the 
point (1, 0) is not the projection of any zero of p. One says in such a case that 
the projection is incomplete. We shall now consider a situation when such a 
phenomenon does not occur. 

In the first place, let p be a prime ideal in k[W, X] and let V be its variety 
of zeros. Let (vv, jc) be a generic point of V. Let p x = p n k[W]. Then (vv) is a 
generic point of the variety V x which is the algebraic space zeros of p x . This is 
immediate from the canonical injective homomorphism 

k[W]/p x -> k[W, X]/p. 

Thus the generic point (vv) of is the projection of the generic point (vv, jc) of 
V. The question is whether a special point (vv') of Vj is the projection of a point 
of V. 

In the subsequent applications, we shall consider ideals which are homo- 
geneous only in theX-variables, and similarly algebraic subsets which are homo- 
geneous in the second set of variables in A n . 
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An ideal û in k[W , X] which is homogeneous in (X) defines an algebraic space 
in A m x P" -1 . If V is an irreducible component of the algebraic set defìned by 
û, then we may view V as a subvariety of A m x P n_1 . Let pbe the prime ideal 
associated with V . Then p is homogeneous in (X). Let p^ = p n k[W]. We shall 
see that the situation of an incomplete projection mentioned previously is elim- 
inated when we deal with projective space. 

We can also consider the product A m x P w , defined by the zero ideal over 
Z. For each field K, the set of points of A m x P n in K is A m (K) x P n (K). An 
ideal û in Z [W, X], homogeneous in (X), defines an algebraic space 2£ = 2£ Q in 
A m x P n . We may form its projection 2 t x on the first factor. This applies in 
particular when û is a prime ideal p, in which case we call 2£ an arithmetic 
subvariety of A m x P w . Its projection Vj is an arithmetic subvariety of A m , 
associated with the prime ideal p x = pfl Z[W]. 

Theorem 3.1. Let (W) = (W x ,..., W m ) and (X) = (X,,..., X n ) be indepen- 
dent families of variables. Let p be a prime ideal in k[W, X] (resp. Z[W, X]) 
and assume p is homogeneous in (X). Let V be the corresponding irreducible 
algebraic space in A m x P"" 1 . Let p^ = p D k[W] (resp. p D Z[W]), and let 
V x be the projection of V on the fìrst factor. Then Vj is the algebraic space 
of zeros of p x in A m . 

Proof. Let V have generic point (vv, x). We have to prove that every zero 
(vv') of p x in a field is the projection of some zero (vv', x') of p such that not all 
the coordinates of (x') are equal to 0. By assumption, not all the coordinates of 
(jc) are equal to 0, since we viewed V as a subset of A m x P n_1 . For definiteness, 
say we are dealing with the case of a field k. By Chapter VII, Proposition 3.3, 
the homomorphism k[w ] —» k[w'] can be extended to a place <p of A:(vv, x). 
By Proposition 3.4 of Chapter VII, there is some coordinate Xj such that 
(p(Xi/xj) 00 for all i = 1,...,«. We let x[ = <p(Xi/xj) for all i to conclude the 
proof. The proof is similar when dealing with algebraic spaces over Z, replacing 
k by Z. 

Remarks. Given the point (vv') e A m , the point (vv', x') in A m x P' 1-1 may 
of course not lie in k(w'). The coordinates (jt') could even be transcendental 
over k(x'). By any one of the forms of the Hilbert Nullstellensatz, say Corollary 
1.3 of Theorem 1.1, we do know that (jc') could be found algebraic over k(w'), 
however. In light of the various versions of the Nullstellensatz, if a set of forms 
has a non-trivial common zero in some field, then it has a non-trivial common 
zero in the algebraic closure of the field generated by the coefficients of the 
forms over the prime field. In a theorem such as Theorem 1.2 below, the conditions 
on the coefficients for the forms to have a non-trivial common zero (or a zero 
in projective space) are therefore also conditions for the forms to have such a 
zero in that algebraic closure. 

We shall apply Theorem 3.1 to show that gi ven a finite family of homogeneous 
polynomials, the property that they have a non-trivial common zero in some 
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algebraically closed íîeld can be expressed in terms of a finite number of universal 
polynomial equations in their coefficients. We make this more precise as follows. 

Consider a finite set of forms (/) = (f Ì9 ..., f r ). Let d ì9 . .., d r be their 
degrees. We assume d { ^ 1 for i = 1,..., r. Each/• can be written 

(1) /i = 2w íiW W w CX) 

where M {V) (X) is a monomial in (X) of degree d h and w iM is a coefficient. We 
shall say that (/) has a non-trivial zero (x) if (x) ± (0) and f(x) = 0 for all i. 
We let (vv) = (w)ybe the point obtained by arranging the coefficients w i (v) of 
the forms in some definite order, and we consider this point as a point in some 
affine space A m , where m is the number of such coefficients. This integer m is 
determined by the given degrees d ì9 ..., d r . In other words, given such degrees, 
the set of all forms (/) = (f ì9 ... , f r ) with these degrees is in bijection with 
the points of A m . 

Theorem 3.2. (Fundamental theorem of elimination theory.) Given 
degrees d ìt ..., d r , îhe set of all forms (/,..., f r ) in n variables having a 
non-trivial common zero is an algebraic subspace of A m over Z. 

Proof Let (W) = (W iM ) be a family of variables independent of (X). Let 
(F) = (F ì9 . .., F r ) be the family of polynomials in Z[W, X ] given by 

(2) F'(W, X) = 2 W iÁV) M (v) (X) 

where M^ V) (X) ranges over all monomials in (X) of degree d i9 so (W) = (W) F . 
We call F ì9 ..., F r generic forms. Let 

û = ideal in Z[W, X] generated by F ì9 .. ., F r . 

Then a is homogeneous in (X). Thus we are in the situation of Theorem 3.1, 
with a defining an algebraic space Ot in A m x P" -1 . Note that (vv) is a specialization 
of (W), or, as we also say, (/) is a specialization of (F). As in Theorem 3.1, 
let C2.| be the projection of Ct onthefirstfactor. Thendirectly fromthedefinitions, 
(/) has a non-trivial zero if and only if (vv)y lies in d ì9 so Theorem 3.2 is a 
special case of Theorem 3.1. 

Corollary 3.3. Let (/) be afamily of n forms in ri variables, and assume 
that (w)j is a generic point of A m , i.e. that the coejficients of these forms are 
algebraically independent. Then (/) does not have a non-trivial zero. 

Proof. There exists a specialization of (/) which has only the trivial zero, 
namely/i' = Xf ',... ,f' n = X d n n . 

Next we follow van der Waerden in showing that (3 and hence Ct x are irreducible. 

Theorem 3.4. The algebraic space Ct j offorms having a non-trivial common 
zero in Theorem 3.2 is actually a Z-variety, i.e. it is irreducible. The prime ideal 
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p in Z[W, X] associated with G consists of all polynomials G(W, X ) e Z[W, X] 
í/iaí for some index j there is an integer s ^ 0 satisfying 

(*)j XjG(W, X) = 0 mod (F x ,..., F r ); that is , XjG(W, X) e û. 

If relation (*) holds for one index j, then it holds for every j = 1, . .., n. (Of 
course, the integer s depends on j .) 

Proof Weconstructagenericpointof GL Weselectany oneofthe variables, 
say X q , and rewrite the forms F t as follows: 

Fi(W , X) = Ff + Z f -Xŷ 

where Ff is the sum of all monomials except the monomial containing Xỳ. 
The coefficients (W) are thereby split into two families, which we denote by (T) 
and (Z), where (Z) = (Z l9 ..., Z r ) are the coefficients of (X d f ,. . . , X q r )in 
(F Ì9 ... 9 F r ), and (T) is the remaining family of coefficients of Ff,.. . , Ff. 
We have (W) = (Y, Z), and we may write the polynomials F t in the form 

Fì(W 9 X) = Fì(Y 9 Z, X) = Ff(Y, X) + Z'Xỳ. 

Corresponding to the variables (Y, X) we choose quantities (y, x) algebraically 
independent over Z. We let 

(3) z, = -Ff(>, x)/x d q ‘ = —Ff(y, x/x q ). 

We shall prove that (y, z, x) is a generic point of G. 

From our construction, it is immediately clear that Ffy, z, x) = 0 for all /, 
and consequently if G(W, X) e Z[W, X] satisfies (*), then G(y, z, x) = 0. 

Conversely, let G(Y, Z, X) e Z[Y, Z, X] = Z[W 9 X] satisfy G(y f z, x) = 0. 
From Taylor’s formula in several variables we obtain 

G(Y, Z, X) = G(Y,..., —Ff/X q ' + Z, + Ff/X d q >, . X) 

= G(Y, —Ff/X d ‘, X) + 2 (Z, + Ff/X d r t H„(Y, Z, X), 

where the sum is taken over terms having one factor (Z t + Ff/X di ) to some 
power fii > 0, and some factor H^. in Z[Y, Z, X]. From the way (y, z, x) was 
constructed, and the fact that G(y, z, x) = 0, we see that the first term vanishes, 
and hence 

G(Y, Z, X) = 2 (Z/ + Ff/Xỳr-H^Y, Z, X). 

Clearing denominators of X q , for some integer 5 we get 
X s q G(Y, Z, X) = 0 mod (F h . . . , F r ), 

or in other words, (*) q is satisfied. This concludes the proof of the theorem. 

Remark. Of course the same statement and proof as in Theorem 3.4 
holds with Z replaced by a field k. In that case, we denote by the ideal in 
k[W, X] generated by the generic forms, and similarly by the associated prime 
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ideal. Then 

û*,i = a k H k[W] and p M = p k D *[W]. 

The ideal p in Theorem 3.4 will be called the prime associated with the 
ideal of generic forms. The intersection p^ = p fl Z[W] will be called the prime 
elimination ideal of these forms. If C i denotes as before the zeros of p(or of 
û), and O x is its projection on the first factor, then p^ is the prime associated 
with ûp The same terminology will be used if instead of Z we work over a 
field k. (Note : homogeneous elements of pj have been called inertia forms in 
the classical literature, following Hurwitz. I am avoiding this terminology be- 
cause the word “inertia” is now used in a standard way for inertia groups as in 
Chapter VII, §2.) The variety of zeros of p^ will be called the resultant vari- 
ety. It is determined by the given degrees d Ì9 ..., d n9 so we could denote it 
by O x (d x ,. . . , d n ). 

Exercise. Show that if p is the prime associated with the ideal of generic 
forms, then p fl Z = (0) is the zero ideal. 

Theorem 3.5. Assume r = n, so we deal with n forms in n variables. Then 
P^ is principal, generated by a single polynomial , so Gj is what one calls a 
hypersurface. If(w) is a generic point of O x over a field k , then the transcen- 
dence degree of k(w) over k is m — 1. 

Proof. We prove the second statement first, and use the same notation as in 
the proof of Theorem 3.4. Let Uj = Xj/x n . Then u n = 1 and (y), («,,. . . , u n - x ) 
are algebraically independent. By (3), we have z t = —Ff(y , u ), so 

k(w) = k(y , z) C k(y, u ), 

and so the transcendence degree of k(w) over k is ^ m — 1. We claim that this 
transcendence degree is m — 1. It will suffice to prove that u x ,.. . 9 u n - x are 
algebraic over k(w) = k(y , z). Suppose this is not the case. Then there exists a 
place <p of k(w , w), which is the identity on k(w) and maps some Uj on oo. Select 
an index q such that (p(uju q ) is finite for all i = 1,. . . , n — 1. Let = uju q 
and v\ = <p(uju q ). Denote by Y iq the coefficient of X d f in F t and let T* denote 
the variables (T) from which Y Xq ,. .., Y nq are deleted. By (3) we have for 
i = 1, . . . , n: 

0 = y,<i u q + 2 Ì + Ff*(y* u ) 

= y,q + 2 i/Uq' + Ff*(y* u/u q ). 

Applying the place yields 

0 = Yiq + Ff*(y* v'). 

In particular, y iq e k(y*, v') for each i = 1,..., n. But the transcendence degree 
of k(v') over k is at most n — 1, while the elements ( y Xq ,. . . , y nq9 y*) are 
algebraically independent over k 9 which gives a contradiction proving the 
theorem. 
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Remark. There is a result (I learned it from [Jo 80]) which is more precise 
than Theorem 3.5. Indeed, let Gt as in Theorem 3.5 be the variety of zeros of 
p, and Ct ^ its projection. Then this projection is birational in the following sense. 
Using the notation of the proof of Theorem 3.5, the result is not only that k(w) 
has transcendence degree m — 1 over k , but actually we have 

Q (y, z) = Q(w) = Q(y, u). 

Proof. Let p^ = ( R ), so R is the resultant, generating the principal ideal 
Pj. We shall need the following lemma. 


Lemma 3.6. There is a positive integer s with the following properties. Fix 
an index i with 1 ^ i ^ n — 1. For each pair of n-tuples of integers ^ 0 

(a) = («j,.. . , a n ) and (p) = (ft,..., P n ) 

with | a | = | /31 = d if we have 


M ^ X) ^ ~ M ^ {X) wz) 


0 mod (F„ ..., F„). 


To see this, we use the fact from Theorem 3.4 that for some 5, 

X s n R(W) = Q X F X + • + Q n F n with Qj e Z [W, X ]. 

Differentiating with respect to W i ( ^ we get 

x s „ - QiM (P> (X) mod (F,.F„), 

and similarly 

X n ■ Q'M^iX) mod (F„ ..., F n ). 

ayy iXot) 

We multiply the first congruence by M {a) (X) and the second by M (j3) (X), and we 
subtract to get our lemma. 

From the above we conclude that 


vanishes on û, i.e. on the point (w, «), after we put X n = 1. Then we select 
M (a) (X) = Xf and M (P) (X) = Xf~ x X n for i= 1,. .., n - 1, 
and we see that we have the rational expression 




dR/dWj,(p) 

dR/dW Ua) 


, for i = 1,.. . , n - 1, 

(W)=(w) 


thus showing that Q(«) C Q(w), as asserted. 



394 ALGEBRAIC SPACES 


IX, §3 


We note that the argument also works over the prime field of characteristic 
p. The only additional remark to be made is that there is some partial derivative 
dR/dW iÁot) which does not vanish on (vv). This is a minor technical matter, which 
we leave to the reader. 

The above argument is taken from [Jo 80], Proposition 3.3.1. Jouanolou links 
old-time results as in Macaulay [Ma 16] with more recent techniques of com- 
mutative algebra, including the Koszul complex (which will be discussed in 
Chapter XXI). See also his monographs [Jo 90], [Jo 91]. 

Still following van der Waerden, we shall now give a fairly explicit deter- 
mination of the polynomial generating the ideal in Theorem 3.5. We deal with 
the generic forms FfW, X) (i = 1,..., n). According to Theorem 3.5, the ideal 
Pï is generated by a single element. Because the units in Z [W] consist only of 
±1, it follows that this element is well defined up to a sign. Let 

R(W) = R(F Ì9 ...,F n ) 


be one choice of this element. Later we shall see how to pick in a canonical way 
one of these two possible choices. We shall prove various properties of this 
element, which will be called the resultant of F Ì9 ..., F n . 

For each i = 1,..., n we let D t be the product of the degrees with d t omitted; 
that is, 

A = d x ■ • • di • • • d n . 

We let d be the positive integer such that d - 1 = 2 _ !)• 


Lemma 3.7. Given one ofthe indices, say n, there is an element R n (W) lying 
in p j , satisfying the following properties. 

(a) For each i, R n (W)Xf = 0 mod (F u ..., F n ) in Z[W 9 X]. 

(b) For each i, R n (W) is homogeneous in the set of variables (W iM ) 9 and is of 
degree D n in (W nM ), i.e. in the coefftcient of F n . 

(c) As a polynomial in Z [W] 9 /?„(W) has content 1, i.e. is primitive. 

Proof. The polynomial R n (W) will actually be explicitly constructed. Let 
M^X) denote the monomials of degree | cr| = d. We partition the indexing set 
S = {o-} into disjoint subsets as follows. 

Let = {o-ï} be the set of indices such that M^fX) is divisible by Xf 1 . 

Let S 2 = {(t 2 ) be the set of indices such that M^fX) is divisible by Xỳ but 
not by Xf 1 . 

Let S n = {a n } be the set of indices such that M^fX) is divisible by X d n n but 
not by Xf 1 ,..., X d n if. 
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Then S is the disjoint union of S t ,. .., S n . Write each monomial as follows: 
M aì (X) = HvPQXÌ ' so deg H„ x = d - d x 

M ai (X) = H an (X)X d „" so deg H„ n = d - d n . 

Then the number of polynomials 

H^F U . . . , H an F n (with (T { e S u ..., a n e S n ) 

is precisely equal to the number of monomials of degree d. We let R n be the 
determinant of the coefficients of these polynomials, viewed as forms in (X) with 
coefficients in Z [W]. Then R n = R n (W) e Z [W]. We claim that satisfies 

the properties of the lemma. 

First we note that if cr n e S n , then H an (X) is divisible by a power of X t at 
most d t - 1, for i = 1,..., n - 1. On the other hand, the degree of H an (X) in 
X n is determined by the condition that the total degree is d - d n . Hence S n has 
exactly D n elements. It follows at once that R n (W) is homogeneous of degree D n 
in the coefficients of F n , i.e. in (W n (u) ). From the construction it also follows 
that R n is homogeneous in each set of variables (W iM ) for each i = 1,. . ., 
n — 1. 

If we specialize the forms F t (i = 1,...,«) to Xf, then R n specializes to 1, 
and hence R n ± 0 and R n is primitive. For each cr^ we can write 

H, r F, = 2 C lT ^(W)M, r (X) y 
a-es 

where M a (X) (a e S) ranges over all monomials of degree d in (X), and C a a .(W) 
is one of the variables (W). Then by definition 

R n (W) = det (C a9ai (W) iai e SÙ , • • • , C a ^ an (W\ a ^ Sn) ) = det(C). 

^here^o-ï e S x , . . . , a n e S n indexes the columns, and cr indexes the rows. Let 
B = C be the matrix with components in Z[W, X] such that 

BC = det (C)I = R n I. 

(See Chapter XIII, Corollary 4.17.) Then for each cr, we have 
R n (W)M a (X) = 2 2 B IVl F,. 

i (Tì<eSì ’ 

Given i, we take for cr the index such that M a (X) = Xf in order to obtain the 
first relation in Lemma 3.7. By Theorem 3.4, we conclude that R n (W) e p P This 
concludes the proof of the lemma. 

Of course, we picked an index n to fix ideas. For each i one has a polynomial 
R t satisfying the analogous properties, and in particular homogeneous of degree 
D t in the variables (W iM ) which are the coefficients of the form F r 
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Theorem 3.8. Let R be the resultant ofthe n generic forms F t over Z, in n 
variables. Then R satisfies the folloxving properties. 


(a) R is the greatest common divisor in Z [W] of the polynomials R ìt ..., R n . 

(b) R is homogeneous of degree D z in the coeffcients of F r 

(c) Let F t = ... + W i(di) Xf y so W iÁdi) is the coefficient ofXf. Then R contains 
the monomial 


n 


±Ylw 

/= l 


D t 

UdiY 


Proof. The idea will be to specialize the forms F ì9 ... 9 F n to products of 
generic linear forms, where we can tell what is going on. For that we need a 
lemma of a more general property eventually to be proved. We shall use the 
following notation. If f\ 9 . .. 9 f n are forms with coefficients (vv), then we write 

R(f\> • • • >f n ) = R(w). 


Lemma 3.9. Let G, H be generic independent forms with deg(G//) = d x . 
Then R(GH , F 2 ,. .., F n ) is divisible by R(G , F 2 ,..., F n )R(H , F 2 ,. . ., F n ). 

Proof. By Theorem 3.5, there is an expression 

X s n R(F Ì9 . . ., F n ) = Q X F X + • • • + Q n F n with Q t e Z[W 9 X]. 

Let W G , W H , W F2 ,. . ., W Fn be the coefficients of G, //, F 2 ,. . . , F n respectively, 
and let (vv) be the coefficients of GH , F 2 ,..., F n . Then 

R(xv) = R(GH, F 2 , . . . , F n ), 

and we obtain 

X s n R(xv) = Ôï(w, X)GH + g 2 (w, X)F 2 + Q n (xv 9 X)F n . 

Hence R(GH 9 F 2 ,. . . , F n ) belongs to the elimination ideal of G, F 2 ,. . . , F n in 
the ring Z [W G , W H9 W Fl ,. .., W F J, and similarly with H instead of G. Since 
W H is a family of independent variables over Z [W G9 W Fl9 . . . , W F/j ], it follows 
that R(G 9 F 2 ,. . . , F„) divides R(GH 9 F 2 ,. . . , F„) in that ring, and similarly for 
R(H 9 F 2 ,. . . , F n ). But (W G ) and (W H ) are independent sets of variables, and so 
R(G 9 F 2 ,. . . , F n ) 9 R(H 9 F 2 ,... , F„) are distinct prime elements in that ring, so 
their product divides R(GH 9 F 2 ,. . ., F n ) as stated, thus proving the lemma. 

Lemma 3.9 applies to any specialized family of polynomials g 9 h 9 f Ì9 . . . , 
f n with coefficients in a field k. Observe that for a system of n linear forms in 
n variables, the resultant is simply the determinant of the coefficients. Thus if 
L x ,.. . , L n are generically independent linear forms in the variables X x ,. . . , X n9 
then their resultant R(L X ,. . ., L n ) is homogeneous of degree 1 in the coefficients 
of L z for each i. We apply Lemma 3.9 to the case of forms f ì9 . .. 9 f n ~ ì9 which 
are products of generically independent linear forms. By Lemma 3.9 we conclude 
that for this specialized family of form, their resultant has degree at least D n in 
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the coefficients of F n , so for the generic forms F ì9 .. . , F n their resultant has 
degree at least D n in the coefficients of F n . Similarly R(F Ì9 . . ., F n ) has degree 
at least D t in the coefficients of F t for each i. But R divides the n elements 
/^(IT),. . . , R n (W) constructed in Lemma 3.7. Therefore we conclude that R has 
degree exactly D t in the coefficients of F z . By Theorem 3.5, we know that R 
divides each R h Let G be the greatest common divisor of R n in Z[W]. 

Then R divides G and has the same degree in each set of variables (W i{v) ) for 
i = 1,. . . , n. Hence there exists c e Z such that G = cR. We must have 
c = ±1, because, say, R n is primitive in Z [W]. This proves (a) and (b) of the 
theorem. 

As to the third part, we specialize the forms to f t = Xf‘ 9 i = 1,.. . , n. Then 
R n specializes to 1, and since R divides R n it follows that R itself specializes to 
±1. Since all coefficients of the forms specialize to 0 except those which we 
denoted by W i (<di) , it follows that R(W) contains the monomial which is the product 
of these variables to the power D h up to the sign ±1. This proves (c), and 
concludes the proof of Theorem 3.8. 

We can now normalize the resultant by choosing the sign such that R contains 
the monomial 


n 

“ = n Ku,y 

with coefficient +1. This condition determines R uniquely, and we then denote 
R also by 

R = Res(F 1? . . . , F n ). 

Given forms f ì9 . . . ,f n with coefficients (vv) in a field K (actually any commu- 
tative ring), we can then define their resultant 


Res(/j, ...J n ) = R(w) 


with the normalized polynomial R. With this normalization, we then have a 
stronger result than Lemma 3.9. 

Theorem 3.10. Let f x = gh be a product offorms such that d tg(gh) = d x . 

Let / 2 , . . ., f n be arbitrary forms of degrees d 2 , ..., d n . Then 

Res (gh, / 2 ,..., f n ) = Res (g, / 2 ,..., /„)Res(/z, / 2 ,..., f n ). 

Proof. From the fact that the degrees have to add in a product of poly nomials, 
together with Theorem 3.8(a) and (b), we now see in Lemma 3.9 that we must 
have the precise equality in what was only a divisibility before we knew the 
precise degree of R in each set of variables. 

Theorem 3.10 is very useful in proving further properties of the determinant, 
because it allows a reduction to simple cases under factorization of polynomials. 
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For instance one has: 

Theorem 3.11. Let F x . F n be the generic forms in n variables, and let 

F ìt ... t F n be theforms obtained by substituting X n = 0, so that F ìt , F n _ x 
are the generic forms inn — 1 variables. Let n ^ 2. Then 

Res(F„ X d n ') = Res<F„ ..., F n . x ) d ". 

Proof. By Theorem 3.10 it suffices to prove the assertion when d n = 1. By 
Theorem 3.4, for each i = 1,..., n — 1 we have an expression 

(*) XfRes (F l9 ... 9 F n - Ì9 X n ) = Q X F X + ••• + Q n - x F n . x + Q n X n 

with Qj e Z [W, X] (depending on the choice of /). The left-hand side can be 
written as a polynomial in the coefficients of F ì9 ..., F n _ x with the notation 

X' t RW Fl ,.... W Fm _ t , l Xn ) = XfP(W Fl ,W Fm „) = XfP(W say; 

thus in the generic linear form in X Xt ..., X n we have specialized all the coef- 
ficients to 0 except the coefficient of X n , which we have specialized to 1. Sub- 
stitute X n = 0 in the right side of (*). By Theorem 3.4, we conclude that 
P(W^ _1) ) lies^ in the resultant ideal of F x ,..., F w _j, and therefore 
Res(F,,..., F n „ x ) divides P(VT (,2-1) ). By Theorem 3.8 we know that 
P(W (n ~_^) has the same homogeneity degree in Wp. (i = 11) 
as Res(F 1? . . . , F n _ x ). Hence there is c e Z such that 

cRqs(F x ,. . ., F n _ j) = Res(Fj,. .., F n _ x , X n ). 

One finds c = 1 by specializing F x ,... 9 F n _ x to Xf 1 ,. . . , X n n S x l respectively, 
thus concluding the proof. 

The next basic lemma is stated for the generic case, for instance in Macaulay 
[Ma 16], and is taken up again in [Jo 90], Lemma 5.6. 

Lemma 3.12. Let A be a commutative ring. Let f x ,---,f n > g x , • • • > 9 n be 
homogeneous polynomials in A[X x ,..., X n ]. Assume that 

(g x ,...,g n )C(f Xt ...,f n ) 

as ideals in A[X\. Then 

Res(/j, ...,/„) divides Res (g x ,..., g n ) in A. 


Proof. Express each g t = 2^/ with h^ homogeneous in A[X]. By spe- 
cialization, we may then assume that g { = X ^ijFj where and Fj have alge- 
braically independent coefficients over Z. By Theorem 3.4, for each i we have 
a relation 


Xj Res (g ì9 . .. , g n ) = Q x g x + • • • + Q n g n with some Q { e Z [W H , W f ] 9 
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where W H , W F denote the independent variable coefficients of the polynomials 
Hij and Fj respectively. In particular, 

(*) Xj Res (g ì9 ..., g n ) = 0 mod (F u ..., F n )Z[W H , W f , X\. 

Note that Rts(g ì9 . . . , g n ) = P(W H , W F ) e Z[W h , W f \ is a polynomial with 
integer coefficients. If ( w F ) is a generic point of the resultant variety d x over 
Z, then P(W H , w F ) = 0 by (*). Hence Res(Fj,.. . , F n ) divides P(W H , W F ), thus 
proving the lemma. 

Theorem 3.13. Let A be a commutative ring and let d x ,..., d n be integers 
^ 1 as usual. Let f be homogeneous of degree d { in A[X\ = A[X x ,..., X n \. 

Let d be an integer ^ 1, and let g t . g n be homogeneous of degree d in 

A[X\. Then 

fi ° g ~ fi(d ì * • • • > 9n^ 
is homogeneous of degree dd if and 

Res(/[ 0 g, •••,/„ o g) = Res (g { ,.... g n ) d ' "' dn Res(/,,... ,f n ) d "~' in A. 
Proof. We start with the standard relation of Theorem 3.4: 


(*) XfRes (F x ,..., F n ) * 0 mod (F x ,..., F n )Z[W F , X). 

We let Gj,. .., G n be independent generic polynomials of degree d, and let W G 
denote their independent variable coefficients. Substituting G z forX, in (*), we 
find 

G-Res(Fj,. . ., F n ) = 0 mod (F x ° G,.. ., F n ° G)Z[W F , W G , X\. 

Abbreviate Res(F 1? .. ., F n ) by R(F), and let g t = Gf/?(F). By Lemma 3.12, it 
follows that 

Res(/j o G,. . ., F n o G) divides Res (G\R(F),..., G s n R(F)) in Z[W F , W G \. 

By Theorem 3.10 and the homogeneity of Theorem 3.8(b) we find that 

Res (G\R(F),G S „R(F)) = Res(G,,.... G n ) M Res (F„ .... F n ) N 

with integers M,N ^ 0. Since Res(Gj,..., G n ) and Res ^,. . . ,F n ) are distinct 
prime elements in Z [W G , W F \ (distinct because they involve independent vari- 
ables), it follows that 

(**) Res(Fj ° G,..., F n ° G) = e Res(G 1? . .., G n ) a Res(Fj,..., F n ) b 

with integers a, b ^ 0 and e = 1 or — 1. Finally, we specialize F t to VVJXf' and 
we specialize G z to £/ z Xf, with independent variables (W x ,. . . , W n , U x ,. . . , U n ). 
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Substituting in (**), we obtain 

Res(W,£/f'Xf>. W n U d n n X d n") 

= e Res(£/,Xf,..., U n X d Y ResOV,Xf>,. .., W n X d n ») b . 
By the homogeneity of Theorem 3.8(b) we get 

Y\(WiU d ‘) dl di d " d " ' = eYlu d " ' a T\w d ' d > d " b . 

/' i i 

From this we get at once e = 1 and a, b are what they are stated to be in the 
theorem. 

Corollary 3.14. Let C = (c íy ) be a square matrix mth coefficients in A. Let 
f(X) = Fi(CX) (where CX is multiplication ofmatrices, viewing X as a column 
vector). Then 

Res(/,, ...,/„) = detfCf' Res(F,,..., F„). 

Proof. This is the case when d = 1 and g t is a linear form for each i. 

Theorem 3.15. Let f Xì ..., f n be homogeneous in A[X], and suppose 
d n ^ difor all i. Let h { be homogeneous of degree d n — d t in A[X]. Then 

n-l 

Res(/,, ...,/„_,,/„+ S hjfj) = Res(/,, ...,/„) in A. 
j= 1 

Proof. We may assume f t = F z are the generic forms, H { are forms generic 
independent from F { ,. .., F n , and A = Z [W f9 W H \, where (W F ) and (W H ) 
are the coefficients of the respective polynomials. We note that the ideals 

(F ì9 . . ., F n ) and (F Ì9 . . . , F n + 21 are equal. From Lemma 3.12 we 

j *n 

conclude that the two resultants in the statement of the theorem differ by a factor 
of 1 or — 1. We may now specialize to 0 to determine that the factor is +1, 
thus concluding the proof. 

Theorem 3.16. Let tt be a permutation of {1,..., n), and let e(i t) be its 
sign. Then 

Res(/v (I) .F OT(n) ) = e(TT) d '■■ d » Res(F,,..., F„). 

Proof. Again using Lemma 3.12 with the ideals (F Ì9 ... 9 F n ) and 
(F w(1 ),.. . , F^n)), which are equal, we conclude the desired equality up to a 
factor ±1, in Z [W F ]. We determine this sign by specializing F z - toXf , and using 
the multiplicativity of Theorem 3.10. We are then reduced to the case when 
F t = X i9 so a linear form; and we can apply Corollary 3.14 to conclude the proof. 

The next theorem was an exercise in van der Waerden’s Moderne Algebra. 
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Theorem 3.17. Let L x . L n ~ i, F be generic forms in n variables , such 

that L ìt ..., are ofdegree 1, and F has degree d = d n . Let 

A j(J = 1,..., n) 

be times the j-th minor determinant of the coefficient matrix of the 

forms (L ìt ... t L n _ x ). Then 

Res (L ì9 ..., L„_,, F) = F( Aj,..., A„). 

Proof. We first claim that for all j = 1,..., n we have the congruence 

(*) X n A, - *,A„ 3 0 mod ( L i> • • • ’ ^-i)Z[W, X], 

where as usual, (W) are the coefficients of the forms L Ì9 ... , L n _ Ì9 F. To see 
this, we consider the system of linear equations 

W n Xi + • • • + Wx,n-xXn-x = L X (W, X) - W x , n X n 


+ ••• + W n - X , n - X X n - x = L n _ x (W,X) - W n _ Un X n . 

If C = (C l ,, C n ~ l ) is a square matrix with columns C j , then a solution of 
a system of linear equations CX = C n satisfies Cramer’s rule 

Xjáti(C\ ...,C n ~ l ) = det (C\ ...,C n ,..., C n ~ l ). 

Using the fact that the determinant is linear in each column, (*) falls out. 

Then from the congruence (*) it follows that 

X d n F{ A„ ..., A„) - A d F{X x ,..., X n ) mod (L,,..., L„_,) Z[W, X], 

whence 

X d F{ A„ ..., A„) = 0 mod (L„ ..., L„_„ F). 

Hence by Theorem 3.4 and the fact that Res^,. . ., L n _ x , F) = /?(W) generates 
the elimination ideal, it follows that there exists c e Z [W] such that 

F(A,,. .., A„) = cRes(L,,..., L„_,, F). 

Since the left side is homogeneous of degree 1 in the coefficients W F and homo- 
geneous of degree d in the coefficients W L . for each i = 1,. . . , n — 1, it follows 
from Theorem 3.8 that c e Z. Specializing L z toX z and F toX n makes Aspecialize 
to 0 if j + n and specializes to 1. Hence the left side specializes to 1, and 
so does the right side, whence c = 1. This concludes the proof. 
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§4. RESULTANT SYSTEMS 

The projection argument used to prove Theorem 3.4 has the advantage of 
constructing a generic point in a very explicit way. On the other hand, no explicit, 
or even effective, formula was given to construct a system of forms defining 
We shall now reformulate a version of Theorem 3.4 over Z and we shall 
prove it using a completely different technique which constructs effectively a 
system of generators for an ideal of definition of the arithmetic variety in 
Theorem 3.2. 

Theorem 4.1. Given degrees d x . d r ^ 1, and positive integers m, n. Let 

(W) = (W iM ) be the variables as in §3, (2) viewed as algebraically independent 
elements over the integers Z. There exists an ejfectively determinable finite 
number of polynomials R p (W) e Z [W\ having the following property. Let (f) 
be as in (1), a system of forms of the given degrees with coejficients (vv) in 
some field k. Then (f) has a non-trivial common zero ifand only ifR p (w) = 0 
for all p. 

A finite family {/? p } having the property stated in Theorem 4.1 will be called 
a resultant system for the given degrees. According to van der Waerden 
(Moderne Algebra , first and second edition, §80), the following technique of 
proof using resultants goes back to Kronecker elimination, and to a paper of 
Kapferer (Úber Resultanten und Resultantensysteme, Sitzungsber. Bayer. Akad. 
Miinchen 1929, pp. 179-200). The family of polynomials {R p (W)} is called a 
resultant system, because of the way they are constructed. They form a set of 
generators for an ideal b^ such that the arithmetic variety Ct x is the set of zeros 
of bj. I don’t know how close the system constructed below is to being a set of 
generators for the prime ideal in Z[W] associated with d x . Actually we shall 
not need the whole theory of Chapter IV, §10; we need only one of the char- 
acterizing properties of resultants. 



IX, §4 


RESULTANT SYSTEMS 403 


Let p 9 q be positive integers. Let 

fv = v 0 XI + vjp-% + • • • + v p XP 2 
g w = WoX\ + w x X\~% + • • • + w q Xl 

be two generic homogeneous polynomials in Z[v , vv, X Ì9 X 2 ] = Z[v 9 w][X]. In 
Chapter IV, §10 we defined their resultant Res (f V9 g w ) in case X 2 = 1, but we 
find it now more appropriate to work with homogeneous polynomials. For our 
purposes here, we need only the fact that the resultant R(v 9 vv) is characterized 
by the following property. If we have a specialization ( a 9 b) of (v 9 vv) in a field 
K 9 and if f a ,fb have a factorization 

p 

fa = a 0 n (X, - a,X 2 ) 

Ì= 1 

q 

9 b = bo n (X, - pjX 2 ) 

7=1 

then we have the symmetric expressions in terms of the roots: 

R(a, b) = Res (f a , f b ) = agfeg ÍI (a ; - Pj) 

i.j 

= ai n gb^, i) = (~i)p% Um, i). 

ì j 

From the general theory of symmetric polynomials, it is a priori clear that 
R(v 9 vv) lies in Z[v 9 vv], and Chapter IV, §10 gives an explicit representation 

<Pv, w fv + 4>v, w g w = x?i-'R(p, w) 

where <p v w and if/ VfW e Z[v 9 vv, X]. This representation will not be needed. The 
next property will provide the basic inductive step for elimination. 

Proposition 4.2. Let f a , g b be homogeneous polynomials with coefftcients in 
afield K. Then R(a , b) = 0 if and only if the system of equations 

fa(X) = o, g b (X) = 0 

has a non-trivial zero in some extension of K (which can be tahen to be finite). 

If a 0 = 0 then a zero of g b is also a zero of f a \ and if b 0 = 0 then a zero of f a 
is also a zero of g b . If a 0 b 0 ± 0 then from the expression of the resultant as a 
product of the difference of roots (a^ - fy) the proposition follows at once. 

We shall now prove Theorem 4.1 by using resultants. We do this by induction 
on n. 
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If n = 1, the theorem is obvious. 

If n = 2, r = 1, the theorem is again obvious, taking the empty set for ( R p ). 
If n = 2, r = 2, then the theorem amounts to Proposition 4.2. 

Assume now n = 2 and r > 2, so we have a system of homogeneous equations 


o =mx) =f 2 (X) = ... =f r (X) 


with (X) = (X { , X 2 ). Let d ( be the degree of f and let d = max d h We replace 
the family {f(X)} by the family of all polynomials 

fi(X)XÌ~ d ‘ and fi(X)XỲ d ‘, i = 1,..., r. 

These two families have the same sets of non-trivial zeros, so to prove Theorem 
4.1 we may assume without loss of generality that all the polynomials f ì9 . . . , 
f r have the same degree d. 

With n = 2, consider the generic system of forms of degree d in (X): 

(4) Fi(W, r , X) = 0 with i = 1,. .., r, in two variables (X) = (X Ì9 X 2 ), 

where the coefficients of F t are W i 0 ,..., W i d so that 

( W) = (W uo ,..., W Ud9 ... 9 W r , 0 ,..., W r4 ). 

The next proposition is a special case of Theorem 4.1, but gives the first step 
of an induction showing how to get the analogue of Proposition 4.2 for such a 
larger system. Let T x ,..., T r and U x ,..., U r be independent variables over 
Z [W, X]. Let F x ,..., F r be the generic forms of §3, (2). Let 

/= F x (W,X)T x + ••• + F r (W, X)T r 
9 = F X (W, X)U x + • • • + F r (W, X)U r 

so f,ge Z [W, T, U][X]. Then f, g are polynomials in (X) with coefficients in 
Z[W, T, U]. We may form their resultant 

Res(/, g) e Z [W, T, U]. 

Thus Res(/, g) is a polynomial in the variables (T, U) with coefficients in Z [W]. 
We let (Qn{W)) be the family of coefficients of this polynomial. 

Proposition 4.3. The system {Q t ^(VT)} just constructed satisfìes the property 
ofTheorem 4.1, i.e. it is a resultant system for r forms of the same degree d. 

Proof. Suppose that there is a non-trivial solution of a special system 
FfW, X) = 0 with (vv) in some field k. Then (vv, T, U) is a common non-trivial 
zero of/, g, so Res(/, g) = 0 and therefore G^vv) = 0 for all /x. Conversely, 
suppose that Q^(w) = 0 for all fi. Let f(X) = F ( (w, X). We want to show 
that/ z (X) for i = 1,. .. , r have a common non-trivial zero in some extension of 
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k. If all /■ are 0 in k[X ì9 X 2 ] then they have a common non-trivial zero. If, say, 
/, =k 0 in k[X ], then specializing T 2 ,. .., T r to 0 and T x to 1 in the resultant 
Res(/, g ), we see that 

Res {f x ,f 2 U 2 + ••• +f r U r ) = 0 

as a polynomial in k[U 2 ,. . . , U r ]. After making a finite extension of k if neces- 
sary, we may assume that/íX) splits into linear factors. Let {aj be the roots 
of f\(X ì9 1). Then some (a i9 1) must also be a zero of f 2 U 2 + • • • + f r U r , 
which implies that (a i9 1) is a common zero of /,,..., f r since U l9 . . . , U r 
are algebraically independent over k. This proves Proposition 4.3. 

We are now ready to do the inductive step with n > 2. Again, let 

fi(X) = Fi(w, X) fov j = 1,.. ., r 

be polynomials with coefficients (vv) in some fields k. 

Remark 4.4. There exists a non-trivial zero of the system 

fi = 0 (i = 1, . .., r) 

in some extension of k if and only if there exist 

(x x ,..., x n —,) + (0,..., 0) and (x n , t) + (0, 0) 

in some extension of k such that 

fi(tx x ,..., tx n _ x , x n ) = 0 for i = ì,..., r. 

So we may now construct the system (R p ) inductively as follows. 

Let Tbea new variable, and let = (X Ì9 . . . , X,^!). Let 

g,(W, X (n -'\ S„, T) = F,(W, TX x ,..., TX„. x , X„) e Z [W, X (n ~ l U[X„, T]. 

Then g t is homogeneous in the two variables (X n9 T). By the theorem for two 
variables, there is a system of polynomials (Q^) in Z[W, X (n ' l) ] having the 
property: if(w , jt^ -1 *) is a point in a field K, then 

gi(w, X n , T) have a non-trivial common zero for i = 1,.. ., r. 

Qil( w > * (/I_1) ) = 0 for all p. 

Viewing each Q p as a polynomial in the variables (X (/î-1) ), we decompose each 
Q p as a sum of its homogeneous terms, and we let (H Á (W, X (n ~ ]) )) be the fam- 
ily of these polynomials, homogeneous in (X (,I-1) ). From the homogeneity 
property of the forms Fj in (X), it follows that if t is transcendental over K 
and gi(w, x (n ~ ]) , X n , T) have a non-trivial common zero for j = 1 ,..., r 
then ^(vv, tx (n ~ ]) , X n , T) also have a non-trivial common zero. Therefore 
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Q t ^(vv, tx (n ~ l) ) = 0 for all /jl, and so H x (w, jc ( w-1) ) = 0. Therefore we may use the 
family of polynomials (H x ) instead of the family (Q^), and we obtain the property: 
if(w , x (n ' l) ) is a point in a field K, then 

g t (w , x (n ~ l) 9 X n , T) have a non-trivial common zero for i = 1,. .., r 

<^> H x (w , x (n ~ l) ) = 0 for all À. 

By induction on n, there exists a family (7? p (PT)) of polynomials in Z[W] 
(actually homogeneous), having the property: if(w) is a point in a field K, then 

H x (w , X (/ï_1) ) have a non-trivial common zero for all À 

<=> R p (w) = 0 for all p. 

In light of Remark 4.4, this concludes the proof of Theorem 4.1 by the resultant 
method. 


§5. SPEC OF A RING 

We shall extend the notions of §2 to arbitrary commutative rings. 

Let A be a commutative ring. By spec(A) we mean the set of all prime ideals 
of A. An element of spec(A) is also called a point of spec(A). 

If/e A, we view the set of prime ideals p of spec(/l) containing /as the set 
of zeros of /. Indeed, it is the set of p such that the image of / in the canonical 
homomorphism 

A -> A/p 

is 0. Let û be an ideal, and let 2E(a) (the set of zeros of û) be the set of all 
primes of A containing a. Let a,b be ideals. Then we have: 

Proposition 5.1. 

(i) 3£(ab) = 2(a) U 2(b). 

(ii) //{a/} is a family ofideals, then 2ê(2û/) =11 2£(a/). 

(iii) We have 3£(a) C 3£(b) if and only if rad(a) D rad(b), where rad(a), the 
radical ofa, is the set of all elements x e A such that x n e a for some 
positive integer n. 

Proof. Exercise. See Corollary 2.3 of Chapter X. 

A subset C of spec(A) is said to be closed if there exists an ideal a of A such 
that C consists of those prime ideals p such that acp. The complement of a 
closed subset of spec(^) is called an open subset of spec(A). The following 
statements are then very easy to verify, and will be left to the reader. 
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Proposition 5.2. The union of a finite number of closed sets is closed. The 
intersection of an arbitrary family of closed sets is closed. 

The intersection of a finite number of open sets is open. The union of an 
arbitrary family of open sets is open. 

The empty set and spec(/l) itself are both open and closed. 

If 5 is a subset of A, then the set of prime ideals p e spec(A) such that S c p 
coincides with the set of prime ideals p containing the ideal generated by S. 

The collection of open sets as in Proposition 5.2 is said to be a topology on 
spec(A), called the Zariski topology. 


Remark. In analysis, one considers a compact Hausdorff space S. “Haus- 
dorff” means that given two points P, Q there exists disjoint open sets U P , U Q 
containing P and Q respectively. In the present algebraic context, the topology 
is not Hausdorff. In the analytic context, let R be the ring of complex valued 
continuous functions on S. Then the maximal ideals of R are in bijection with 
the points of S (Gelfand-Naimark theorem). To each point P e S, we associate 
the ideal M P of functions / such that f(P) = 0. The association P i-> M P 
gives the bijection. There are analogous results in the complex analytic case. 
For a non-trivial example, see Exercise 19 of Chapter XII. 


Let A, B be commutative rings and q>: A B a homomorphism. Then <p 
induces a map 

cp* = spec (cp) = cp~ l : spec (B) -> spec(/l) 

by 

p i ► cp ~ 1 (p). 


Indeed, it is immediately verified that cp~ *(p) is a prime ideal of A. Note however 
that the inverse image of a maximal ideal of B is not necessarily a maximal ideal 
of A. Example? The reader will verify at once that spec (cp) is continuous, in the 
sense that if U is open in spec (B), then cp~ l (U) is open in spec(/l). 

We can then view spec as a contravariant functor from the category of 
commutative rings to the category of topological spaces. 

By a point of spec(A) in a field L one means a mapping 


spec (cp): spec(L) -► spec(/l) 


induced by a homomorphism cp: A L of A into L. 

For example, for each prime number p, we get a point of spec(Z), namely 
the point arising from the reduction map 


Z - Z/pZ. 
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The corresponding point is given by the reversed arrow, 

spec(Z) <- spec(Z/pZ). 

As another example, consider the polynomial ring k[X ..., over a 
field k. For each n-tuple (c í9 ..., c„) in /c a(n) we get a homomorphism 

q>:k[X x ,...,Xà->k? 

such that cp is the identity on /c, and <p (Xỳ = for all i. The corresponding 

point is given by the reversed arrow 

spec /c[A"] <- spec(/c a ). 

Thus we may identify the points in n-space /c a(w) with the points of spec /c[AJ 
(over k) in /c a . 

However, one does not want to take points only in the algebraic closure of 
and of course one may deal with the case of an arbitrary variety V over k 
rather than all of affine n-space. Thus let k[x ì9 .. ., x n ] be a finitely generated 
entire ring over k with a chosen family of generators. Let V = spec k[x]. Let A 
be a commutative k-algebra, corresponding to a homomorphism k—>A. Then a 
point of V in A may be described either as a homomorphism 

(p: *[*,, ...,x„]->A, 


or as the reversed arrow 


spec(^) —► spec(fc[x]) 

corresponding to this homomorphism. If we put c t = <p( jcJ, then one may call 
(c) = (ci,. .., c n ) the coordinates of the point in A. By a generic point of V 
in a field K we mean a point such that the map <p:k[x] —» K is injective, i.e. an 
isomorphism of k[x] with some subring of K. 

Let A be a commutative Noetherian ring. We leave it as an exercise to 
verify the following assertions, which translate the Noetherian condition into 
properties of closed sets in the Zariski topology. 

Closed subsets of spec(A) satisfy the descending chain condition, i.e., if 

c, => C 2 =3 C 3 =3 ••• 

is a descending chain of closed sets, then we have C n = C n+l for all sufficiently 
large n. Equivalently, let {C f } ie/ be a family of closed sets. Then there exists a 
relatively minimal element of this family, that is a closed set C io in the family 
such that for all i, if C t <= C io then C, = C io . The proof follows at once from 
the corresponding properties of ideals, and the simple formalism relating 
unions and intersections of closed sets with products and sums of ideals. 
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A closed set C is said to be irreducible if it cannot be expressed as the union 
of two closed sets 


C / Cj u C 2 


with Cj ^ C and C 2 # C. 

Theorem 5.3. Let Abea Noetherian commutative ring. Then every closed 
set C can be expressed as a finite union of irreducible closed sets, and this 
expression is unique if in the union 

C = Cj u • u C r 

of irreducible closed sets, we have C, C } if i # j. 

Proof. We give the proof as an example to show how the version of Theorem 
2.2 has an immediate translation in the more general context of spec(A). Suppose 
the family of closed sets which cannot be represented as a finite union of irreducible 
ones is not empty. Translating the Noetherian hypothesis in this case shows that 
there exists a minimal such set C. Then C cannot be irreducible, and we can 
write C as a union of closed sets 


C = C' u C", 

with C' # C and C" # C. Since C' and C" are strictly smaller than C, then we 
can express C' and C" as finite unions of irreducible closed sets, thus getting a 
similar expression for C, and a contradiction which proves existence. 

As to uniqueness, let 

C = Cj u • • • u C r = Zj u • • • u Z s 

be an expression of C as union of irreducible closed sets, without inclusion 
relations. For each we can write 

= (Zj n CJ u • * • u (Zj n C r ). 

Since each Zj n C, is a closed set, we must have Zj = Zj n c, for some i. Hence 
Zj = C, for some i. Similarly, C, is contained in some Z k . Since there is no 
inclusion relation among the Z/ s, we must have Zj = C x = Z h . This argument 
can be carried out for each Z } and each C,. This proves that each Zj appears 
among the C,’s and each C, appears among the Z/s, and proves the uniqueness 
of our representation. This proves the theorem. 

Proposition 5.4. Let C be a closed subset ofsptc(A). Then C is irreducible 
ifand only ifC = 3£(p) for some prime ideal p. 

Proof. Exercise. 

More properties at the same basic level will be given in Exercises 14-19. 
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EXERCISES 

Integrality 

1. (Hilbert-Zariski) Let k be a field and let V be a homogeneous variety with generic 
point (jf) over k. Let 2Ê be the algebraic set of zeros in k a of a homogeneous ideal in 
k[X] generated by forms f r in k[X ]. Prove that V fì 2Ê has only the trivial 
zero if and only if each jc, is integral over the ring k[f(x)] = k[f x (: c),. .. , f r (x)]. 
(Compare with Theorem 3.7 of Chapter VII.) 

2. Let /,,..., f r be forms in n variables and suppose n > r. Prove that these forms 
have a non-trivial common zero. 

3. Let R be an entire ring. Prove that R is integrally closed if and only if the local ring 
R p is integrally closed for each prime ideal p. 

4. Let R be an entire ring with quotient field K. Let t be transcendental over K. Let 
/(0 = 2<V' E K[t]. Prove: 

(a) If f(t) is integral over R[t], then all û, are integral over R. 

(b) If R is integrally closed, then R[t] is integrally closed. 

For the next exercises, we let R = k[x] = k[X]/p , where p is a homogeneous prime 
ideal. Then (j:) is a homogeneous generic point for a &-variety V. We let / be the integral 
closure of R in /:(jc). We assume for simplicity that k(x) is a regular extension of k. 

5. Let z = with c, E k , and z 0. If k[x] is integrally closed, prove that k[x/z] 

is integrally closed. 

6. Define an element/ E k(x) to be homogeneous if f(tx) = t d f(x) for t transcendental 
over k(x) and some integer d. Let / E /. Show that / can be written in the form 
/ = 2/ where each/ is homogeneous of degree i ^ 0, and where also/ E /. (Some 
/ may be 0, of course.) 

We let /? m denote the set of elements of R which are homogeneous of degree m. 
Similarly for I m . We note that R m and l m are vector spaces over k , and that R (resp. /) 
is the direct sum of all spaces R m (resp. / m ) for m = 0, 1,. . . This is obvious for R , and 
it is true for / because of Exercise 6. 

7. Prove that / can be written as a sum / = Rz { + • • • + Rz s , where each z, is homoge- 
neous of some degree d ( . 

8. Define an integer m ^ 1 to be well behaved if I q m = I qm for all integers q ^ 1. If 
R = I, then all m are well behaved. In Exercise 7, suppose m ^ max d t . Show that 
m is well behaved. 

9. (a) Prove that / m is a finite dimensional vector space over k. Let w 0 ,. .. , w M be a 

basis for / m over Then k[I m ] = k[w]. 

(b) If m is well behaved, show that k[I m ] is integrally closed. 

(c) Denote by /:((j:)) the field generated over k by all quotients xjxj with Xj =É 0, 
and similarly for /:((w)). Show that k((x)) = k((w)). 

(If you want to see Exercises 4-9 worked out, see my Introduction to Algebraic 
Geometry , Interscience 1958, Chapter V.) 
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Resultants 

10. Prove that the resultant defined for n forms in n variables in §3 actually coincides 
with the resultant of Chapter IV, or §4 when n = 2. 

11. Let o =(/!,... ,/ r ) be a homogeneous ideal in k[X l9 ... , X n ) (with k algebraically 
closed). Assume that the only zeros of û consist of a finite number of points 
(jc (1) ), . .. , ( x (d) ) in projective space P" -1 , so the coordinates of each x (j) can be 
taken in k. Let u í9 .. ., u n be independent variables and let 

L U (X) = u x X j + • • • -f u n X n . 

Let /?j(m), ...» R s (u) E k[u] be a resultant system for/j,..., / r , L u . 

(a) Show that the common non-trivial zeros of the system /?,(«) (i = 1,..., s) 
in k are the zeros of the polynomial 

n Lu(x <J) ) e k[u]. 
j 

(b) Let D(u) be the greatest common divisor of /?/«),.. . , R s (u) in k[u\. Show 
that there exist integers nij ^ 1 such that (up to a factor in k) 

d 

D(u) = n L u (x < j) ) m ‘. 

7=1 

[See van der Waerden, Moderne Algebra, Second Edition, Volume II, §79.] 

12. For forms in 2 variables, prove directly from the definition used in §4 that one has 

Res (fg, h) = Res(/, h) Res (g, h) 

Res(/, g) = (-l)( de g/)( d eg9) Res (g 9 f). 

13. Let k be a field and let Z —* k be the canonical homomorphism. If F E Z[W, X], we 
denote by F the image of F in k[W ,, X] under this homomorphism. Thus we get R , 
the image of the resultant R. 

(a) Show that R is a generator of the prime ideal p* , of Theorem 3.5 over the 
field k. Thus we may denote R by R k . 

(b) Show that R is absolutely irreducible, and so is R k . In other words, R k is 
irreducible over the algebraic closure of k. 

Spec of a ring 

14. Let A be a commutative ring. Define spec(A) to be connected if spec(A) is not the 
union of two disjoint non-empty closed sets (or equivalently, spec(A) is not the union 
of two disjoint, non-empty open sets). 

(a) Suppose that there are idempotents e lf e 2 in A (that is e\ = e x and e\ = e 2 ), 
=É 0, 1, such that e x e 2 = 0 and e x + e 2 = 1. Show that spec(A) is not 
connected. 

(b) Conversely, if spec(/4) is not connected, show that there exist idempotents 
as in part (a). 

In either case, the existence of the idempotents is equivalent with the fact that the 
ring A is a product of two non-zero rings, A = A x X A 2 . 
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15. Prove that the Zariski topology is compact, in other words: let {Ui} ieI be a family of 
open sets such that 

1J Ui = spec(A). 

i 

Show that there ìs a finite number of open sets U iv ..., U in whose union is spec(/4). 
[Hint: Use closed sets, and use the fact that if a sum of ideals is the unit ideal, then 1 
can be written as a finite sum of elements.] 

16. Let/be an element of A. Let S be the multiplicative subset {1, /, / 2 , / 3 ,. . .} con- 
sisting of the powers of /. We denote by Af the ring S~ l A as in Chapter II, §3. 
From the natural homomorphism A —> A f one gets the corresponding map 
spec^AJ —> spec(A). 

(a) Show that spec(AJ maps on the open set of points in spec(A) which are not 
zeros of /. 

(b) Given a point p E spec(A), and an open set U containing p, show that there 
exists / such that p E spec(AJ C U. 

17. Let Ui = spec(A /( ) be a finite family of open subsets of spec(A) covering spec(A). 
For each /, let ajf E A fi . Assume that as functions on U { - fl Uj we have a,-// = aj/fj 
for all pairs /, j. Show that there exists a unique element a E A such that a = û,-// 
in A f for all /. 

18. Let k be a field and let k[x lt ... , x n ] = A C K be a finitely generated subring of 
some extension field K. Assume that k(x { , . . . , x n ) has transcendence degree r. Show 
that every maximal chain of prime ideals 

ADPj DP 2 D ... {0}, 

with P, =f= A , P, i= P i+Ì , P m i= {0}, must have m = r. 

19. Let A = Z[jc,, ..., x n ] be a finitely generated entire ring over Z. Show that every 
maximal chain of prime ideals as in Exercise 18 must have m = r + 1. Here, r = 
transcendence degree of Q(jt,, ..., jc„) over Q. 



CHAPTER X 

Noetherian Rings and 
Modules 


This chapter may serve as an introduction to the methods of algebraic geometry 
rooted in commutative algebra and the theory of modules, mostly over a Noeth- 
erian ring. 


§1. BASIC CRITERIA 

Let A be a ring and M a module (i.e., a left /1-module). We shall say that 
M is Noetherian if it satisfies any one of the following three conditions: 

(1) Every submodule of M is finitely generated. 

(2) Every ascending sequence of submodules of M, 

Mj c= M 2 c= M 3 c= • • •, 
such that M, # M i+ j is finite. 

(3) Every non-empty set S of submodules of M has a maximal element 
(i.e., a submodule M 0 such that for any element N of 5 which contains 
M 0 we have N = M 0 ). 

We shall now prove that the above three conditions are equivalent. 

(1) => (2) Suppose we have an ascending sequence of submodules of M as 
above. Let N be the union of all the M, (i = 1,2,...). Then N is finitely gen- 
erated, say by elements x Ì9 ..., x r , and each generator is in some M f . Hence 
there exists an index j such that 


x ì9 ...,x r e Mj. 


413 




414 NOETHERIAN RINGS AND MODULES 


X, §1 


Then 

<Xj,..., x r > c Mj cz N = <Xj,..., x r >, 

whence equality holds and our implication is proved. 

(2) => (3) Let N 0 be an element of S. If N 0 is not maximal, it is properly 
contained in a submodule N v If N x is not maximal, it is properly contained in 
a submodule N 2 . Inductively, if we have found N t which is not maximal, it is 
contained properly in a submodule N i+l . In this way we could construct an 
infinite chain, which is impossible. 

(3) =>(1) Let N be a submodule of M. Let a 0 eN. If N # <a 0 >, then 
there exists an element a { e N which does not lie in <a 0 >. Proceeding induc- 
tively, we can find an ascending sequence of submodules of N, namely 


<tf 0 > cz <a 0 , û!> c (a 0 , a l9 a 2 > <= • • • 


where the inclusion each time is proper. The set of these submodules has a 
maximal element, say a submodule (a 0 , a í9 ..., a r >, and it is then clear that 
this finitely generated submodule must be equal to N, as was to be shown. 

Proposition 1.1. Let M be a Noetherian A-module. Then every submodule 

and every factor module of M is Noetherian. 

Proof. Our assertion is clear for submodules (say from the first condi- 
tion). For the factor module, let N be a submodule and /: M -► M/N the 
canonical homomorphism. Let M x <= M 2 <= • • • be an ascending chain of sub- 
modules of M/N and let M f = f~ l (M/). Then M x c M 2 cz ♦ • • is an ascending 
chain of submodules of M, which must have a maximal element, say M r , so 
that M, = M r for r ^ i. Then /(M,) = M f and our assertion follows. 

Proposition 1.2. Let M be a module, N a submodule. Assume that N and 

M/N are Noetherian. Then M is Noetherian. 

Proof. With every submodule L of M we associate the pair of modules 
Lh-+(L n N,(L + N)/N). 

We contend: If E <= F are two submodules of M such that their associated 
pairs are equal, then E = F. To see this, let x e F. By the hypothesis that 
(E -f N)/N = (F + N)/N there exist elements u, v e N and y e E such that 
y -f u = x -f v. Then 


x — y = u- veFnN = Er\N. 

Since y e E, it follows the x e E and our contention is proved. If we have an 
ascending sequence 


c E 2 c • • • 
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then the associated pairs form an ascending sequence of submodules of N and 
M/N respectively, and these sequences must stop. Hence our sequence 
c - E 2 ' also stops, by our preceding contention. 

Propositions 1.1 and 1.2 may be summarized by saying that in an exact 
sequence 0 -► M' -► M -> M" -* 0, M is Noetherian if and only if M' and M" 
are Noetherian. 

Corollary 1.3. Let M be a module, and let N, N' be submodules. If 
M = N + N' and if both N, N' are Noetherian, then M is Noetherian. A 
finite direct sum of Noetherian modules is Noetherian. 

Proof. We first observe that the direct product N x N' is Noetherian 
since it contains N as a submodule whose factor module is isomorphic to N', 
and Proposition 1.2 applies. We have a surjective homomorphism 

N x N' -+ M 

such that the pair (x, x' ) with x e N and x' e N' maps onx-f- x'. By Prop- 
osition 1.1, it follows that M is Noetherian. Finite products (or sums) follow 
by induction. 

A ring A is called Noetherian if it is Noetherian as a left module over itself. 
This means that every left ideal is finitely generated. 

Proposition 1.4. Let Abea Noetherian ring and let M be a finitely generated 
module. Then M is Noetherian. 

Proof. Let Xj, ..., x n be generators of M. There exists a homomorphism 
f:AxAx--xA-*M 
of the product of A with itself n times such that 

/(«i. •••.«») = « 1*1 + • • • + a n x n . 

This homomorphism is surjective. By the corollary ofthe preceding proposition, 
the product is Noetherian, and hence M is Noetherian by Proposition 1.1. 

Proposition 1.5. Let Abe a ring which is Noetherian, and let cp: A -► B be 
a surjective ring-homomorphism. Then B is Noetherian. 

Proof. Let c • • • c b„ c ■ • • be an ascending chain of left ideals of B 
and let a, = <p -1 (b f ). Then the a, form an ascending chain of left ideals of A 
which must stop, say at a r . Since <p(a f ) = b, for all i, our proposition is proved. 

Proposition 1.6. Let A be a commutative Noetherian ring, and let S be a 
multiplicative subset of A. Then S~ l A is Noetherian. 

Proof. We leave the proof as an exercise. 
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Examples. In Chapter IV, we gave the fundamental examples of Noeth- 
erian rings, namely polynomial rings and rings of power series. The above 
propositions show how to construct other examples from these, by taking factor 
rings or modules, or submodules. 

We have already mentioned that for applications to algebraic geometry, it is 
valuable to consider factor rings of type k[X]/a , where û is an arbitrary ideal. 
For this and similar reasons, it has been found that the foundations should be 
laid in terms of modules, not just ideals or factor rings. Notably, we shall first 
see that the prime ideal associated with an irreducible algebraic set has an analogue 
in terms of modules. We shall also see that the decomposition of an algebraic 
set into irreducibles has a natural formulation in terms of modules, namely by 
expressing a submodule as an intersection or primary modules. 

In §6 we shall apply some general notions to get the Hilbert polynomial of 
a module of finite length, and we shall make comments on how this can be 
interpreted in terms of geometric notions. Thus the present chapter is partly 
intended to provide a bridge between basic algebra and algebraic geometry. 


§2. ASSOCIATED PRIMES 

Throughout this section , we let Abe a commutative ring. Modules and homo- 
morphisms are A-modules and A-homomorphisms unless otherwise specified. 

Proposition 2.1. Let S be a multiplicative subset of A , and assume that S 
does not contain 0. Then there exists an ideal of A which is maximal in the 
set of ideals not intersecting S, and any such ideal is prime. 

Proof. The existence of such an ideal p follows from Zorn’s lemma (the 
set of ideals not meeting S is not empty, because it contains the zero ideal, and is 
clearly inductively ordered). Let p be maximal in the set. Let n, b e A, ab e p, 
but a $ p and b $ p. By hypothesis, the ideals ( a , p) and ( b , p) generated by a 
and p (or b and p respectively) meet S, and there exist therefore elements 
s, s' e S, c, c\ x, x' e A, p, p' e p such that 

s = ca + xp and s' = c'b + x'p'. 

Multiplying these two expressions, we obtain 

ss' = cc'ab + p" 

with some p" e p, whence we see that ss' lies in p. This contradicts the fact 
that p does not intersect S, and proves that p is prime. 

An element a of A is said to be nilpotent if there exists an integer n ^ 1 such 
that a n = 0. 
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Corollary 2.2. An element a of A is nilpotent if and only if it lies in every 

prime ideal of A. 

Proof. If a n = 0, then a n e p for every prime p, and hence aep. If a n # 0 
for any positive integer n, we let S be the multiplicative subset of powers of a, 
namely {1, a, a 2 , ...}, and find a prime ideal as in the proposition to prove the 
converse. 

Let a be an ideal of A. The radical of a is the set of all a e A such that a n e a 
for some integer n ^ 1, (or equivalently, it is the set of elements ae A whose 
image in the factor ring A/a is nilpotent). We observe that the radical of û is an 
ideal, for if a n = 0 and b m = 0 then (a + b) k = 0 if k is sufficiently large: In the 
binomial expansion, either a or b will appear with a power at least equal to 
n or m. 

Corollary 2.3. An element a of A lies in the radical of an ideal a if and only 

if it lies in every prime ideal containing a. 

Proof. Corollary 2.3 is equivalent to Corollary 2.2 applied to the ring A/a. 

We shall extend Corollary 2.2 to modules. We first make some remarks on 
localization. Let S be a multiplicative subset of A. If M is a module, we can 
define S~ l M in the same way that we defined S~ l A. We consider equivalence 
classes of pairs (x, s) with xeM and seS, two pairs (x,s) and (x',s') being 
equivalent if there exists Sj e S such that Sj(s'x - sx') = 0. We denote the 
equivalence class of (x, s) by x/s, and verify at once that the set of equivalence 
classes is an additive group (under the obvious operations). It is in fact an 
/1-module, under the operation 


(a, x/s) i—► ax/s. 

We shall denote this module of equivalence classes by S~ l M. (We note that 
S~ l M could also be viewed as an S~ M-module.) 

Ifp is a prime ideal of A , and S is the complement of p in A , then S l M is 
also denoted by M p . 

It follows trivially from the definitions that if N -► M is an injective homo- 
morphism, then we have a natural injection S~ l N -► S~ l M. In other words, if 
N is a submodule of M, then S~'N can be viewed as a submodule of 5 _1 M. 
If x g N and s e 5, then the fraction x/s can be viewed as an element of S~*N 
or S~ l M. If x/s = 0 in 5“ l M, then there exists Sj e S such that s^x = 0, and 
this means that x/s is also 0 in S~^N. Thus if p is a prime ideal and N is a sub- 
module of M, we have a natural inclusion of N p in M v . We shall in fact identify 
N v as a submodule of M p . In particular, we see that M p is the sum of its sub- 
modules ( Ax) v , for x e M (but of course not the direct sum). 

Let x e M. The annihilator a of x is the ideal consisting of all elements 
ae A such that ax = 0. We have an isomorphism (of modules) 


A/a ^ Ax 



418 NOETHERIAN RINGS AND MODULES 


X, §2 


under the map 

a -* ax. 

Lemnoa 2.4. Let x be an element of a module M, and let a be its annihilator. 

Let p be a prime ideal of A. Then (Ax) p # 0 if and only if p contains û. 

Proof. The lemma is an immediate consequence of the definitions, and 
will be left to the reader. 

Let a be an element of A. Let M be a module. The homomorphism 

x t—> ax, x e M 

will be called the principal homomorphism associated with a , and will be de- 
noted by a M . We shall say that a M is locally nilpotent if for each xe M there 
exists an integer n(x) ^ 1 such that a n{x) x = 0. This condition implies that 
for every finitely generated submodule N of M, there exists an integer n ^ 1 
such that a n N = 0: We take for n the largest power of a annihilating a finite 
set of generators of N. Therefore, if M is finitely generated , a M is locally 
nilpotent if and only if it is nilpotent. 

Proposition 2.5. Let M be a module , a e A. Then a M is locally nilpotent 

if and only if a lies in every prime ideal p such that M p # 0. 

Proof. Assume that a M is locally nilpotent. Let p be a prime of A such 
that M p # 0. Then there exists xe M such that (Ax) p # 0. Let nbea positive 
integer such that a n x = 0. Let a be the annihilator of x. Then a n e a, and hence 
we can apply the lemma, and Corollary 4.3 to conclude that a lies in every prime 
p such that M p # 0. Conversely, suppose a M is not locally nilpotent, so there 
exists x e M such that a n x = 0 for all n ^ 0. Let S = {1, a , a 2 , . . .}, and 
using Proposition 2.1 let p be a prime not intersecting S. Then (Ar) p 4 0, so 
M p 4 0 and a £ p, as desired. 

Let M be a module. A prime ideal p of A will be said to be associated with 
M if there exists an element x e M such that p is the annihilator of x. In par- 
ticular, since p 4 A, we must have x # 0. 

Proposition 2.6. Let M bea module 4 0. Let p be a maximal element in the 

set ofideals which are annihilators of elements x e M, x 4 0. Then p is prime. 

Proof. Let p be'the annihilator of the element x # 0. Then p 4 A. Let 
a 9 b e A, ab e p, a <£ p. Then ax 4 0. But the ideal (b 9 p) annihilates ax 9 and 
contains p. Since p is maximal, it follows that b e p, and hence p is prime. 

Corollary 2.7. IfAis Noetherian and M is a module 4= 0, then there exists 

a prime associated with M. 

Proof. The set of ideals as in Proposition 2.6 is not empty since M 4 0, 
and has a maximal element because A is Noetherian. 
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Corollary 2.8. Assume that both A and M are Noetherian , M # 0. Then 
there exists a sequence of submodules 

M = Mj M 2 => • • • 3 M r = 0 

such that each factor module is isomorphic to Afa for some 

prime p f . 

Proof. Consider the set of submodules having the property described in 
the corollary. It is not empty, since there exists an associated prime p of M, 
and if p is the annihilator of x, then Ax » A/p. Let N be a maximal element in 
the set. If JV # M, then by the preceding argument applied to M/N, there exists 
a submodule N' of M containing N such that N'/N is isomorphic to A/p for 
some p, and this contradicts the maximality of N. 

Proposition 2.9. Let A be Noetherian , and a e A. Let M be a module. 
Then a M is injective if and only if a does not lie in any associated prime of M. 

Proof. Assume that a M is not injective, so that ax = 0 for some xeM, 
x ± 0. By Corollary 2.7, there exists an associated prime p of Ax , and a is an 
element of p. Conversely, if a M is injective, then a cannot lie in any associated 
prime because a does not annihilate any non-zero element of M. 

Proposition 2.10. Let A be Noetherian , and let M be a module. Let a e A. 
The follomng conditions are equivalent: 

(i) a M is locally nilpotent. 

(ii) a lies in every associated prime of M. 

(iii) a lies in every prime p such that M p ^ 0. 

// p is a prime such that M p 4 0, then p contains an associated prime of M. 

Proof. The fact that (i) implies (ii) is obvious from the definitions, and 
does not need the hypothesis that A is Noetherian. Neither does the fact that 
(iii) implies (i), which has been proved in Proposition 2.5. We must therefore 
prove that (ii) implies (iii) which is actually implied by the last statement. The 
latter is proved as follows. Let p be a prime such that M p 4 0. Then there exists 
x e M such that (Ax) p 4 0. By Corollary 2.7, there exists an associated prime 
q of (Ax) p in A. Hence there exists an element y/s of (Ax) p , with y e Ax, 
s £ p, and y/s 4 0, such that q is the annihilator of y/s. It follows that q (= p, 
for otherwise, there exists b e q, b £ p, and 0 = by/s , whence y/s = 0, contra- 
diction. Let b x ,. . . , b n be generators for q. For each /, there exists s t e A, 
St £ p, such that s { b t y = 0 because bj/s = 0. Let t = s Y • • • s n . Then it is 
trivially verified that q is the annihilator of ty in A. Hence q c= p, as desired. 

Let us define the support of M by 

supp(M) = set of primes p such that M p # 0. 
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We also have the annihilator of M, 

ann(M) = set of elements a e A such that aM = 0. 

We use the notation 

ass(M) = set of associated primes of M. 

For any ideal a we have its radical, 

rad(a) = set of elements ae A such that a n e a for some integer n ^ 1. 

Then for finitely generated M , we can reformulate Proposition 2.10 by the 
following formula: 

rad(ann(M)) = f| P = 0 P- 

pesupp(M) peass(M) 


Corollary 2.11. Let A be Noetherian , and let M be a module. The follomng 
conditions are equivalent : 

(i) There exists only one associated prime of M. 

(ii) We have M # 0, and for every a e A, the homomorphism a M is injective , 
or locally nilpotent. 

If these conditions are satisfied , then the set of elements aeA such that a M 
is locally nilpotent is equal to the associated prime of M. 

Proof. Immediate consequence of Propositions 2.9 and 2.10. 


Proposition 2.12. Let N be a submodule of M. Every associated prime of 
N is associated mth M also. An associated prime of M is associated mth N 
or mth M/N. 

Proof. The first assertion is obvious. Let p be an associated prime of M, 
and say p is the annihilator of the element x # 0. If Ax n N = 0, then Ax is 
isomorphic toasubmodule ofM/N, andhence pis associated v/ithM/N. Suppose 
Ajc fl N 4= 0. Let y = ax e N with a e A and y =/= 0. Then p annihilates y. 
We claim p = ann(y). Let b e A and by = 0. Then ba e p but a £ p, so 
b e p. Hence p is the annihilator of y in A, and therefore is associated with 
N, as was to be shown. 
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§3. PRIMARY DECOMPOSITION 

We continue to assume that A is a commutative ring, and that modules ( resp. 

homomorphisms) are A-modules ( resp. A-homomorphisms), unless othermse 

specified. 

Let M be a module. A submodule Q of M is said to be primary if Q # M, 
and if given a e A, the homomorphism a M/Q is either injective or nilpotent. 
Viewing A as a module over itself, we see that an ideal q is primary if and only 
if it satisfies the following condition: 

Given a, b e A, ab e q and a $ q, then b n g q for some n ^ 1. 

Let Q be primary. Let p be the ideal of elements ae A such that a M/Q is 

nilpotent. Then p is prime. Indeed, suppose that a, b e A, ab e p and a $ p. 
Then a M/Q is injective, and consequently a M/Q is injective for all n ^ 1. Since 
(ab) M/Q is nilpotent, it follows that b MIQ must be nilpotent, and hence that b e p, 
proving that p is prime. We shall call p the prime belonging to Q, and also say 
that Q is p-primary. 

We note the corresponding property for a primary module Q with prime p: 

Let b e A and x e M be such that bx e Q. Ifx £ Q then b e p. 

Examples. Let m be a maximal ideal of A and let q be an ideal of A such 

that m^Cq for some positive integer k. Then q is primary, and m belongs to 
q. We leave the proof to the reader. 

The above conclusion is not always true if m is replaced by some prime ideal 
p. For instance, let R be a factorial ring with a prime element t. Let A be the 
subring of polynomials f(X) e R[X ] such that 

f(X) = a 0 + a x X + ... 

with a x divisible by t. Let p = (tX, X 2 ). Then p is prime but 

p 2 = (t 2 X 2 , tx 3 , X 4 ) 

is not primary, as one sees because X 2 $ p 2 but t k $ p 2 for all 1, yet 
t 2 X 2 ep 2 . 

Proposition 3.1. Let M be a module, and Q x , . . . , Q r submodules which are 

p-primary for the same prime p. Then Q x n • • • n Q r is also p-primary. 

Proof. Let Q = Q x n • • • n Q r . Let aep. Let n t be such that (a M/Q ) ni = 0 
for each i = 1,..., r and let n be the maximum of n ì9 ..., n r . Then a M/Q = 0, 
so that a M/Q is nilpotent. Conversely, suppose a $ p. Let xe M, x$Qj for 
some j. Then a n x $ Q } for all positive integers n, and consequently a M/Q is 
injective. This proves our proposition. 
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Let iVbea submodule of M. When N is written as a finite intersection of 
primary submodules, say 

N = (2i n ••• n Q r , 

we shall call this a primary decomposition of N. Using Proposition3.1, we 
see that by grouping the g, according to their primes, we can always obtain 
from a given primary decomposition another one such that the primes belonging 
to the primary ideals are all distinct. A primary decomposition as above such 
that the prime ideals p t ,..., p r belonging to Q l9 ..., Q r respectively are distinct, 
and such that N cannot be expressed as an intersection of a proper subfamily 
of the primary ideals {<2 1 , • • • > Qrì w iU ^e said to be reduced. By deleting some' 
of the primary modules appearing in a given decomposition, we see that if N 
admits some primary decomposition, then it admits a reduced one. We shall 
prove a result giving certain uniqueness properties of a reduced primary 
decomposition. 

Let N be a submodule of M and let jc i—> jc be the canonical homomorphism. 
Let Q be a submodule of M = M/N and let Q be its inverse image in M. Then 
directly from the definition, one sees that Q is primary if and only if Q is primary; 
and if they are primary, then the prime belonging to Q is also the prime belonging 
to Q. Furthermore, if N = Q x D . . . D Q r is a primary decomposition of N in 
M, then 

(0) = Q x n .. . n Q r 

is a primary decomposition of (0) in M, as the reader will verify at once from 
the definitions. In addition, the decomposition of N is reduced if and only if the 
decomposition of (0) is reduced since the primes belonging to one are the same 
as the primes belonging to the other. 

Let Q x n • • n Q r = N be a reduced primary decomposition, and let p f 
belong to Q t . If p, does not contain p,- ( j / i) then we say that p t is ísolated. 
The isolated primes are therefore those primes which are minimal in the set 
of primes belonging to the primary modules Q t . 

Theorem 3.2. Let N be a submodule of M, and let 

N = <2i n ••• n Q r = Q\ n ••• n Q' s 

be a reduced primary decomposition of N. Then r = s. The set of primes 
belonging to Q í9 ... 9 Q r and Q\,... , Q' r is the same. If {p^,..., p m } is the 
set of isolated primes belonging to these decompositions , then Q t = Q\ for 
i = l 9 ...,m, in other words , the primary modules corresponding to isolated 
primes are uniquely determined. 

Proof. The uniqueness of the number of terms in a reduced decomposition 
and the uniqueness of the family of primes belonging to the primary components 
will be a consequence of Theorem 3.5 below. 
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There remains to prove the uniqueness of the primary module belonging 
to an isolated prime, say p v By definition, for each j = 2,..., r there exists 
a } e pj and a } $ p v Let a = a 2 • • • a r be the product. Then a e Pj for all j > 1, 
but a i p v We can find an integer n ^ 1 such that a n MÌQ = 0 for j = 2,..., r. 
Let 

N { = set of x e M such that a n x e N. 

We contend that Q x = N v This will prove the desired uniqueness. Let xeQ v 
Then a n x e Q { n • • • n Q r = N , so x e N v Conversely, let x e N ì9 so that 
a n x e N, and in particular a n x e Q v Since a $ p u we know by definition that 
a M , Qx is injective. Hence xeQ u thereby proving our theorem. 

Theorem 3.3. Let M be a Noetherian module. Let N be a submodule of 
M. Then N admits a primary decomposition. 

Proof. We consider the set of submodules of M which do not admit a 
primary decomposition. If this set is not empty, then it has a maximal element 
because M is Noetherian. Let N be this maximal element. Then N is not 
primary, and there exists ae A such that a M/N is neither injective nor nilpotent. 
The increasing sequence of modules 

Ker a M/N <= Ker a 2 M/N <= Ker a 3 M/N <=■■■ 

stops, say at a M/N . Let (p \ M/N -+ M/N be the endomorphism cp = a M/N . 
Then Ker (p 2 = Ker q>. Hence 0 = Ker (p n Im (p in M/N, and neither the 
kernel nor the image of q> is 0. Taking the inverse image in M, we see that N is 
the intersection of two submodules of M, unequal to N. We conclude from the 
maximality of N that each one of these submodules admits a primary de- 
composition, and therefore that N admits one also, contradiction. 

We shall conclude our discussion by relating the primes belonging to a 
primary decomposition with the associated primes discussed in the previous 
section. 

Proposition 3.4. Let A and M be Noetherian. A submodule Q of M is 
primary if and only if M/Q has exactly one associated prime p, and in that 
case , p belongs to Q , i.e. Q is p-primary. 

Proof. Immediate consequence of the definitions, and Corollary 2.11. 

Theorem 3.5. Let A and M be Noetherian. The associated primes of M 
are precisely the primes which belong to the primary modules in a reduced 
primary decomposition of 0 in M. In particular , the set of associated primes 
of M is finite. 

Proof. Let 


0 = Qi n • • • n Q x 
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be a reduced primary decomposition of 0 in M. We have an injective homo- 
morphism 


M _ 0AÍ/Q,. 

i= 1 

By Proposition 2.12 and Proposition 3.4, we conclude that every associated 
prime of M belongs to some Q { . Conversely, let N = Q 2 n • n Q r . Then 
N # 0 because our decomposition is reduced. We have 

N = N/(N n Ôi) « (N + ÔO/Ôi <= Aí/Gi- 

Hence N is isomorphic to a submodule of M/Q u and consequently has an 
associated prime which can be none other than the prime belonging to Q v 
This proves our theorem. 

Theorem 3.6. Let A be a Noetherian ring. Then the set of divisors of zero 
in A is the set-theoretic union of all primes belonging to primary ideals in a 
reduced primary decomposition of 0. 

Proof. An element of a e A is a divisor of 0 if and only if a A is not injective. 
According to Proposition 2.9, this is equivalent to a lying in some associated 
prime of A (viewed as module over itself). Applying Theorem 3.5 concludes the 
proof. 


§4. NAKAYAMA’S LEMMA 

We let A denote a commutative ring , but not necessarily Noetherian. 

When dealing with modules over a ring, many properties can be obtained 
fìrst by localizing, thus reducing problems to modules over local rings. In practice, 
as in the present section, such modules will be finitely generated. This section 
shows that some aspects can be reduced to vector spaces over a field by reducing 
modulo the maximal ideal of the local ring. Over a field, a module always has 
a basis. We extend this property as far as we can to modules finite over a local 
ring. The first three statements which follow are known as Nakayama’s lemma. 

Lemma 4.1. Let û be an ideal ofA which is contained in every maximal ideal 
of A. Let E be a finitely generated A-module. Suppose that aE = E. Then 
E = {0}. 
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Proof. Induction on the number of generators of E. Let x l9 ..., x s be 
generators of E. By hypothesis, there exist elements a u ... 9 a s e a such that 

x s = tfjXj + • • • + a s x s9 

so there is an element a (namely a s ) in a such that (1 + à)x s lies in the module 
generated by the first s — 1 generators. Furthermore 1 + a is a unit in A, 
otherwise 1 + a is contained in some maximal ideal, and since a lies in all 
maximal ideals, we conclude that 1 lies in a maximal ideal, which is not possible. 
Hence x s itself lies in the module generated by s - 1 generators, and the proof 
is complete by induction. 

Lemma 4.1 applies in particular to the case when A is a local ring, and 
a = m is its maximal ideal. 

Lemma 4.2. Let Abe a local ring , let Ebe a finitely generated A-module , and 

F a submodule. If E = F + m E, then E = F. 

Proof. Apply Lemma 4.1 to E/F. 

Lemma 4.3. Let A be a local ring. Let E be a finitely generated A-module. 

If X\, . . . , x n are generators for E mod m E, then they are generators for E. 

Proof. Take F to be the submodule generated by x u ..., 

Theorem 4.4. Let A be a local ring and E a fìnite projective A-module. 

Then E is free. In fact , if x u ..., x n are elements of E whose residue classes 

Xj,..., x n are a basis of E/mE over A/ m, then x l9 ..., x n are a basis of E 

over A. If x l9 ... 9 x r are such that 5c l9 ..., x r are linearly independent over 

A/m 9 then they can be completed to a basis of E over A. 

Proof. I am indebted to George Bergman for the following proof of the 
first statement. Let F be a free module with basis e l9 . . . , e n9 and let/: F —> E 
be the homomorphism mapping e t to x,-. We want to prove that /is an isomor- 
phism. By Lemma 4.3, / is surjective. Since E is projective, it follows that / 
splits, i.e. we can write F = P () © P l9 where P 0 = Ker/and P\ is mapped 
isomorphically onto E by /. Now the linear independence of x l9 . . . , x n mod 
m E shows that 


P a C m E = m P 0 C mP\. 

Hence P 0 C m P a . Also, as a direct summand in a finitely generated module, P () 
is finitely generated. So by Lemma 4.3, P 0 = (0) and /is an isomorphism, as 
was to be proved. 

As to the second statement, it is immediate since we can complete a given 
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sequence x\, ... 9 x r with x Ì9 ... 9 x r linearly independent over A/ m, to a 
sequence x\ 9 ... 9 x n with x Ì9 ... 9 x n lineary independent over A/ m, and then 
we can apply the first part of the proof. This concludes the proof of the theorem. 

Let E be a module over a local ring A with maximal ideal m. We let 
£(m) = E/mE. If /: £ -► F is a homomorphism, then / induces a homo- 
morphism 

/ (m) :E(m)->F(rn). 

If / is surjective, then it follows trivially that / (m) is surjective. 


Proposition 4.5. Let f:E->Fbea homomorphism of modules, fìnite over a 
localringA. Then: 

(i) // / (m) is surjective , so is f. 

(ii) Assume f is injective. If / (m) is surjective , then f is an isomorphism. 

(iii) Assume that £, F are free. /// (m) is injective ( resp. an isomorphism) then 
f is injective (resp. an isomorphism). 

Proof. The proofs are immediate consequences of Nakayama’s lemma and 
will be left to the reader. For instance, in the first statement, consider the exact 
sequence 


E F -+ F/ Im / -» 0 

and apply Nakayama to the term on the right. In (iii), use the lifting of bases 
as in Theorem 4.4. 


§5. FILTERED AND GRADED MODULES 

Let A be a commutative ring and E a module. By a filtration of E one means 
a sequence of submodules 

E = E 0 zd E x zd E 2 • • • =5 E n => • • • 

Strictly speaking, this should be called a descending filtration. We don’t 
consider any other. 

Example. Let a be an ideal of a ring A, and E an A- module. Let 

E n = a n E. 

Then the sequence of submodules {E n } is a filtration. 

More generally, let {E n } be any filtration of a module E. We say that it is 
an a-fìltration if aE n c E n+Í for all n. The preceding example is an a-filtration. 
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We say that an a-filtration is a-stable, or stable if we have aE n = E n+1 for all n 
sulficiently large. 

Proposition 5.1. Let {E n } and {E n } be stable a-filtrations of E. Then there 
exists a positive integer d such that 

E n+d cz E n and E n+d c= E n 


for all n ^ 0. 

Proof . It suffices to prove the proposition when E' n = a n E. Since 
aE n ci E n+1 for all n, we have a n E c= E n . By the stability hypothesis, there 
exists d such that 


E n+d = a n E d c= a"£, 
which proves the proposition. 

A ring A is called graded (by the natural numbers) if one can write A as a 
direct sum (as abelian group), 

A = ©A„, 

/1 = 0 

such that for all integers m, n ^ 0 we have A n A m c= A n+m . It follows in par- 
ticular that A 0 is a subring, and that each component A n is an A 0 -module. 

Let A be a graded ring. A module E is called a graded module if E can be 
expressed as a direct sum (as abelian group) 

E = © E n , 

/1 = 0 

such that A n E m c= E n+m . In particular, E n is an A 0 -module. Elements of E n are 
then called homogeneous of degree n. By definition, any element of E can be 
written uniquely as a finite sum of homogeneous elements. 

Example. Let k be a field, and let X 0 ,..., X r be independent variables. 
The polynomial ring A = k[_X 0 ,..., X r ~\ is a graded algebra, with k = A 0 . 
The homogeneous elements of degree n are the polynomials generated by the 
monomials in X 0i ..., X r of degree n, that is 

r 

X%> • • • X d / with X d i = n ' 

i = 0 

An ideal / of A is called homogeneous if it is graded, as an A-module. If this 
is the case, then the factor ring A/I is also a graded ring. 

Proposition 5.2. Let Abe a graded ring. Then A is Noetherian if and only 
if A 0 is Noetherian , and A is finitely generated as A 0 -algebra. 
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Proof. A finitely generated algebra over a Noetherian ring is Noetherian, 
because it is a homomorphic image of the polynomial ring in finitely many 
variables, and we can apply Hilbert’s theorem. 

Conversely, suppose that A is Noetherian. The sum 

A + = ®A„ 

n = 1 

is an ideal of A, whose residue class ring is A 0ì which is thus a homomorphic 
image of A, and is therefore Noetherian. Furthermore, A + has a finite number 
of generators x l9 ..., x s by hypothesis. Expressing each generator as a sum of 
homogeneous elements, we may assume without loss of generality that these 
generators are homogeneous, say of degrees d í9 ..., d 5 respectively, with all 
di > 0. Let B be the subring of A generated over A 0 by x l9 ..., x s . We claim 
that A n cz B for all n. This is certainly true for n = 0. Let n > 0. Let x be 
homogeneous of degree n. Then there exist elements a x e A n _ di such that 

s 

x = £a,Xi. 

i= l 

Since di > 0 by induction, each a x is in A 0 [x x ,..., xj = B, so this shows x e B 
also, and concludes the proof. 

We shall now see two ways of constructing graded rings from filtrations. 
First, let A be a ring and a an ideal. We view A as a fìltered ring, by the 
powers a”. We define the fìrst associated graded ring to be 

S.(A) = S = ® «-. 

« = 0 

Similarly, if E is an /1-module, and E is filtered by an a-filtration, we define 

Es= ©£„• 

n = 0 

Then it is immediately verified that E s is a graded S-module. 

Observe that if A is Noetherian, and a is generated by elements x l9 ... 9 x s 
then S is generated as an /1-algebra also by x ì9 ..., x s , and is therefore also 
Noetherian. 

Lemma 5.3. Let Abe a Noetherian ring , and E a finitely generated module , 
with an a-filtration. Then E s is finite over S if and only if the filtration of E 
is a-stable. 


F n =@Ei, 


Proof. Let 
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and let 

G„ = E 0 © • • • ® E n © aE„ © a 2 E„ © a 3 E„ © • • • 

Then G n is an S-submodule of £ s , and is finite over S since F n is finite over A. 
We have 

G n c 1 and U G n = F s . 

Since S is Noetherian, we get: 

E s is finite over S <=> E s = G N for some N 

oE N + m = a"'£ N for all m ^ 0 
<=> the filtration of £ is a-stable. 

This proves the lemma. 

Theorem 5.4. (Artin-Rees). Let A be a Noetherian ring, a an ideal, E a 
finite A-module with a stable a-filtration. Let F be a submodule, and let 
F n = F n E n . Then {£„} is a stable a-filtration of F. 

Proof. We have 

a(£ n £„) cz aF n aE n a F n E n+ ,, 

so {£„} is an a-filtration of F. We can then form the associated graded S-module 
/^,which is a submodule of E s , and ìs finite over S since S is Noetherian. We 
apply Lemma 5.3 to conclude the proof. 

We reformulate the Artin-Rees theorem in its original form as follows. 

Corollary 5.5. Let A be a Noetherian ring, E a finite A-module, and F a 
submodule. Let a be an ideal. There exists an integer s such that for all 
integers n ^ s we have 

a n E n £ = a n ~\a s E n £). 

Proof. Special case of Theorem 5.4 and the definitions. 

Theorem 5.6. (Krull). Let A be a Noetherian ring, and let a be an ideal 
contained in every maximal ideal of A. Let E be a finite A-module. Then 

n = o* 

n = 1 

Proof. Let F = f) a n E and apply Nakayama’s lemma to conclude the 
proof. 



430 NOETHERIAN RINGS AND MODULES 


X, §5 


Corollary 5.7. Let obea local Noetherian ring with maximal ideal m. Then 


f) m" = 0. 

n = 1 

Proof. Special case of Theorem 5.6 when E = A. 

The second way of forming a graded ring or module is done as follows. Let 
A be a ring and û an ideal of A. We define the second associated graded ring 

gr a (A)= ©a"/a" + 1 . 

n = 0 

Multiplication is defined in the obvious way. Let a e a n and let a denote its 
residue class mod a n+l . Let bea m and let h denote its residue class mod a m+ì . 
We define the product âh to be the residue class of ab mod a m+n+1 . It is easily 
verified that this definition is independent of the choices of representatives and 
defines a multiplication on gr 0 (/4) which makes gr a (A) into a graded ring. 

Let E be a filtered /1-module. We define 

gr(E)= @ EJE„ +i . 

n = 0 

If the filtration is an a-filtration, then gr(£) is a graded gr 0 (>l)-module. 

Proposition 5.8. Assume that A is Noetherian , and let a be an ideal of A. 
Then gr 0 (/l) is Noetherian. IfE is a finite A-module with a stable a-filtration , 
then gr (E) is a fnite gr a (A)-module. 

Proof. Let x ì9 ...,x s be generators of a. Let x, be the residue class of x t 
in a/a 2 . Then 


gr 0 (/l) = (A/a)[x u ..., xj 

is Noetherian, thus proving the first assertion. For the second assertion, we 
have for some d, 


E d+m = a m E d for all m ^ 0. 

Hence gr(£) is generated by the finite direct sum 

gr(£) 0 ® • • • © gr (E) d . 

But each gr (E) n = E n /E n+Ì is finitely generated over A 9 and annihilated by a, 
so is a finite /1/a-module. Hence the above finite direct sum is a finite A/a- 
module, so gr(£) is a finite gr 0 (^)-module, thus concluding the proof of the 
proposition. 
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§6. THE HILBERT POLYNOMIAL 

The main point of this section is to study the lengths of certain filtered 
modules over local rings, and to show that they are polynomials in appropriate 
cases. However, we first look at graded modules, and then relate filtered 
modules to graded ones by using the construction at the end of the preceding 
section. 

We start with a graded Noetherian ring together with a finite graded A-module 
E , so 

oc oc 

A = © A n and E = 0 E n- 
/2 = 0 / 2=0 


We have seen in Proposition 5.2 that A 0 is Noetherian, and that A is a finitely 
generated A 0 -algebra. The same type of argument shows that£has a finite number 
of homogeneous generators, and E n is a finite A 0 -module for all n ^ 0. 

Let <p be an Euler-Poincaré Z-valued function on the class of all finite 
A 0 -modules, as in Chapter III, §8. We define the Poincaré series with respect 
to <p to be the power series 


P v (E,t)= X <p(EX e Z[[t]]. 

/1 = 0 

We write P(E , í) instead of P^E, t ) for simplicity. 

Theorem 6.1. (Hilbert-Serre). Let s be the number of generators of A as 
A 0 -algebra. Then P(£, t) is a rational function of type 


P(E , 0 = 


m 

fi (l - t d o 


vví'r/i suitable positive integers d { , and f(t) e Z[f]. 


Proof. Induction on s. For s = 0 the assertion is trivially true. Let s= 1. 
Let A = AoIjc,, deg. x, = d, § 1. Multiplication by x s on E gives rise 

to an exact sequence 


0 


K„ 


E n * E n+a 


L n +d, 0 - 


Let 


K = @K„ and L = ®L„. 
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Then K , L are finite A-modules (being submodules and factor modules of E), 
and are annihilated by x s , so are in fact graded A 0 [ x x ,... , jc^.J-modules. By 
definition of an Euler-Poincaré function, we get 

cp(K n ) - (p(E n ) + (p(E n+ds ) - (p(L n + ds ) = 0. 

Multiplying by t n+ds and summing over n, we get 

(1 - t ds )P(E, t) = P(L, t) - t ds P(K, t) + g(t), 

where g(t) is a polynomial in Z[í]. The theorem follows by induction. 

Remark. In Theorem 6.1, if A = A 0 \_x u ..., x s ] then d, = deg x, as shown 
in the proof. The next result shows what happens when all the degrees are 
equal to 1. 

Theorem 6.2. Assume that A is generated as an A 0 -algebra by homogeneous 
elements of degree 1. Let d be the order of the pole of P(E, t) at t = 1. Then 
for all sufficiently large n, (p(E n ) is a polynomial in n of degree d — 1. (For 
this statement, the zero polynomial is assumed to have degree — 1.) 

Proof By Theorem 6.1, <p(E n ) is the coefficient of t n in the rational function 

p(e, t) = /(o/(i - ty. 

Cancelling powers of 1 - t, we write P(E, t) = h(t)/( 1 - t) d , and h( 1) + 0, with 
h(t) e Z[f]. Let 


Kt) = Y. a k tk 


We have the binomial expansion 


i : -7 > 

For convenience we let ^ = 0 for n ^ 0 and ^ = 1 for n = — 1. We 

then get 

(d + n — k — l\ 

<P(E„) = X a kí d _ j j for a11 n^m. 

The sum on the right-hand side is a polynomial in n with leading term 


(Z a *) 


„( 1 - 1 


(d - 1)! 


# 0. 


This proves the theorem. 
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The polynomial of Theorem 6.2 is called the Hilbert polynomial of the 
graded module £, with respect to cp. 

We now put together a number of results of this chapter, and give an application 
of Theorem 6.2 to certain filtered modules. 

Let A be a Noetherian local ring with maximal ideal m. Let q be an m- 
primary ideal. Then A/q is also Noetherian and local. Since some power of m 
is contained in q, it follows that A/q has only one associated prime, viewed as 
module over itself, namely m/q itself. Similarly, if M is a finite A/q-module, 
then M has only one associated prime, and the only simple /1/q-module is in 
fact an A/m-moduìe which is one-dimensional. Again since some power of m 
is contained in q, it follows that A/q has finite length, and M also has finite 
length. We now use the length function as an Euler-Poincaré function in 
applying Theorem 6.2. 

Theorem 6.3. Let A be a Noetherian local ring with maximal ideal m. 

Let q be an m-primary ideal , and let Ebe a finitely generated A-module , with 

a stabie q-filtration. Then : 

(i) E/E n has finite length for n ^ 0. 

(ii) For all sujficiently large n , this length is a polynomial g(rì) of degree ^ s, 
where s is the least number of generators of q. 

(iii) The degree and leading coefficient of g(rì) depend only on E and q, but not 
on the chosen filtration. 

Proof. Let 


G = gr q (^l) = ®q7q" +1 . 

Then gr(£) = ® EJE n+í is a graded G-module, and G 0 = A/q. By Proposition 
5.8, G is Noetherian and gr (E) is a finite G-module. By the remarks preceding 
the theorem, E/E n has finite length, and if cp denotes the length, then 

cp(E/E n )= X cp(Ej- i/Ej). 
j= i 

If x,,..., x s generate q, then the images x ì9 ... , x s in q/q 2 generate G as A/q- 
algebra, and each x f has degree 1. By Theorem 6.2 we see that 

cp(EJE n+ì ) = h(n) 

is a polynomial in n of degree ^ s — 1 for sufficiently large n. Since 
cp(E/E n+ì ) - cp(E/E n ) = h(n ), 

it follows by Lemma 6.4 below that cp(E/E n ) is a polynomial g(n) of degree 
^ s for all large n. The last statement concerning the independence of the degree 
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of g and its leading coefficient from the chosen filtration follows immediately 
from Proposition 5.1, and will be left to the reader. This concludes the proof. 

From the theorem, we see that there is a polynomial Xe, q suc h that 
Xe,,(«) = length(£/q"£) 

for all sufficiently large n. If E = A , then ^ q is usually called the characteristic 
polynomial of q. In particular, we see that 

Xa ,,(«) = length(i4/q") 

for all sufficiently large n. 

For a continuation of these topics into dimension theory, see [AtM 69] and 
[Mat 80]. 

We shall now study a particularly important special case having to do with 
polynomial ideals. Let k be a field, and let 

A = k[X 0 , ...,X N ] 

be the polynomial ring in N + 1 variable. Then A is graded, the elements of 
degree n being the homogeneous polynomials of degree n. We let abea homo- 
geneous ideal of A , and for an integer n ^ 0 we define: 

<p(n) = dim* A n 

<p(n , û ) = dim* û„ 

X(n , a) = dim* A n /a n = dim* A n - áim k a n = <p(n) - <p(n , û). 

As earlier in this section, A n denotes the /:-space of homogeneous elements of 
degree n in A, and similarly for a„. Then we have 

(N + n\ 

^ ~ \ N } 

We shall consider the binomial polynomial 

(T\ _ T(T- 1) ■■■(T- d+ 1) T d 
(1) -= jT + l° wer terms. 

If/is a function, we define the difference function A/by 

A f(T) =f(T+ 1) — f(T). 

Then one verifies directly that 
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Lemma 6.4. Let P e Q[7] be a polynomial of degree d with rational 
coeffìcients. 

(a) If P{n) e Z for all suffìciently large integers n, then there exist integers 
c 0 , ..., c d such that 

,m -‘*C) +e 'C-,) + 

In particular, P{n) e Z for all integers n. 

(b) Iff: Z —» Z is any function, and ifthere exists a polynomial Q{T) e Q[r] 
such that Q(Z) C Z and A f(n) = Q(n) for all n sufftciently large, then 
there exists a polynomial P as in (a) such thatf(n) = P(n)for all n sufftciently 
large. 

Proof. We prove (a) by induction. If the degree of P is 0, then the assertion 
is obvious. Suppose deg P ^ 1. By (1) there exist rational numbers c 0 ,. . . , c d 
such that P(T) has the expression given in (a). But A P has degree strictly smaller 
than deg P. Using (2) and induction, we conclude that c 0 ,. .., c d _ x must be 
integers. Finally c d is an integer because P(n) e Z for n sufficiently large. This 
proves (a). 

As for (b), using (a), we can write 

<m = c °C_ i) + ••• + Cd ~' 

with integers c 0 ,.. ., c d - X . Let P x be the “integral” of Q , that is 

Pi(T) = c 0 Q + • • • + Q_,Q, so AP, = Q. 

Then A(/ - P x )(n) = 0 for all n sufficiently large. Hence (/ - P x )(n) is equal 
to a constant c d for all n sufficiently large, so we let P = P x + c d to conclude 
the proof. 

Proposition 6.5. Let a,b be homogeneous ideals in A. Then 
<p(n , û + b) = <p(n, û ) + <p(n, b) — <p(n, û fl b) 

X(n, a + b ) = *(n, a ) + *(>*, b) - x( n > a H b). 

Proof. The first is immediate, and the second follows from the definition 
of 
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Theorem 6.6. Let F be a homogeneous polynomial of degree d. Assume that 

F is not a divisor of zero mod û, that is: ifG e A, FG e a, then G e û. Then 

X(n , + ( F )) = ^(n,û) - x(n - d , û). 

Proo/. First observe that trivially 

<p(n , (F)) = <p(n - d ), 

because the degree of a product is the sum of the degrees. Next, using the 
hypothesis that F is not divisor of 0 mod a, we conclude immediately 

<p(n , a n (F)) = <p(n - d 9 a). 

Finally, by Proposition 6.5 (the formula for x) 9 we obtain: 

X(n, a + (F)) = x(n, û) + x(n, (F)) - x(n, û H (F)) 

= x(n , o) + 9(n) - <p(n 9 (F)) - <p(n) + <p(n , a n (F)) 

= *(w, a) - <p(n - d) + <p(n - d 9 a) 

= x(n 9 a) - *(n - d/,a) 
thus proving the theorem. 

We denote by m the maximal ideal m = (X 0 ,..., X N ) in A. We call m the 
irrelevant prime ideal. An ideal is called irrelevant if some positive power of 
m is contained in the ideal. In particular, a primary ideal q is irrelevant if and 
only if m belongs to q. Note that by the Hilbert nullstellensatz, the condition 
that some power of m is contained in a is equivalent with the condition that the 
only zero of a (in some algebraically closed field containing k) is the trivial zero. 

Proposition 6.7. Let abe a homogeneous ideal. 

(a) Ifa is irrelevant, then x(n , a) = 0 for n sufficiently large. 

(b) In general, there is an expression a = q j fì ... C\q s as a reduced primary 
decomposition such that all q, are homogeneous. 

(c) If an irrelevant primary ideal occurs in the decomposition, let b be the 
intersection of all other primary ideals. Then 

X(n, a) = x(n, b) 

for all n sufftciently large. 

Proof. For (a), by assumption we have A n = a„ for n sufficiently large, so 
the assertion (a) is obvious. We leave (b) as an exercise. As to (c), say q s is 
irrelevant, and let b = q { fl . . . fl q s -\. By Proposition 6.5, we have 

X(n 9 b + q s ) = x(n 9 b) + x(n 9 q s ) ~ x(n , a). 

But b + q s is irrelevant, so (c) follows from (a), thus concluding the proof. 
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We now want to see that for any homogeneous ideal û the function / such 
that 


f(n) = ^(n,û) 

satisfies the conditions of Lemma 6.4(b). First, we observe that if we change 
the ground field from k to an algebraically closed field K containing k 9 and we 
let A k — K[Xq, . . . , Xyy], ^k = then 

dim t A„ = dim* A K n and dim* a„ = dim* a* 

Hence we can assume that k is algebraically closed. 

Second, we shall need a geometric notion, that of dimension. Let V be a 
variety over k , say affine, with generic point (x) = (x Ì9 ..., x N ). We define its 
dimension to be the transcendence degree of k(x) over k. For a projective variety, 
defined by a homogeneous prime ideal p, we define its dimension to be the 
dimension of the homogeneous variety defined by p minus 1. 

We now need the following lemma. 

Lemma 6.8. Let V , W be varieties over a field k. 

IfVD W and dim V = dim W , then V = W. 

Proof. Say V, W are in affine space A N . Letp v and p w be the respective 
prime ideals of V and W in k[X]. Then we have a canonical homomorphism 

k[X]/ Vv « k[x] -* k[y] « k[X]/p w 

from the affine coordinate ring of V onto the affine coordinate ring of W. If the 
transcendence degree of /:(*) is the same as that of k(y ), and say y x ,.. . , y r form 
a transcendence basis of /:(y) over k , then jcj, ..., x r is a transcendence basis 
of k(x) over k , the homomorphism k[x] —» k[y] induces an isomorphism 

k[x x ,. .., x r ] A k[y j,. .*. , y r ] 9 

and hence an isomorphism on the finite extension k[x] to k[y] 9 as desired. 


Theorem 6.9. Let a be a homogeneous ideal in A. Let r be the maximum 
dimension of the irreducible components of the algebraic space in projective 
space defined by a. Then there exists a polynomial P e Q[T] of degree ^ r, 
such that P( Z) C Z, and such that 

P(n) = x(n , û) 


for all n sufficiently large. 
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Proof. By Proposition 6.7(c), we may assume that no primary component 
in the primary decomposition of a is irrelevant. Let Z be the algebraic space of 
zeros of û in projective space. We may assume k algebraically closed as noted 
previously. Then there exists a homogeneous polynomial L e k[X] of degree 1 
(a linear form) which does not lie in any of the prime ideals belonging to the 
primary ideals in the given decomposition. In particular, L is not a divisor of 
zero mod û. Then the components of the algebraic space of zeros of a + (L) 
must have dimension ^ r — 1. By induction and Theorem 6.6, we conclude 
that the difference 

X(n , û) - x(n - 1, a) 

satisfies the conditions of Lemma 6.4(b), which concludes the proof. 

The polynomial in Theorem 6.9 is called the Hilbert polynomial of the 
ideal a. 

Remark. The above results give an introduction for Hartshorne’s [Ha 77], 
Chapter I, especially §7. If Z is not empty, and if we write 

n r 

X(n, a) = c— + lower terms, 

then c > 0 and c can be interpreted as the degree of Z, or in geometric terms, 
the number of points of intersection of Z with a sufficiently general linear variety 
of complementary dimension (counting the points with certain multiplicities). 
Forexplanationsanddetails, see [Ha77], Chapterl, Proposition7.6andTheorem 
7.7; van der Waerden [vdW 29] which does the same thing for multihomogeneous 
polynomial ideals; [La 58], referred to at the end of Chapter VIII, §2; and the 
papers [MaW 85], [Ph 86], making the link with van der Waerden some six 
decades before. 
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§7. INDECOMPOSABLE MODULES 

Let A be a ring, not necessarily commutative, and E an / 1 -module. We 
say that E is Artinian if E satisfies the descending chain condition on sub- 
modules, that is a sequence 


E t ZD E 2 => £3 ... 

must stabilize: there exists an integer N such that if n ^ N then E n = E n+Í . 

Example 1. If k is a field, A is a /c-algebra, and E is a finite-dimensional 
vector space over k which is also an /4-module, then E is Artinian as well as 
Noetherian. 

Example 2. Let A be a commutative Noetherian local ring with maximal 
ideal m, and let q be an m-primary ideal. Then for every positive integer n, 
A/q n is Artinian. Indeed, A/q n has a Jordan-Hòlder filtration in which each 
factor is a finite dimensional vector space over the field A/m, and is a module 
of finite length. See Proposition 7.2. 

Conversely, suppose that A is a local ring which is both Noetherian and 
Artinian. Let m be the maximal ideal. Then there exists some positive integer 
n such that m" = 0. Indeed, the descending sequence m" stabilizes, and 
Nakayama’s lemma implies our assertion. It then also follows that every 
primary ideal is nilpotent. 

As with Noetherian rings and modules, it is easy to verify the following 
statements: 

Proposition 7.1. Let Abe a ring , and let 

0 _► E' E -► E" -+ 0 

be an exact sequence of A-modules. Then E is Artinian if and only if E' and 

E" are Artinian. 

We leave the proof to the reader. The proof is the same as in the Noetherian 
case, reversing the inclusion relations between modules. 

Proposition 7.2. A module E has a finite simple filtration if and only if E 

is both Noetherian and Artinian. 

Proof. A simple module is generated by one element, and so is Noetherian. 
Since it contains no proper submodule ± 0, it is also Artinian. Proposition 7.2 
is then immediate from Proposition 7.1. 


A module E is called decomposable if E can be written as a direct sum 

E = E x 0 E 2 
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with E x # E and E 2 # E. Otherwise, E is called indecomposable. If E is 
decomposable as above, let e { be the projection on the first factor, and 
e 2 = 1 — e x the projection on the second factor. Then e í9 e 2 are idempotents 
such that 


e x #1, e 2 # 1, e x 4- e 2 = 1 and e x e 2 = e 2 e x = 0. 

Conversely, if such idempotents exist in End(£) for some module £, then E is 
decomposable, and e { is the projection on the submodule e { E. 

Let u: E -> E be an endomorphism of some module E. We can form the 
descending sequence 

Im w d Im m 2 d Im m 3 d • • . 

If E is Artinian, this sequence stabilizes, and we have 

Im u n = Im u n+ì for all sufficiently large n. 

We call this submodule u°°(£), or Im u 00 . 

Similarly, we have an ascending sequence 

Ker u c= Ker u 2 c = Ker u 3 c= • • • 

which stabilizes if E is Noetherian, and in this case we write 

Ker u® = Ker u n for n sufficiently large. 


Proposition 7.3. (Fitting’s Lemma). Assume that E is Noetherian and 
Aninian. Let u e End(E). Then E has a direct sum decomposition 

E = Im w°° © Ker u 00 . 

Furthermore , the restriction ofutolmu™ is an automorphism , and the restric- 
tion of u to Ker u 00 is nilpotent. 

Proof. Choose n such that Im w°° = Im u n and Ker w°° = Ker u n . We 
have 


Im u 00 n Ker u 00 = {0}, 

for if x lies in the intersection, then x = u n (y) for some y e £, and then 
0 = u n (x) = u 2n (y). So y e Ker u 2n = Ker u n 9 whence x = u n (y) = 0. 

Secondly, let xeE. Then for some y e u n (E) we have 


u n (x) = u n (y). 
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Then we can write 

x = x - u n (y) 4 - u n (y\ 

which shows that E = Im w°° + Ker w°°. Combined with the first step of the 
proof, this shows that E is a direct sum as stated. 

The final assertion is immediate, since the restriction of u to Im w°° is sur- 
jective, and its kernel is 0 by the first part of the proof. The restriction of u to 
Ker w°° is nilpotent because Ker w°° = Ker u n . This concludes the proof of the 
proposition. 

We now generalize the notion of a local ring to a non-commutative ring. 
A ring A is called local if the set of non-units is a two-sided ideal. 

Proposition 7.4. Let E be an indecomposable module over the ring A. Assume 
E Noetherian and Artinian. Any endomorphism of E is either nilpotent or an 
automorphism. Furthermore End(£) is local. 

Proof. By Fitting’s lemma, we know that for any endomorphism u, we 
have E = Im w°° or E = Ker u 00 . So we have to prove that End(£) is local. 
Let u be an endomorphism which is not a unit, so u is nilpotent. For any 
endomorphism v it follows that uv and vu are not surjective or injective respec- 
tively, so are not automorphisms. Let u u u 2 be endomorphisms which are not 
units. We have to show u t 4- u 2 is not a unit. If it is a unit in End(£), let 
v t = «,(«! 4- w 2 ) _1 . Then v x 4- v 2 = 1. Furthermore, v x = 1 — v 2 is invertible 
by the geometric series since v 2 is nilpotent. But v x is not a unit by the first part 
of the proof, contradiction. This concludes the proof. 

Theorem 7.5. (Krull-Remak-Schmidt). Let E # 0 be a module which is 
both Noetherian and Artinian. Then E is a finite direct sum of indecomposable 
modules. Up to a permutation , the indecomposable components in such a 
direct sum are uniquely determined up to isomorphism. 

Proof. The existence of a direct sum decomposition into indecomposable 
modules follows from the Artinian condition. If first E = E x ® E 2 , then either 
E ì9 E 2 are indecomposable, and we are done; or, say, E t is decomposable. 
Repeating the argument, we see that we cannot continue this decomposition 
indefinitely without contradicting the Artinian assumption. 

There remains to prove uniqueness. Suppose 

E = E, 0 • • • 0 E r = F t 0 • • • © F s 

where E h Fj are indecomposable. We have to show that r = s and after some 
permutation, E t « F h Let e { be the projection of E on E h and let Uj be the 
projection of E on F j9 relative to the above direct sum decompositions. Let: 
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Then £ u j = implies that 


I.v J w j \E l = id £l . 
j= 1 

By Proposition 7.4,End(£!) is local, and therefore some VjWj is an automor- 
phism of E v After renumbering, we may assume that is an automorphism 
of E v We claim that v { and w { induce isomorphisms between E x and F u 
This follows from a lemma. 

Lemma 7.6. Let M, N be modules , and assume N indecomposable. Let 
u: M -> N and v: N -► M be such that vu is an automorphism. Then u 9 v 
are isomorphisms. 

Proof. Let e = u(vu)~ l v. Then e 2 = e is an idempotent, lying in End(Af), 
and therefore equal to 0 or 1 since N is assumed indecomposable. But e # 0 
because id M # 0 and 


0 # id M = id M = (vu) l vu(vu) 'vu. 

So e = id N . Then u is injective because vu is an automorphism; v is injective 
because e = id N is injective; u is surjective because e = id N ; and v is surjective 
because vu is an automorphism. This concludes the proof of the lemma. 

Returning to the theorem, we now see that 


E = F t 0 (E 2 ® • • • ® E r ). 


Indeed, e x induces an isomorphism from to E u and since the kernel of e x 
is E 2 ® • • • ® E r it follows that 

Fj n (E 2 ©•••©£,.) = 0. 

But also, Fj = E { (mod E 2 © • • • © E r \ so E is the sum of F t and E 2 © • • • © E r , 
whence E is the direct sum, as claimed. But then 

E/F t « F 2 ® • • • ® F s & E 2 © • • • © E r . 

The proof is then completed by induction. 

We apply the preceding results to a commutative ring A. We note that an 
idempotent in A as a ring is the same thing as an idempotent as an element of 
End(A), viewing A as module over itself. Furthermore End(A) « A. Therefore, 
we-find the special cases: 


Theorem 7.7. Let Abea Noetherian and Artinian commutative ring. 
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(i) If A is indecomposable as a ring 9 then A is local. 

(ii) In general, A is a direct product of local rings 9 which are Artinian and 
Noetherian. 

Another way of deriving this theorem will be given in the exercises. 


EXERCISES 

1. Let A be a commutative ring. Let M be a module, and N a submodule. Let 
/V = Q { n • • • n Q r be a primary decomposition of N . Let (?, = QJN. Show that 
0 = Q { n • • • n Q r is a primary decomposition of 0 in M/N. State and prove the 
converse. 

2. Let p be a prime ideal, and a, b ideals of A. If ab c= p, show that a c p or b c p. 

3. Let q be a primary ideal. Let a, b be ideals, and assume ab c: q. Assume that b is 
finitely generated. Show that acqor there exists some positive integer n such that 
b" c q. 

4. Let A be Noetherian, and let q be a p-primary ideal. Show that there exists some n ^ 1 
such that p" c: q. 

5. Let A be an arbitrary commutative ring and let S be a multiplicative subset. Let p 
be a prime ideal and let q be a p-primary ideal. Then p intersects S if and only if q 
intersects S. Furthermore, if q does not intersect S, then S~ l q is S -1 p-primary in 
S~ l A. 

6. If a is an ideal of A, let a s = S~ x a. If (p s : A -> S~ l A is the canonical map, abbreviate 
cp-'(a s ) by a s n A , even though (p s is not injective. Show that there is a bijection 
between the prime ideals of A which do not intersect S and the prime ideals of S~ l A 9 
given by 


p i—► p s and p s i-> p s n /1 = p. 

Prove a similar statement for primary ideals instead of prime ideals. 

7. Let a = q^ n • • • n q r be a reduced primary decomposition of an ideal. Assume that 
q!,..., q, do not intersect S, but that q^ intersects S for j > i. Show that 

= Qis n n Qîs 

is a reduced primary decomposition of a s . 

8. Let A be a local ring. Show that any idempotent ^ 0 in A is necessarily the unit 
element. (An idempotent is an element e e A such that e 2 = e.) 

9. Let A be an Artinian commutative ring. Prove: 

(a) All prime ideals are maximal. [ Hint : Given a prime ideal p, let x e A, x(p) = 0. 
Consider the descending chain (x) => (x 2 ) => (x 3 ) => • • •.] 
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(b) There ìs only a fìnite number of prime, or maximal, ideals. [Hint: Among all 
fìmte mtersections of maximal ideals, pick a minimal one.] 

(c) The ìdeal N of mlpotent elements in A is mlpotent, that is there exists a positive 
ìnteger/c such that N k = (0). [ Hint : Let k besuch that N k = N k + '. Let a = N k . 
Let b be a minimal ideal # 0 such that bo ^ 0. Then b ìs principal and bo = b.] 

(d) A ìs Noetherian. 

(e) There exists an integer r such that 

A = f] A/m r 

where the product ìs taken over all maximal ideals. 

(f) We have 

A = UA e 

where agam the product is taken over all prime ideals p. 

10. Let A, B be local rings with maximal ideals m^, m fl , respectively. Let/: A -> B be a 
homomorphism. We say that / is local if / _1 (m B ) = m^. Suppose this is the case. 
Assume A, B Noetherian, and assume that: 

1. A/m A -> B/m H ìs an ìsomorphism, 

2. m^ -> rn H /ml is surjective; 

3. B ìs a finite /1-module, via f. 

Prove that f ts surjective. [Hint: Apply Nakayama twice.] 

For an ideal a, recall from Chapter IX, §5 that 3E (a) is the set of primes containing a. 

11. Let A be a commutative ring and M an /1-module. Define the support of M by 

supp(M) = {p e spec(/f): M p ^ 0}. 

If M is finite over A, show that supp(M) = 3E (ann(M)), where ann(M) is the annihilator 
of M in A, that is the set of elements a E A such that aM = 0. 

12. Let A be a Noetherian ring and M a finite A-module. Let / be an ideal of A such that 
supp(M) C^(/). Then I n M - 0 for some n > 0. 

13. Let A be any commutative ring, and M, N modules over A. If M is finitely presented, 
and S is a multiplicative subset of A , show that 

S _1 Hom a (M, N) « Hom s -i A (S~ l M, S~ l N). 

This is usually applied when A is Noetherian and M finitely generated, in which case 
M is also finitely presented since the module of relations is a submodule of a finitely 
generated free module. 

14. (a) Prove Proposition 6.7(b). 

(b) Prove that the degree of the polynomial P in Theorem 6.9 is exactly r. 


Locally constant dimensions 

15. Let A be a Noetherian local ring. Let E be a finite /Lmodule. Assume that A has no 
nilpotent elements. For each prime ìdeal p of A, let k(p) be the residue class field. If 
dim Mp) £ p /p£ p is constant for all p, show that E ìs free. [Hint: Let x lt ..., x r e A be 
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such that the residue classes mod the maximal ideal form a basis for E/mE over k( m). 
We get a surjective homomorphism 


A r E -> 0. 

Let J be the kernel. Show that J p c m p A p for all p so J c= p for all p and J = 0.] 

16. Let A be a Noetherian local ring without nilpotent elements. Let/: E -> F be a homo- 
morphism of /1-modules, and suppose £, F are finite free. For each prime p of A let 

/<p) : EJpE p -> F p /pFp 

be the corresponding /c(p)-homomorphism, where k(p) = A p /pA p is the residue class 
íìeld at p. Assume that 

dim J(p) Im / (pl 

is constant. 

(a) Prove that F/ Im /and Im / are free, and that there is an isomorphism 

F » Im / © (F/Im /). 

[//í«r: Use Exercise 15.] 

(b) Prove that Ker j is free and E « (Ker /) ® (Im /). [Hint: Use that finite 
projective is free.] 

The next exercises depend on the notion of a complex, which we have not yet formally 
defined. A (finite) complex E is a sequence of homomorphisms of modules 

0 E° i E l 4 • • • 4 E n —» 0 

and homorphisms d*: E 1 -* E i+Ì such that d i+l ° d' = 0 for all i. Thus Im(c/‘) C Ker (d i+l ). 
The homology H‘ of the complex is defined to be 

//' = Ker(J f+1 )/Im(rf i ). 


By definition, H° = E° and H n = E n /lm(d n ). You may want to look at the first section 
of Chapter XX, because all we use here is the basic notion, and the following property, 
which you can easily prove. Let £, F be two complexes. By a homomorphism /: E —> F 
we mean a sequence of homomorphisms 

/: E‘ —* F' 

making the diagram commutative for all i : 



E i+l 
fi +1 

fi+ 1 


Show that such a homomorphism /induces a homomorphism H(f): H(E) —> H(F) on the 
homology; that is, for each i we have an induced homomorphism 
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The following exercises are inspired from applications to algebraic geometry, as for 
instance in Hartshorne, Algebraic Geometry , Chapter III, Theorem 12.8. See also Chapter 
XXI, §1 to see how one can construct complexes such as those considered in the next 
exercises in order to compute the homology with respect to less tractable complexes. 

Reduction of a complex mod p 

17. Let 0 -* K° K l ->•••-► X” -> 0 be a complex of finite free modules over a local 
Noetherian ring A without nilpotent elements. For each prime p of A and module £, 
let £(p) = £ p /p£ p , and similarly let K(p) be the complex localized and reduced mod p. 
For a given integer í, assume that 


dim k(p) 

is constant, where H' is the í-th homology of the reduced complex. Show that H l (K) 
is free and that we have a natural isomorphism 

H l (X)(p) * HÏK(p)). 

[Hint : First write d\ 9) for the map induced by d' on X'(p). Write 

dim Wp) Ker dj pt = dim t(p) K‘( p) - dim t(p) Im d\ pt . 

Then show that the dimensions dim t(p) Im d\ p) and dim t(p) Im d\ p> 1 must be constant. 
Then apply Exercise 12.] 

Comparison of homology at the special point 

18. Let A be a Noetherian local ring. Let K be a finite complex, as follows: 

0 - X° ->-► K n -> 0, 

such that K‘ is finite free for all i. For some index i assume that 

H\K)( m) -> H‘(K(m)) 


is surjective. Prove: 

(a) This map is an isomorphism. 

(b) The following exact sequences split: 


0 -> Ker d 1 - X‘ -> Im d l -> 0 
0 -> Im d l -> X í+1 


(c) Every term in these sequences is free. 

19. Let A be a Noetherian local ring. Let K be a complex as in the previous exercise. For 
some i assume that 


H\K)( m) -> H\K( m)) 

is surjective (or equivalently is an isomorphism by the previous exercise). Prove that 
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the following conditions are equivalent: 

(a) if 1 " l (K)( m) - tf \K( m)) is surjective. 

(b) H'~ l (K)(m) -> //*" ^(m)) is an isomorphism. 

(c) H l (K) is free. 

[Hint: Lift bases until you are blue in the face.] 

(d) If these conditions hold, then each one of the two inclusions 

Im d'~ 1 c Ker d' c K l 

splits, and each one of these modules is free. Reducing mod m yields the 
corresponding inclusions 

Im d[ m) ' <= Ker d[ m) c K'(m), 

and induce the isomorphism on cohomology as stated in (b). [Hint: Apply 
the preceding exercise.] 





CHAPTER XI 

Real Fields 


§1. ORDERED FIELDS 

Let K be a field. An ordering of K is a subset P of K having the following 
properties: 

ORD 1. Given xe K, we have either xeP, or x = 0, or — xeP , and these 
three possibilities are mutually exclusive. In other words, K is the 
disjoint union of P, {0}, and -P. 

ORD 2. If x, y e P , then x -f y and xy e P. 

We shall also say that K is ordered by P, and we call P the set of positive 
elements. 

Let us assume that K is ordered by P. Since 1 # 0 and 1 = 1 2 = (—l ) 2 
we see that 1 e P. By ORD 2, it follows that 1 -f • • • + 1 e P, whence K has 
characteristic 0. If x e P, and x # 0, then xx“ 1 = 1 e P implies that x“ 1 e P. 

Let x, ye K. We define x < y (or y > x) to mean that y — xeP. Ifx<0 
we say that x is negative. This means that - x is positive. One verifies trivially 
the usual relations for inequalities, for instanc'e: 

x < y and y < z implies x < z, 

x < y and z > 0 implies xz < yz, 

. 1 1 

x < y and x, y > 0 ìmplies - < -. 

y x 

We define x ^ y to mean x < y or x = y. Then x ^ y and y ^ x imply x = y. 

If K is ordered and x e X, x # 0, then x 2 is positive because x 2 = ( —x ) 2 
and either xeP or —xeP. Thus a sum of squares is positive, or 0. 

Let E be a field. Then a product of sums of squares in E is a sum of squares. 

If a,be E are sums of squares and b # 0 then a/b is a sum of squares. 
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The first assertion is obvious, and the second also, from the expression 
a/b = ab{b- 1 ) 2 . 

If E has characteristic # 2, and -1 is a sum of squares in £, then every 
element aeE is a sum of squares, because Aa = (1 -f a) 2 — (1 — a) 2 . 

If K is a field with an ordering P, and F is a subfield, then obviously, P n F 
defines an ordering of F, which is called the induced ordering. 

We observe that our two axioms ORD 1 and ORD 2 apply to a ring. If 
A is an ordered ring, with 1 # 0 , then clearly A cannot have divisors of 0 , and 
one can extend the ordering of A to the quotient field in the obvious way: A 
faction is called positive if it can be written in the form a/b with a, b e A and 
a, b > 0. One verifies trivially that this defines an ordering on the quotient 
field. 

Example. We define an ordering on the polynomial ring R[í] over the 
real numbers. A polynomial 

/(0 = a nt n + • * * + 

with a n # 0 is defined to be positive if a n > 0. The two axioms are then trivially 
verified. We note that t > a for all aeR. Thus t is infinitely large with respect 
to R. The existence of infinitely large (or infinitely small) elements in an ordered 
field is the main aspect in which such a field differs from a subfield of the real 
numbers. 

We shall now make some comment on this behavior, i.e. the existence of 
infinitely large elements. 

Let K be an ordered field and let F be a subfield with the induced ordering. 
As usual, we put | x | = x if x > 0 and | x | = - x if x < 0. We say that an element 
a in K is infinitely large over F if | a | ^ x for all x e F. We say that it is infìnitely 
small over F if 0 ^ | a | < | x | for all x e F, x # 0. We see that a is infinitely large 
if and only if a~ 1 is infinitely small. We say that K is archimedean over F if K 
has no elements which are infinitely large over F. An intermediate field F ì9 
K D F x D F, is maximal archimedean over F in K if it is archimedean over F, 
and no other intermediate field containing F x is archimedean over F. If F { is 
archimedean over F and F 2 is archimedean over F x then F 2 is archimedean over 
F. Hence by Zorn’s lemma there always exists a maximal archimedean subfield 
F x of K over F. We say that F is maximal archimedean in K if it is maximal 
archimedean over itself in K. 

Let K be an ordered field and F a subfield. Let o be the set of elements of K 
which are not infinitely large over F. Then it is clear that o is a ring, and that for 
any aeX, we have a or a _1 eo. Hence o is what is called a valuation ring, 
containing F. Let m be the ideal of all a e K which are infinitely small over F. 
Then m is the unique maximal ideal of o, because any element in o which is not 
in m has an inverse in o. We call o the valuation ring determined by the ordering 
of K/F. 
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Proposition 1.1. Let K be an ordered field and F a subfield. Let o be the 
valuation ring determined by the ordering of K/F , and let m be its maximal 
ideal. Then o/m is a real field. 

Proof. Otherwise, we could write 

-1 = x«? + a 

with a, e o and a e m. Since £ a f is positive and a is infinitely small, such a 
relation is clearly impossible. 


§2. REAL FIELDS 

A field K is said to be real if — 1 is not a sum of squares in K. A field K is 
said to be real closed if it is real, and if any algebraic extension of K which is real 
must be equal to K. In other words, K is maximal with respect to the property 
of reality in an algebraic closure. 

Proposition 2.1. Let Kbe a real field. 

(i) IfaeK, then K(^ r a) or K{^J—a) is real. If a is a sum of squares in K , 
then ÂXVa) is real. If K(Vâ) is not real, then -a is a sum of squares 
in K. 

(ii) Iff is an irreducible polynomial of odd degree n in K[X~\ and if oc is a root 
of /, then K(oc) is real. 

Proof. Let ae K. If a is a square in K , then K(^Ja) = K and hence is real by 
assumption. Assume that a is not a square in K. If K(yja) is not real, then there 
exist b i9 Ci e K such that 

-1=£(6, + c is /a) 2 

= X (bf + 2cib iy /à + cfa). 

Since yfa is of degree 2 over K , it follows that 

-1 = Z bf + a X cf. 

If a is a sum of squares in K, this yields a contradiction. In any case, we con- 
clude that 


— a = 


1 +lbf 

Zc? 


is a quotient of sums of squares, and by a previous remark, that — a is a sum of 
squares. Hence K(Vâ) is real, thereby proving our first assertion. 
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As to the second, suppose K( oc) is not real. Then we can write 

-1 = Z 9i(<x) 2 

with polynomials g t in X[A"] of degree ^ n — 1. There exists a polynomial h 
in K\_X ] such that 

-1 =ï t g&X) 2 + KX)f(X). 

The sum of g t (X) 2 has even degree, and this degree must be > 0, otherwise - 1 
is a sum of squares in K. This degree is ^ 2 n — 2. Since / has odd degree n, it 
follows that h has odd degree ^ n — 2. If /? is a root of h then we see that — 1 
is a sum of squares in K(/?). Since deg h < deg /, our proof is finished by 
induction. 

Let K be a real field. By a real closure we shall mean a real closed field L 
which is algebraic over K. 

Theorem 2.2. Let K be a real field. Then there exists a real closure of K. 
If R is real closed , then R has a unique ordering. The positive elements are 
the squares of R. Every positive element is a square, and every polynomial of 
odd degree in R[X] has a root in R. We have R a = /?(V—T). 

Proof. By Zorn’s lemma, our field K is contained in some real closed field 
algebraic over K. Now let R be a real closed field. Let P be the set of non-zero 
elements of R which are sums of squares. Then P is closed under addition and 
multiplication. By Proposition 2.1, every element of P is a square in R , and given 
a e R, a # 0, we must have aeP or —aeP. Thus P defines an ordering. Again 
by Proposition 2.1, every polynomial of odd degree over R has a root in R. Our 
assertion follows by Example 5 of Chapter VI, §2. 

Corollary 2.3. Let K be a real field and a an element of K which is not a 
sum of squares. Then there exists an ordering of K in which a is negative. 

Proof. The field K(J — a) is real by Proposition 1.1 and hence has an 
ordering as a subfield of a real closure. In this ordering, — a > 0 and hence a is 
negative. 

Proposition 2.4. Let Rbea field such that R # R a but R a = R(*J— 1). Then 
R is real and hence real closed. 

Proofi Let P be the set of elements of R which are squares and # 0. We 
contend that P is an ordering of R. Let a e R 9 a # 0. Suppose that a is not a 
square in R. Let a be a root of X 2 — a = 0. Then R(tx) = R(^/ — 1), and hence 
there exist c,deR such that a = c + dyj — 1. Then 

a 2 = c 2 -h 2cdyJ^Ì — d 2 . 
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Since 1, yf— \ are linearly independent over R , it follows that c = 0 (because 
a $ R 2 \ and hence —a is a square. 

We shall now prove that a sum of squares is a square. For simplicity, write 
i = >/-T. Since R(i) is algebraically closed, given a,beR we can find c,deR 
such that (c -F di) 2 = a + bi. Then a = c 2 — d 2 and b = 2cd. Hence 

a 2 + b 2 = (c 2 + á 2 ) 2 , 


as was to be shown. 

If a e R, a # 0, then not both a and -a can be squares in R. Hence P is an 
ordering and our proposition is proved. 


Theorem 2.5. Let Rbe a real closed field , and f(X) a polynomial in K[X]. 
Let a, b e R and assume that f(a) < 0 and f (b ) > 0. Then there exists c 
between a and b such that f(c) = 0. 

Proof Since R(y/~— 1) is algebraically closed, it follows that / splits into a 
product of irreducible factors of degree 1 or 2. If X 2 + otX + f is irreducible 
(a, / g R) then it is a sum of squares, namely 


i x+ î) + { f + 


and we must have 4/ > a 2 since our factor is assumed irreducible. Hence the 
change of sign of / must be due to the change of sign of a linear factor, which is 
trivially verified to be a root lying between a and b. 


Lemma 2.6. Let Kbe a subfield of an ordered field E. Let a e Ebe algebraic 

over K , and a root of the polynomial 

/(X) = jr" + 0 ll _ 1 X'’- 1 + --- + Û0 

with coejficients in K. Then |a| ^ 1 + \a n - t \ + • • • + \a 0 \. 

Proofi If |a| ^ 1, the assertion is obvious. If |a| > 1, we express |a|" in 
terms of the terms of lower degree, divide by |a|” _1 , and get a proof for our 
lemma. 

Note that the lemma implies that an element which is algebraic over an 
ordered field cannot be infinitely large with respect to that field. 

Let f(X) be a polynomial with coeíficients in a real closed field R , and 
assume that / has no multiple roots. Let u < v be elements of R. By a Sturm 
sequence for / over the interval [ u , v] we shall mean a sequence of polynomials 

5={/ = / 0 ,/' = /i,...,/J 


having the following properties: 
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ST 1. The last polynomial f m is a non-zero constant. 

ST 2. There is no point x g [u, v\ such that ffx) = f j+ì (x) = 0 for any 
value 0 ^ j ^ m — 1 . 

ST 3. If x e [u, v~\ and ffx) = 0 for some j = 1,..., m - 1, then /_ X (x) 
and / j+ j(x) have opposite signs. 

ST 4. We have ffu) ^ 0 and ffv) # 0 for all j = 0,..., m . 

For any x g [m, t»] which is not a root of any polynomial / we denote by 
W s (x) the number of sign changes in the sequence 


{f(x\A(x),...,f m (x)}, 
and call W s (x) the variation of signs in the sequence 

Theorem2.7. (Sturm’s Theorem). The number of roots of f betxveen u and v 
is equal to W s (u) — W s (v) for any Sturm sequence S. 

Proof We observe that if ocj < a 2 < • • • < a r is the ordered sequence of 
roots of the polynomials / in [u, v\ (j = 0 ,..., m — 1 ), then is constant 
on the open intervals between these roots, by Theorem 2.5. Hence it will suffice 
to prove that if there is precisely one element a such that u < a < v and a is a 
root of some /•, then W s (u) - W s (v) = 1 if a is a root of /, and 0 otherwise. 
Suppose that a is a root of some f j9 for 1 tkj^m— 1. Then /•_ ^(a), f j+ ^(a) 
have opposite signs by ST 3, and these signs do not change when we replace a 
by u or v. Hence the variation of signs in 

fj(u), f J+1 (u)} and {fj-fv), ffv), f j+l (v)} 
is the same, namely equal to 2. If a is not a root of /, we conclude that 

W s (u) = W s (v). 

If a is a root of /, then f(u) and f(v) have opposite signs, but f'(u) and f'(v) 
have the same sign, namely, the sign of f'(oc). Hence in this case, 

W s (u) = W s (v) + 1. 


This proves our theorem. 

It is easy to construct a Sturm sequence for a polynomial without multiple 
roots. We use the Euclidean algorithm, writing 

/ = Qif ’ ~ f 2 , 
fi ~ 9 2 f \ ~ /3» 

fm -2 Qm - 1 fm - 1 fm ’ 
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using /' = f v Since /, f' have no common factor, the last term of this sequence 
is non-zero constant. The other properties of a Sturm sequence are trivially 
verified, because if two successive polynomials of the sequence have a com- 
mon zero, then they must all be 0 , contradicting the fact that the last one is not. 

Corollary 2.8. Let K be an ordered field , / an irreducible polynomial of 
degree ^ 1 over K. The number of roots of f in two real closures of K inducing 
the given ordering on K is the same. 

Proof We can take v sufficiently large positive and u sufficiently large 
negative in K so that all roots of / and all roots of the polynomials in the Sturm 
sequence lie between u and v, using Lemma 2.6. Then W s (u) — W s (v) is the 
total number of roots of / in any real closure of K inducing the given ordering. 

Theorem 2.9. Let K be an ordered field , and ìet R , R’ be real closures of K , 
whose orderings induce the given ordering on K. Then there exists a unique 
isomorphism o : R -► R' over K , and this isomorphism is order-preserving. 

Proof. We first show that given a finite subextension E of R over K , there 
exists an embedding of E into R' over K. Let E = K(ol\ and let 

f(X) = Irr(a, K , X). 

Then /(a) = 0 and the corollary of Sturm’s Theorem (Corollary 2.8) shows that 
/ has a root /} in R'. Thus there exists an isomorphism of K(a) on K(f) over X, 
mapping a on f. 

Let ocj,..., oc n be the distinct roots of/ in R , and let p u ..., /? m be the distinct 
roots of / in R'. Say 


aj < • • • < <x n in the ordering of R , 

Pi < • - • < P m in the ordering of R'. 

We contend that m = n and that we can select an embedding o of K(oc í9 ... 9 oc n ) 
into R' such that cra, = /?, for i = 1,.. ., n. Indeed, let y f be an element of R 
such that 


yf = oc i+l — 0 Cì for i = 1 ,..., n — 1 

and let = K(oc l9 ..., a„, y u . .. , y„_ t ). By what we have seen, there exists 
an embedding o of E { into R' , and then ooc i+l — ooc { is a square in R'. Hence 

ooc^ < • • • < ooc n . 

This proves that m ^ n. By symmetry, it follows that m = n. Furthermore, 
the condition that ooc^ = j8 f for i = 1, . . . , « determines the effect of cr on 
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K(a!,..., <x n ). We contend that o is order-preserving. 'Let y e K(cc u ..., ot n ) 
and 0 < y. Let y e R be such that y 2 = y. There exists an embedding of 

K(a 1 ,...,a„,y 1 ,...,y n _ 1 ,y) 

into R' over K which must induce o on K(ol u ..., <x n ) and is such that oy is a 
square, hence > 0 , as contended. 

Using Zorn’s lemma, it is now clear that we get an isomorphism of R onto R' 
over K. This isomorphism is order-preserving because it maps squares on 
squares, thereby proving our theorem. 

Proposition 2.10. Let K be an orderedfield, K' an extension such that there is 
no relation 


-1 = 2>,a? 

i = 1 

with a t e K , a { > 0, and a, e K'. Let L be thefield obtainedfrom K' by adjoining 
the square roots of all positive elements of K. Then L is real. 

Proof. If not, there exists a relation of type 

-1 = 

î = 1 

with a ( e X, a t > 0, and a, eL. (We can take a, = 1.) Let r be the smallest 
integer such that we can write such a relation with a, in a subfìeld of L, of type 

K'C/b u ...,Jb r ) 

with bj e K , bj > 0. Write 

* i = *, + yisfbr 

with x h y, e K\^/b[,.... fbff). Then 

-1 = Z a >( x > + W ^) 2 

= X a i(x? + 2xiy is /b r + yfb r ). 

By hypothesis, yfb r is not in K'(b „ ..., fb r 7,). Hence 

= Y, a ì x ï + Y* a ib r yîi 

contradicting the minimality of r. 


Theorem 2.11. Let K be an ordered field. There exists a real closure R ofK 
inducing the given ordering on K. 
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Proof. Take K' = K in Proposition 2.10. Then L is real, and is contained 
in a real closure. Our assertion is clear. 

Corollary 2.12. Let K be an ordered field , and K' an extension field. In order 
that there exist an ordering on K' inducing the given ordering of K , it is 
necessary and sufficient that there is no relation of type 

-1 = £>ja? 

i — 1 


mth a x € K , a t > 0, and a f e K'. 

Proof. If there is no such relation, then Proposition 2.10 states that L is 
contained in a real closure, whose ordering induces an ordering on K\ and the 
given ordering on X, as desired. The converse is clear. 

Example. Let Q a be the field of algebraic numbers. One sees at once that 
Q admits only one ordering, the ordinary one. Hence any two real closures of Q 
in Q a are isomorphic, by means of a unique isomorphism. The real closures of Q 
in Q a are precisely those subfields of Q a which are of finite degree under Q a . 
Let K be a finite real extension of Q, contained in Q a . An element a of K is a 
sum of squares in K if and only if every conjugate of a in the real numbers is 
positive, or equivalently, if and only if every conjugate of a in one of the real 
closures of Q in Q a is positive. 

Note. The theory developed in this and the preceding section is due to Artin- 
Schreier. See the bibliography at the end of the chapter. 


§3. REAL ZEROS AND HOMOMORPHISMS 

Just as we developed a theory of extension of homomorphisms into an 
algebraically closed field, and Hilbert’s Nullstellensatz for zeros in an alge- 
braically closed field, we wish to develop the theory for values in a real closed 
field. One of the main theorems is the following: 

Theorem 3.1. Let k be a field , K = k(x u ..., x n ) a finitely generated 
extension. Assume that K is ordered. Let R k be a real closure of k inducing 
the same ordering on k as K. Then there exists a homomorphism 

(p.k[x u -»• R k 


over k. 
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As applications of Theorem 3.1, one gets: 

Corollary 3.2. Notation being as in the theorem , let y í9 ..., y m e fc[x] «nd 
assume 

yi < y 2 < ••' < y m 

is the given ordering of K. Then one can choose (p such that 

tpy i < * * * < (py m . 

Proof Let 7 ,-eiC 3 be such that yf = y i+ì - y t . Then K(y í ,... 9 y n - Ì ) 
has an ordering inducing the given ordering on K. We apply the theorem to the 
ring 

fc[xi,..., x n , yi *... * y m -yh.. • * y m — í~\' 

Corollary 3.3. (Artin). Let kbe a real field admitting only one ordering. 
Let f(X i,..., X n ) e k(X) be a rational function having the property that for 
all (a) = (a í9 ..., a n ) e R { k ] such that f(a) is defined , we have f(a) ^ 0. Then 
j (X) is a sum of squares in k(X). 

Proof. Assume that our conclusion is false. By Corollary 2.3, there exists 
an ordering of k(X) in which / is negative. Apply Corollary 3.2 to the ring 

klX í9 ... 9 X n9 h(X)~'l 

where h(X) is a polynomial denominator for f(X). We can find a homo- 
morphism (p of this ring into R k (inducing the identity on k) such that (p(f) < 0. 
But 

(P(f) = f((pX í9 ... 9 (pX n ). 

contradiction. We let a t = (p(Xỳ to conclude the proof. 

Corollary 3.3 was a Hilbert problem. The proof which we shall describe for 
Theorem 3.1 differs from Artin’s proof of the corollary in several technical 
aspects. 

We shall first see how one can reduce Theorem 3.1 to the case when K has 
transcendence degree 1 over k 9 and k is real closed. 

Lemma 3.4. Let Rbe a real closed field and let R 0 be a subfield which is 
algebraically closed in R (i.e. such that every element of R not in R 0 is tran- 
scendental over R 0 ). Then R 0 is real closed. 

Proof. Let f(X) be an irreducible polynomial over R 0 . It splits in R into 
linear and quadratic factors. Its coeíficients in R are algebraic over R 09 and 
hence must lie in R 0 . Hence/(X) is linear itself, or quadratic irreducible already 
over R 0 . By the intermediate value theorem, we may assume that / is positive 
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definite, i.e. f(a) > 0 for all ae R 0 . Without loss of generality, we may assume 
that f(X) = X 2 + b 2 for some beR 0 . Any root of this polynomial will bring 
with it and therefore the only algebraic extension of R 0 is Roiy/^ï)- 
This proves that R 0 is real closed. 

Let R k be a real closure of K inducing the given ordering on K. Let R 0 be 
the algebraic closure of k in R K . By the lemma, R 0 is real closed. 

We consider the field R 0 (x l9 ..., x n ). If we can prove our theorem for the 
ring R 0 [x í9 ..., xj, and find a homomorphism 

ý'-Rolx -> R 09 

then we let o : R 0 -* R K be an isomorphism over k (it exists by Theorem 2.9), and 
we let cp = o o \j/ to solve our problem over k. This reduces our theorem to the 
case when k is real closed. 

Next, let F be an intermediate field, K => F k, such that K is of tran- 
scendence degree 1 over F. Again let R K be a real closure of K preserving the 
ordering, and let R F be the real closure of F contained in R K . If we know our 
theorem for extensions of dimension 1, then we can find a homomorphism 

i//:R F lx l ,...,x n ]-+ R f . 

We note that the field k(ỳx ì9 ..., il/x n ) has transcendence degree ^ n — 1, 
and is real, because it is contained in R F . Thus we are reduced inductively to 
the case when K has dimension 1, and as we saw above, when k is real closed. 

One can interpret our statement geometrically as follows. We can write 
K = R(x, y) with x transcendental over R , and (x, y) satisfying some irreducible 
polynomial f(X, Y) = 0 in R\_X, T]. What we essentially want to prove is that 
there are infinitely many points on the curve f(X, Y) = 0, with coordinates 
lying in R, i.e. infinitely many real points. 

The main idea is that we find some point (a, b) e R {2) such that f(a, b) = 0 
but D 2 f(a, b) # 0. We can then use the intermediate value theorem. We see 
that f(a,b + h) changes sign as h changes from a small positive to a small 
negative element of R. If we take a e R close to a, then / (a', b + h) also changes 
sign for small h, and hence / (a', Y) has a zero in R for all a' sufficiently close to a. 
In this way we get infinitely many zeros. 

To find our point, we consider the polynomial /(x, Y) as a polynomial in one 
variable Y with coefficients in R(x). Without loss of generality we may assume 
that this polynomial has leading coefficient 1. We construct a Sturm sequence 
for this polynomial, say 

{/(x, Y),f |(x, Y),...,f m (x, Y)). 

Let d = deg /. If we denote by A(x) = (a d _ /x),..., a 0 (x)) the coefficients of 
f(x, Y), then from the Euclidean alogrithm, we see that the coefficients of the 
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polynomials in the Sturm sequence can be expressed as rational functions 

{G v (A(x))} 


in terms of a d _ j(x),..., a 0 (x). 
Let 


v(x) = 1 ± a d _ i(x) ± • ■ • ± a 0 (x) ± s, 

where s is a positive integer, and the signs are selected so that each term in this 
sum gives a positive contribution. We let u(x) = —v(x\ and select s so that 
neither u nor v is a root of any polynomial in the Sturm sequence for /. Now 
we need a lemma. 

Lemma 3.5. Let Rbe a real closed field, and {/i/x)} a finite set of rational 
functions in one variable mth coefficients in R. Suppose the rational field 
R(x) ordered in some way, so that each h^x) has a sign attached to it. Then 
there exist infinitely many special values c of x in R such that h t (c) is defined 
and has the same sign as h^xXfor all i. 

Proof. Considering the numerators and denominators of the rational 
functions, we may assume without loss of generality that the h t are polynomials. 
We then write 

hì(x) = a II ( x “ II P( x X 

where the first product is extended over all roots A of h t in R , and the second 
product is over positive definite quadratic factors over R. For any ÇeR, p(Ç) is 
positive. It suffices therefore to show that the signs of (x — A) can be preserved 
for all A by substituting infinitely many values a for x. We order all values of A 
and of x and obtain 


• • • < Aj < x < À 2 < • • * 

where possibly A { or A 2 is omitted if x is larger or smaller than any A. Any value 
a of x in R selected between A x and A 2 will then satisfy the requirements of our 
lemma. 

To apply the lemma to the existence of our point, we let the rational functions 
{/zï(x)} consist of all coefficients a d _ j(x),.. . , a 0 (x) 9 all rational functions 
G v (A(x)), and all values ffx, u(x)), ffx, d(x)) whose variation in signs satisfied 
Sturm’s theorem. We then find infinitely many special values a of x in R which 
preserve the signs ofthese rational functions. Then the polynomials/(a, Y) have 
roots in R, and for all but a finite number of a, these roots have multiplicity 1. 

It is then a matter of simple technique to see that for all but a finite number of 
points on the curve, the elements x í9 ..., x„ lie in the local ring of the homo- 
morphism R[x, y] -> R mapping (x, y) on (a, b) such that f(a,b) = 0 but 
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Di f (a, b) ^ 0. (Cf. for instance the example at the end of §4, Chapter XII, and 
Exercise 18 of that chapter.) One could also give direct proofs here. In this 
way, we obtain homomorphisms 

/^[^ 1 ,..., x„] —► R , 

thereby proving Theorem 3.1. 

Theorem 3.6. Let k be a real field, K = k(x u ..., x n , y) = fc(x, y) a 
finitely generated extension such that x ì9 ...,x n are algebraically independent 
over k, and y is algebraic over k(x). Let f ( X, Y) be the irreducible polynomial 
in k\_X, T] such that f(x, y) = 0. Let Rbe a real closed field containing k, 
and assume that there exists (a, b)eR (n+ì) such that f (a, b) = 0 but 

D n + if(a, b) ï 0. 

Then K is reaì. 

Proof. Let tt n be algebraically independent over R. Inductively, we 
can put an ordering on R(t u ..., t n ) such that each t { is infinitely small with 
respect to R , (cf. the example in §1). Let R' be a real closure of R(t u ..., t n ) 
preserving the ordering. Let u t = a { -f t t for each i= ì,...,n. Then f(u, b -h h) 
changes sign for small h positive and negative in R , and hence f(u, Y) has a 
root in R', say v. Since / is irreducible, the isomorphism of k(x) on k(u) sending 
x t on Ui extends to an embedding of k(x, y) into R', and hence K is real, as was to 
be shown. 

In the language of algebraic geometry, Theorems 3.1 and 3.6 state that the 
function field of a variety over a real field k is real if and only if the variety has a 
simple point in some real closure of k. 


EXERCISES 


1. Let a be algebraic over Q and assume that Q(a) is a real field. Prove that a is a sum of 
squares m Q(a) if and only if for every embedding o of Q(a) in R we have o<x > 0. 

2. Let F be a finite extension of Q. Let q>: F -* Q be a Q-linear functional such that 
(p(x 2 ) > 0 for all xe F,x ^ 0. Let a e F, a # 0. If q>( ax 2 ) ^ 0 for all xeF, show that a is 
a sum of squares in F, and that F is totally real, i.e. every embedding of F in the complex 
numbers is contained in the real numbers. [Hint: Use the fact that the trace gives an 
identification of F with its dual space over Q, and use the approximation theorem of 
Chapter XII, §1.] 
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3. Let a ^ t g p be a real interval, and let /(í) be a real polynomial which is positive on this 
mterval. Show that f(t) can be written in the form 

c (Z ôv 2 + Z ( f - a )Ql + Y.(P - QQl) 

where Q 2 denotes a square, and c ^ 0. Hint : Split the polynomial, and use the ìdentity: 


(t - a)(fi -t) = 


(t - «Ýjp - t) + (t - a)(fi - t) 2 

D - « . 


Remark. The above seemingly innocuous result is a key step in developing the 
spectral theorem for bounded hermitian operators on Hilbert space. See the appendix 
of [La 72] and also [La 85]. 

4. Show that the field of real numbers has only the identity automorphism. [Hint\ Show 
that an automorphism preserves the ordering.] 

Real places 

For the next exercises, cf. Krull [Kr 32] and Lang [La 53]. These exercises form a 
connected sequence, and solutions will be found in [La 53]. 

5. Let K be a field and suppose that there exists a real place of K\ that is, a place (p 
with values in a real field L. Show that K is real. 

6. Let K be an ordered real field and let Fbea subfield which is maximal archimedean 
in K. Show that the canonical place of K with respect to F is algebraic over F (i.e. 
if o is the valuation ring of elements of K which are not infinitely large over F, and 
m is its maximal ideal, then o/m is algebraic over F). 

7. Let K be an ordered field and let F be a subfield which is maximal archimedean in 
K. Let K' be the real closure of K (preserving the ordering), and let F' be the real 
closure of F contained in K'. Let (p be the canonical place of K' with respect to F'. 
Show that (p(K') is F'-valued, and that the restriction of (p to K is equivalent to the 
canonical place of K over F. 

8. Define a real field K to be quadratically closed if for all a E K either Vâ or 
V— a lies in K. The ordering of a quadratically closed real field K is then uniquely 
determined, and so is the real closure of such a field, up to an isomorphism over K. 
Suppose that K is quadratically closed. Let F be a subfield of K and suppose that 
F is maximal archimedean in K. Let (p be a place of K over F, with values in a 
field which is algebraic over F. Show that (p is equivalent to the canonical place of 
K over F. 

9. Let K be a quadratically closed real field. Let (p be a real place of K , taking its values 
in a real closed field R. Let F be a maximal subfield of K such that (p is an isomorphism 
on F, and identify F with <p(F). Show that such F exists and is maximal archimedean 
in K. Show that the image of ç> is algebraic over F, and that (p is induced by the 
canonical place of K over F. 

10. Let K be a real field and let (p be a real place of K , taking its values in a real closed 
field R. Show that there is an extension of (p to an F-valued place of a real closure 
of K. [ Hint : first extend (p to a quadratic closure of K. Then use Exercise 5.] 
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11. Let K C K x C K 2 b e real closed fields. Suppose that K is maximal archimedean in 
K x and K x is maximal archimedean in K 2 . Show that K is maximal archimedean in 
K 2 . 

12. Let K be a real closed field. Show that there exists a real closed field R containing 
K and having arbitrarily large transcendence degree over K , and such that K is maximal 
archimedean in R. 

13. Let R be a real closed field. Let f l9 ..., f r be homogeneous polynomials of odd 
degrees in n variables over R. If n > r, show that these polynomials have a non- 
trivial common zero in R. ( Comments : If the forms are generic (in the sense of Chapter 
IX), and n = r + 1, it is a theorem of Bezout that in the algebraic closure R a the 
forms have exactly d { • • • d m common zeros, where d, is the degree of f. You may 
assume this to prove the result as stated. If you want to see this worked out, see 
[La 53], Theorem 15. Compare with Exercise 3 of Chapter IX.) 
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CHAPTER XII 

Absolute Values 


§1. DEFINITIONS, DEPENDENCE, AND 
INDEPENDENCE 

Let K be a field. An absolute value v on K is a real-valued function xt-+\x\ v 
on K satisfying the following three properties: 

AV 1. We have | jc|„ ^ 0 for all xe K, and 1x^ = 0 if and only if x = 0. 

AV 2. For all x, ye K y we have \xy\ v = | x\ v \y\ v . 

AV 3. For all xjeX, we have |x + y\ v ^ |x^ + | y \ v . 

If instead of AV 3 the absolute value satisfìes the stronger condition 

AV 4 . | x + y\ v ^ maxdxl,, \y\ v ) 

then we shall say that it is a valuation, or that it is non-archimedean. 

The absolute value which is such that | x |„ = 1 for all x # 0 is called trivial. 
We shall write | x | instead of | x if we deal with just one fixed absolute value. 
We also refer to v as the absolute value. 

An absolute value of K defines a metric. The distance between two elements 
x, y of K in this metric is |x — y |. Thus an absolute value defines a topology on 
K. Two absolute values are called dependent if they define the same topology. 
If they do not, they are called independent. 

We observe that 111 = 11 2 1 = |( — l) 2 1 = 111 2 whence 


| 1 | = |- 1 | = 1 . 


Also, | - x | = | x | for all xe K, and | x 1 1 = | x | 1 for x + 0. 
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Proposition 1.1. Let | | t and | | 2 be non-trivial absolute values on a field K. 
They are dependent if and only if the relation 

Mi < 1 

implies |x| 2 < 1. If they are dependent , then there exists a number X > 0 
such that |x | ^ = | x | 2 for all xe K. 

Proof If the two absolute values are dependent, then our condition is 
satisfied, because the set of xe K such that \x\ x < 1 is the same as the set such 
that lim x n = 0 for n -* oo. Conversely, assume the condition satisfied. Then 
|x|^ > 1 implies |x| 2 > 1 since Ix -1 ^ < 1. By hypothesis, there exists an 
element x 0 e K such that |x 0 |i > 1. Let a = |x 0 1j and b = |x 0 | 2 . Let 

= logfr 

log a 

Let xeK,x ^ 0. Then|x|! = \x 0 \\ forsomenumbera. Ifm, n are integers such 
that m/n > <x and n > 0, we have 

i*ii > ix 0 ir 


whence 


| x”/x 0 1 1 < 1, 


and thus 


1*7*0 12 < 1. 

This implies that |x| 2 < |x 0 | 2 7 Hence 

I * I2 = I *o 1 2 • 

Similarly, one proves the reverse inequality, and thus one gets 

I * I2 = I *o I2 

for all xeK, x ^ 0. The assertion of the proposition is now obvious, i.e. 
|*| 2 = |x|{. 

We shall give some examples of absolute values. 

Consider first the rational numbers. We have the ordinary absolute value 
such that |m| = m for any positive integer m. 

For each prime number p, we have the p-adic absolute value v p , defined by the 
formula 


\p r m/n\ p = l/p' 



XII, §1 


DEFINITIONS, DEPENDENCE, AND INDEPENDENCE 467 


where r is an integer, and m, n are integers / 0, not divisible by p. One sees at 
once that the p-adic absolute value is non-archimedean. 

One can give a similar definition of a valuation for any field K which is the 
quotient field of a principal ring. For instance, let K = k(t) where k is a field 
and t is a variable over k. We have a valuation v p for each irreducible polynomial 
p(t) in /c[í], defined as for the rational numbers, but there is no way of normalizing 
it in a natural way. Thus we select a number c with 0 < c < 1 and for any 
rational function p r f /g where /, g are polynomials not divisible by p, we define 

\p r f/g\ P = c r . 


The various choices of the constant c give rise to dependent valuations. 

Any subfield of the complex numbers (or real numbers) has an absolute 
value, induced by the ordinary absolute value on the complex numbers. We shall 
see later how to obtain absolute values on certain fìelds by embedding them into 
others which are already endowed with natural absolute values. 

Suppose that we have an absolute value on a field which is bounded on the 
prime ring (i.e. the integers Z if the characteristic is 0, or the integers mod p if 
the characteristic is p). Then the absolute value is necessarily non-archimedean. 

Proof. For any elements x, y and any positive integer n, we have 



^ nC max(|x|, |y|)". 


Taking n- th roots and letting n go to infìnity proves our assertion. We note that 
this is always the case in characteristic > 0 because the prime ring is finite! 

If the absolute value is archimedean, then we refer the reader to any other 
book in which there is a discussion of absolute values for a proof of the fact that 
it is dependent on the ordinary absolute value. This fact is essentially useless 
(and is never used in the sequel), because we always start with a concretely given 
set of absolute values on fields which interest us. 

In Proposition 1.1 we derived a strong condition on dependent absolute 
values. We shall now derive a condition on independent ones. 


Theorem 1.2. (Approximation Theorem). (Artin-Whaples). Let K be 
afield and | |j,..., | | s non-trivial pairwise independent absolute values on K. 
Let x x s be elements of X, and e > 0. Then there exists xeK such that 

|x - x^ < e 


for all i. 
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Proof. Consider first two of our absolute values, say v^ and v 2 . By hypo- 
thesis we can find ote K such that | ot 1 1 < 1 and | ot | s ^ 1. Similarly, we can find 
feK suchthat |j?|j ^ 1 and \P\ S < 1. Puty = f/ot. Then \y\^ > 1 and |y| s < 1. 

We shall now prove that there exists ze K such that |z|j > 1 and \z\j < 1 
for j = 2,. .., s. We prove this by induction, the case s = 2 having just been 
proved. Suppose we have found ze K satisfying 

|z|j > 1 and \z\j < 1 for y* = 2, ...,s— 1. 

If | z | s ^ 1 then the element z w y for large n will satisfy our requirements. 

If | z | s > 1, then the sequence 


tendsto 1 atuj andi; s ,andtendstoOat Vj(J = 2,..., s — 1). For largen,it isthen 
clear that t n y satisfies our requirements. 

Using the element z that we have just constructed, we see that the sequence 
z"/(l + z n ) tends to 1 at v t and to 0 at v } for j = 2,..., s. For each i = 1,..., s 
we can therefore construct an element z, which is very close to 1 at v, and very 
close to 0 at v, (j i ). The element 

X = Z ^X ^ + ••• + Z s x s 

then satisfies the requirement of the theorem. 


§2. COMPLETIONS 

Let K be a field with a non-trivial absolute value v, which will remain fixed 
throughout this section. One can then define in the usual manner the notion of a 
Cauchy sequence. It is a sequence {x n } of elements in K such that, given e > 0, 
there exists an integer N such that for all n,m > N we have 


l*„ - *ml < e- 


We say that K is complete if every Cauchy sequence converges. 

Proposition 2.1. There exists a pair ( K v , í) consisting of afield K vi complete 
under an absolute value , and an embedding i: K -> K v such that the absolute 
value on K is induced by that of K v (i.e. \x\ v = \ix\ for x e K ), and such that iK 
is dense in K v . If (K v , i') is another such pair , then there exists a unique 
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isomorphism <p: K v -> K' v preserving the absolute values , and making the 
following diagram commutative: 


K„ 


K 


K 

Proof. The uniqueness is obvious. One proves the existence in the well- 
known manner, which we shall now recall briefly, leaving the details to the reader. 

The Cauchy sequences form a ring, addition and multiplication being taken 
componentwise. 

One defines a null sequence to be a sequence {x„} such that lim x n = 0. The 

n -*• oo 

null sequences form an ideal in the ring of Cauchy sequences, and in fact form a 
maximal ideal. (If a Cauchy sequence is not a null sequence, then it stays away 
from 0 for all n sufficiently large, and one can then take the inverse of almost all 
its terms. Up to a finite number ofterms, one then gets again a Cauchy sequence.) 

The residue class field of Cauchy sequences modulo null sequences is the 
field K v . We embed K in K v “on the diagonal”, i.e. send xe K on the sequence 
(x, x, x,...). 

We extend the absolute value of K to K v by continuity. If {x n } is a Cauchy 
sequence, representing an element Ç in K vì we define |<^| = lim|xj. It is easily 
proved that this yields an absolute value (independent of the choice of repre- 
sentative sequence {x n } for £), and this absolute value induces the given one on K. 

Finally, one proves that K v is complete. Let {<^ n } be a- Cauchy sequence in 
K v . For each n , we can find an element x n e K such that \Ç n — x n \ < 1/n. Then 
one verifies immediately that {x„} is a Cauchy sequence in K. We let £ be its 
limit in K v . By a three-r argument, one sees that {<!;„} converges to thus 
proving the completeness. 

A pair ( K v , i) as in Proposition 2.1 may be called a completion of K. The 
standard pair obtained by the preceding construction could be called the 
completion of K. 

Let K have a non-trivial archimedean absolute value v. If one knows that the 
restriction of v to the rationals is dependent on the ordinary absolute value, then 
the completion K v is a complete field, containing the completion of Q as a 
closed subfield, i.e. containing the real numbers R as a closed subfield. It will be 
worthwhile to state the theorem of Gelfand-Mazur concerning the structure of 
such fields. First we define the notion of normed vector space. 

Let K be a field with a non-trivial absolute value, and let E be a vector space 
over K. By a norm on E (compatible with the absolute value of K) we shall 
mean a function Ç -► | Ç | of E into the real numbers such that: 


NO 1. | Ç | ^ 0 for all Ç e £, and = 0 if and only if Ç = 0. 
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NO 2. For all jc e K and £ e E we have = M|£|. 

NO 3. If Ç, Ç' e E then | £ + <f | á | £ | + | <f | • 

Two norms | |, and | | 2 are called equivalent if there exist numbers C ly C 2 > 0 
such that for all i $e E we have 

c,ia ^ iíi 2 ^ c 2 iíIj. 

Suppose that E is finite dimensional, and let a> l5 ..., u>„ be a basis of E 
over K. If we write an element 

£ = *iû>i + • • • + x„a>„ 

in terms of this basis, with x, e K, then we can define a norm by putting 

lÉl = max|x,|. 

i 

The three properties defining a norm are trivially satisfied. 

Proposition 2.2. Let Kbe a complete field under a non-trivial absolute value , 
and let E be a finite-dimensional space over K. Then any two norms on E 
(compatible with the given absolute value on K) are equivalent. 

Proof. We shall first prove that the topology on E is that of a product space, 
i.e. if (o ì9 ..., co n is a basis of E over K , then a sequence 

£ (v) = x^co ! + • • • + x|j V) co n , x\ v) e X, 

is a Cauchy sequence in E only if each one of the n sequences x (v) is a Cauchy 
sequence in K. We do this by induction on n. It is obvious for n = 1. Assume 
n > 2. We consider a sequence as above, and without loss of generality, we may 
assume that it converges to 0. (If necessary, consider £ (v) - Ç ifl) for v, p -► oo.) 
We must then show that the sequences of the coefficients converge to 0 also. 
If this is not the case, then there exists a number a > 0 such that we have for 
some j, say 7=1, 


| | > a 

for arbitrarily large v. Thus for a subsequence of (v), £ (v) /x ( 1 v) converges to 0, and 
we can write 


£(V) 


, x (v) 

x í 

= W )C ° 2 

. . + _ 
X ( 1 V) 


We let y] ( x) be the right-hand side of this equation. Then the subsequence rf v) 
converges (according to the left-hand side of our equation). By induction, we 
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conclude that its coefficients in terms of a > 2 ,..., a> n also converge in K , say to 
y 2 ,. • •, y„- Taking the limit, we get 

coi = y 2 co 2 + ••• + y n oj „, 

contradicting the linear independence of the co, . 

We must finally see that two norms inducing the same topology are equivalent. 
Let | | ^ and | | 2 be these norms. There exists a number C > 0 such that for any 
^ e E we have 


\Z\i âc implies |{| 2 gl. 

Let a e K be such that 0 < | a \ < 1. For every Ç e E there exists a unique integer 
s such that 


C|a| < \a s Ç\i ^ C. 

Hence \a s Ç\ 2 á 1 whence we get at once 

i{i 2 ác- i i«r 1 i€ii. 

The other inequality follows by symmetry, with a similar constant. 

Theorem 2.3. (Gelfand-Mazur). Let A be a commutative algebra over the 
real numbers, and assume that A contains an element j such that j 2 = — 1. Let 
C = R + R j. Assume that A is normed (as a vector space over R), and that 
\xy \ ^ |*| \y\for all x, y e A. Given x 0 e A, * 0 + 0, there exists an element 
c e C such that x 0 — c is not invertible in A . 

Proof. (Tornheim). Assume that x 0 — z is invertible for all zeC. 
Consider the mapping /: C -► A defined by 

/(z) = (x 0 - z)-\ 

It is easily verified (as usual) that taking inverses is a continuous operation. 
Hence / is continuous, and for z # 0 we have 

/(z) = z" \x 0 z ~ 1 - 1)" 1 = - / - ! - V 

From this we see that /(z) approaches 0 when z goes to infinity (in C). Hence the 
map z i—► | /(z) | is a continuous map of C into the real numbers ^ 0, is bounded, 
and is small outside some large circle. Hence it has a maximum, say M. Let D 
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be the set of elements z e C such that |/(z)| = M. Then D is not empty; D is 
bounded and closed. We shall prove that D is open, hence a contradiction. 

Let c 0 be a point of D, which, after a translation, we may assume to be the 
origin. We shall see that if r is real > 0 and small, then all points on the circle of 
radius r lie in D. Indeed, consider the sum 


1 n 1 

m = - E 


n ^ x 0 - arr 

where co is a primitive n-th root of unity. Taking formally the logarithmic 

n 

derivative of X n — r n = Y\ (X ~ s hows that 

k = 1 

nX n ~ 1 _ n 1 
X n - r n - coV 

and hence, dividing by n, and by X n ~ \ and substituting x 0 for X , we obtain 

S(n) = x 0 - r(r/x 0 ) n ~ l ' 

If r is small (say |r/x 0 | < 1), then we see that 


lim |S(n)| = 


= M. 


Suppose that there exists a complex number A of absolute value 1 such that 

1 


x 0 — Àr 


< M. 


Then there exists an interval on the unit circle near A, and there exists e > 0 such 
that for all roots of unity C lying in this interval, we have 


1 

x 0 - Cr 


<M - e. 


(This is true by continuity.) Let us take n very large. Let b n be the number of 
n-th roots of unity lying in our interval. Then bjn is approximately equal to the 
length of the interval (times 2n)\ We can express S(n) as a sum 


S(n) = - 


S, 


x 0 - (tì k r 


+ 2n 


x 0 — <tì K r 


1 
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the first sum being taken over those roots of unity co k lying in our interval, and 
the second sum being taken over the others. Each term in the second sum has 
norm ^ M because M is a maximum. Hence we obtain the estimate 


|S(n)| ^ i [|X,I + llnl] 

g ~ (b„(M -e) + (n- b n )M) 

g Aí - — 

n 

This contradicts the fact that the limit of | S(n) | is equal to M. 


Corollary 2.4. Let K be a field , which is an extension of R, and has an 
absolute value extending the ordinary absolute value on R. Then K = R or 
K = C. 

Proofi Assume first that K contains C. Then the assumption that K is a 
field and Theorem 2.3 imply that K = C. 

If K does not contain C, in other words, does not contain a square root of 
- 1, we let L = K(j) where j 2 = — 1. We define a norm on L (as an R-space) by 
putting 


\x + yj | = M + |y| 

for x, yeK. This clearly makes L into a normed R-space. Furthermore, if 
z = x + yj and z' = x' + y'j are in L, then 

|zz'| = | xx' - yy’\ + | xy’ + x’y\ 
â \xx’\ + \yy'\ + \xy’\ + |x'y| 

á l*ll*'l + \y\\y’\ + 1*11/1 + l*'lly| 

^ (1*1 + lyl)(l*'l + 1/1) 

^ \z\\z’\, 

and we can therefore apply Theorem 2.3 again to conclude the proof. 

As an important application of Proposition 2.2, we have: 

Proposition 2.5. Let K be complete with respect to a nontrivial absolute 
value v. IfE is any algebraic extension of K , then v has a unique extension to 
E. If E is finite over K , then E is complete. 

Proof. In the archimedean case, the existence is obvious since we deal 
with the real and complex numbers. In the non-archimedean case, we postpone 
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the existence proof to a later section. It uses entirely different ideas from the 
present ones. As to uniqueness, we may assume that E is finite over K. By 
Proposition 2.2, an extension of v to E defines the same topology as the max 
norm obtained in terms of a basis as above. Given a Cauchy sequence ^ (v) in £, 

(^ (V) = X vX (tí x -f • • • + x vn co n9 

the n sequences {x vi }(i = 1 must be Cauchy sequences in K by the 

definition of the max norm. If {x vi } converges to an element z f in K 9 then it 
is clear that the sequence ^ (v) converges to ZjCOj -f ••• -f z n oj n . Hence E is 
complete. Furthermore, since any two extensions of v to E are equivalent, 
we can apply Proposition 1.1, and we see that we must have À = 1, since the 
extensions induce the same absolute value v on K. This proves what we want. 

From the uniqueness we can get an explicit determination of the absolute 
value on an algebraic extension of K. Observe first that if E is a normal extension 
of K, and <r is an automorphism of E over K , then the function 

X I—► | GX | 

is an absolute value on E extending that of K. Hence we must have 

\gx\ = |x| 

for all xeE. If E is algebraic over K , and a is an embedding of E over K in K a , 
then the same conclusion remains valid, as one sees immediately by embedding 
E in a normal extension of K. In particular, if oc is algebraic over K , of degree n , 
and if a j,..., a n are its conjugates (counting multiplicities, equal to the degree of 
inseparability), then all the absolute values |a.| are equal. Denoting by N 
the norm from K(oc) to X, we see that 

\ n (*)\ = i«r. 


and taking the n-th root, we get: 

Proposition 2.6. Let K be complete with respect to a non-trivial absolute 
value. Let oc be algebraic over K , and ìet N be the norm from K(oc) to K. Let 
n = lK(oc) :Kl Then 


\*\ = \N(oc)\" n . 

In the special case of the complex numbers over the real numbers, we can 
write oc = a -f bi with a, b e R, and we see that the formula of Proposition 2.6 is 
a generalization of the formula for the absolute value of a complex number, 

oc = (a 2 -f ò 2 ) 1/2 , 


since a 2 -f b 2 is none other than the norm of oc from C to R. 
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Comments and examples. The process of completion is widespread in 
mathematics. The fìrst example occurs in getting the real numbers from the 
rational numbers, with the added property of ordering. I carry this process out 
in full in [La 90a], Chapter IX, §3. In all other examples I know, the ordering 
property does not intervene. We have seen examples of completions of fields in 
this chapter, especially with the p -adic absolute values which are far away from 
ordering the field. But the real numbers are nevertheless needed as the range of 
values of absolute values, or more generally norms. 

In analysis, one completes various spaces with various norms. Let V be a 
vector space over the complex numbers, say. For many applications, one must 
also deal with a seminorm, which satisfies the same conditions except that in 
NO 1 we require only that ||£|| ^ 0. We allow ||£|| = 0 even if £ 0. 

One may then form the space of Cauchy sequences, the subspace of null 
sequences, and the factor space V. The seminorm can be extended to a seminorm 
on V by continuity, and this extension actually turns out to be a norm. It is a 
general fact that V is then complete under this extension. A Banach space is a 
complete normed vector space. 

Example. Let V be the vector space of step functions on R, a step function 
being a complex valued function which is a finite sum of characteristic functions 
of intervals (closed, open, or semiclosed, i.e. the intervals may or may not 
contain their endpoints). For/e V we define the L^-seminorm by 

ll/lli = J l/U)l dx. 

R 

The completion of V with respect to this seminorm is defined to be L ! (R). One 
then wants to get a better idea of what elements of L x { R) look like. It is a simple 
lemma that given an L 1 -Cauchy sequence in V, and given e > 0, there exists a 
subsequence which converges uniformly except on a set of measure less than e. 
Thus elements of L x { R) can be identified with pointwise limits of L x -C auchy 
sequences in V . The reader will find details carried out in [La 85]. 

Analysts use other norms or seminorms, of course, and other spaces, such 
as the space of C 00 functions on R with compact support, and norms which may 
bound the derivatives. There is no end to the possible variations. 

Theorem 2.3 and Corollary 2.4 are also used in the theory of Banach algebras, 
representing a certain type of Banach algebra as the algebra of continuous func- 
tions on a compact space, with the Gelfand-Mazur and Gelfand-Naimark theo- 
rems. Cf. [Ri 60] and [Ru 73]. 

Arithmetic example. For p -adic Banach spaces in connection with the 
number theoretic work of Dwork, see for instance Serre [Se 62], or also 
[La 90b], Chapter 15. 

In this book we limit ourselves to complete fields and their finite extensions. 
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§3. FINITE EXTENSIONS 

Throughout this section we shall deal with a fìeld X having a non-trivial 
absolute value v. 

We wish to describe how this absolute value extends to fìnite extensions of K. 
If E is an extension of K and vv is an absolute value on E extending v, then we shall 
write vv | v. 

If we let K v be the completion, we know that v can be extended to K v , and 
then uniquely to its algebraic closure K a v . If E is a finite extension of K, or even 
an algebraic one, then we can extend v to E by embedding E in K a v by an iso- 
morphism over X, and taking the induced absolute value on E. We shall now 
prove that every extension of v can be obtained in this manner. 

Proposition 3.1. Let E be a finite extension of K. Let vv be an absolute value 

on E extending v , and let E w be the completion. Let K w be the closure of K in 

E w and identify E in E w . Then E w = EK W (the composite field). 

Proof. We observe that K w is a completion of X, and that the composite 
field EK W is algebraic over K w and therefore complete by Proposition 2.5. Since 
it contains E, it follows that E is dense in it, and hence that E w = EK W . 

If we start with an embedding o\E -► K a v (always assumed to be over X), 
then we know again by Proposition 2.5 that oE • K v is complete. Thus this 
construction and the construction of the proposition are essentially the same, up 
to an isomorphism. In the future, we take the embedding point of view. We 
must now determine when two embeddings give us the same absolute value on E. 

Given two embeddings o, x : E -+ K a v , we shall say that they are conjugate 
over K v if there exists an automorphism A of K a v over K v such that o = At. We 
see that actually A is determined by its efifect on zE, or zE ■ K v . 
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Proposition 3.2. Let E be an algebraic extension of K. Two embeddings 
o 9 t : E -► K* v give rise to the same absolute value on E if and only if they are 
conjugate over K v . 

Proof. Suppose they are conjugate over K v . Then the uniqueness of the 
extension of the absolute value from K v to K* v guarantees that the induced 
absolute values on E are equal. Conversely, suppose this is the case. Let 
k : tE -> oE be an isomorphism over K. We shall prove that À extends to an 
isomorphism of xE K v onto cE K v over K v . Since t E is dense in t E K v9 
an element xetE K v can be written 

x = lim xx n 

with x n e E. Since the absolute values induced by cr and t on E coincide, it 
follows that the sequence hx n = ox n converges to an element of oE • K v which 
we denote by kx. One then verifies immediately that kx is independent of the 
particular sequence tx„ used, and that the map k: tE • K v -+ oE • K v is an iso- 
morphism, which clearly leaves K v fìxed. This proves our proposition. 

In view of the previous two propositions, if vv is an extension of v to a finite 
extension E of K , then we may identify E w and a composite extension EK V of E 
and K v . If N = [E : K] is finite, then we shall call 

N w = ÍE W :K V -] 


the local degree. 

Proposition 3.3. Let Ebea finite separable extension of K , of degree N. Then 

w\v 

Proof. We can write E = K(oc) for a single element a. Let f(X) be its 
irreducible polynomial over K. Then over K v , we have a decomposition 

f(X) = ffX^-'-ffX) 

into irreducible factors f(X). They all appear with multiplicity 1 according to 
our hypothesis of separability. The embeddings of E into K a v correspond to the 
maps of a onto the roots of the f. Two embeddings are conjugate if and only if 
they map a onto roots of the same polynomial f. On the other hand, it is clear 
that the local degree in each case is precisely the degree of f. This proves our 
proposition. 

Proposition 3.4. Let Ebe a finite extension of K. Then 

w|y 
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IfE is purely inseparable over X, then there exists only one absolute value vv on 
E extending v. 

Proof Let us first prove the second statement. If E is purely inseparable 
over X, and p r is its inseparable degree, then ot pr e X for every a in E. Hence v has 
a unique extension to E. Consider now the general case of a finite extension, and 
let F = E P K. Then F is separable over K and E is purely inseparable over F. 
By the preceding proposition, 

E \F W :K V } = [F:K], 

W\V 

and for each vv, we have [ E w : F J g [£: F]. From this our inequality in the 
statement of the proposition is obvious. 

Whenever v is an absolute value on K such that for any finite extension E of K 
we have [£: K] = Z [F M : K v ~\ we shall say that v is well behaved. Suppose we 

vvlf 

have a tower of finite extensions, L =5 E zd K. Let vv range over the absolute 
values of E extending u, and u over those of L extending v. If u\w then L u 
contains E w . Thuswehave: 

Z ÍL U :K V \= Z Z[L U :£J[£,:KJ 

u\v w|p u|w 

w|p u|w 

^ Z[£ w : XJ[L:£] 

w|p 

^ [£: K] [L: £]. 

From this we immediately see that if v is well behaved, E finite over X, and vv 
extends v on L, then vv is well behaved (we must have an equality everywhere). 

Let E be a finite extension of K. Let p r be its inseparable degree. We recall 
that the norm of an element a e K is given by the formula 

N &:«) = n 

a 

where o ranges over all distinct isomorphisms of E over K (into a given algebraic 
closure). 

If vv is an absolute value extending v on L, then the norm from E w to K v will 
be called the local norm. 

Replacing the above product by a sum, we get the trace, and the local trace. 
We abbreviate the trace by Tr. 

Proposition 3.8. Let Ebe a finite extension of X, and assume that v is well 
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behaved. Let ote E. Then: 


NU*) = n Nftot) 

w\v 


Tr|(«) = £Tr£ : (a) 


Proof. Suppose first that E = K(oì), and let f(X) be the irreducible poly- 
nomial of oc over K. If we factor /( X) into irreducible terms over K v , then 


f(X) = f x (X)-f r (X) 


where each /( X) is irreducible, and the f are distinct because of our hypothesis 
that v is well behaved. The norm N^( a) is equal to (— l) deg/ times the constant 
term of /, and similarly for each f. Since the constant term of / is equal to the 
product of the constant terms of the /, we get the first part of the proposition. 
The statement for the trace follows by looking at the penultimate coefficient of/ 
and each /. 

If E is not equal to K(a ), then we simply use the transitivity of the norm and 
trace. We leave the details to the reader. 

Onecan also argue directly on the embeddings. Let a l9 ..., o m be thedistinct 
embeddings of E into K* v over K, and let p r be the inseparable degree of E 
over K. The inseparable degree of oE • K v over K v for any o is at most equal 
to p r . If we separate o í9 ...,o m into distinct conjugacy classes over K V9 
then from our hypothesis that v is well behaved, we conclude at once that the 
inseparable degree of OiE K v over K v must be equal to p r also, for each i. 
Thus the formula giving the norm as a product over conjugates with multi- 
plicity p r breaks up into a product of factors corresponding to the conjugacy 
classes over K v . 

Taking into account Proposition 2.6, we have: 

Proposition 3.6. Let K have a well-behaved absolute value v. Let E be a 

fnite extension of K, and oce E. Let 

N W = [E W :K V ] 

for each absolute value w on E extending v. Then 



480 ABSOLUTE VALUES 


XII, §4 


§4. VALUATIONS 

In this section, we shall obtain, among other things, the existence theorem 
concerning the possibility of extending non-archimedean absolute values to 
algebraic extensions. We introduce first a generalization of the notion of non- 
archimedean absolute value. 

Let T be a multiplicative commutative group. We shall say that an ordering 
is defined in T if we are given a subset S of T closed under multiplication such 
that T is the disjoint union of S , the unit element 1 , and the set S~ 1 consisting of 
all inverses of elements of S. 

If a, P e T we define oc < /? to mean ajS “ 1 e S. We have a < 1 if and only if 
a e 5. One easily verifies the following properties of the relation < : 

1 . For a, T we have a < /?, or a = /?, or /? < a, and these possibilities 
are mutually exclusive. 

2 . a < j? implies ocy < jSy for any y e T. 

3. a < j8 and /? < y implies a < y. 

(Conversely, a relation satisfying the three properties gives rise to a subset S 
consisting of all elements < 1. However, we don’t need this fact in the sequel.) 

It is convenient to attach to an ordered group formally an extra element 0, 
such that 0a = 0, and 0 < a for all a e T. The ordered group is then analogous 
to the multiplicative group of positive reals, except that there may be non- 
archimedean ordering. 

If a e T and n is an integer # 0, such that oc n = 1, then oc = 1. This follows at 
once from the assumption that S is closed under multiplication and does not 
contain 1 . In particular, the map oc i—► oc n is injective. 

Let K be a field. By a valuation of K we shall mean a map x i—► |x | of K into 
an ordered group T, together with the extra element 0 , such that: 

VAL 1. |x| = 0 if and only if x = 0. 

VAL 2. | xy | = | x 11 y | for all x, y e K. 

VAL 3. | x -F y | ^ max( | x |, | y \). 

We see that a valuation gives rise to a homomorphism of the multiplicative 
group K* into T. The valuation is called trivial if it maps K* on 1. If the map 
giving the valuation is not surjective, then its image is an ordered subgroup of T, 
and by taking its restriction to this image, we obtain a valuation onto an ordered 
group, called the value group. 

We shall denote valuations also by v. If v í9 v 2 are two valuations of X, we 
shall say that they are equivalent if there exists an order-preserving isomorphism 
/ of the image of v x onto the image of v 2 such that 


|x | 2 = A|xU 
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for all x e K. (We agree that A(0) = 0.) 

Valuations have additional properties, like absolute values. For instance, 
111 = 1 because 111 = 111 2 . Furthermore, 

I ± x\ = |x| 

for all xeK. Proof obvious. Also, if |x| < \y\ then 

\x + y\ = \y\. 

To see this, note that under our hypothesis, we have 
\y\ = \y + x - x| ^ max(|y -f x|, |x|) = |x + y\ ^ max(|x|, \y\) = \y\. 
Finally, in a sum 


x i + • • • + x n = 0, 

at least two elements of the sum have the same value. This is an immediate 
consequence of the preceding remark. 

Let K be a field. A subring o of K is called a valuation ring if it has the 
property that for any x e K we have xeoor x _1 eo. 

We shall now see that valuation rings give rise to valuations. Let o be a 
valuation ring of K and let U be the group of units of o. We contend that o is a 
local ring. Indeed suppose that xjeo are not units. Say x/y e o. Then 

1 + x/y = (x + y)/y e o. 

If x -f y were a unit then 1 /y e o, contradicting the assumption that y is not a unit. 
Hence x -f y is not a unit. One sees trivially that for zeo,zx is not a unit. Hence 
the nonunits form an ideal, which must therefore be the unique maximal ideal 
of o. 

Let m be the maximal ideal of o and let m* be the multiplicative system of 
nonzero elements of m. Then 

K* = m* u U u m* " 1 

is the disjoint union of m*, t/, and m*~\ The factor group K*/U can now be 
given an ordering. If x e X*, we denote the coset xU by |x|. We put |0| = 0. 
We define |x| < 1 (i.e. |x| e S) if and only if x e m*. Our set S is clearly closed 
under multiplication, and if we let T = K*/U then T is the disjoint union of S, 
1, S~ L In this way we obtain a valuation of K. 

We note that if x, y e K and x, y # 0, then 

Ixj < \y\o\x/y \ < 1 ox/yem*. 

Conversely, given a valuation of K into an ordered group we let o be the 
subset of K consisting of all x such that |x| < 1. It follows at once from the 
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axioms of a valuation that o is a ring. If |x| < 1 then |x“ 1 1 > 1 so that x“ 1 is 
not in o. If |x| = 1 then |x _1 1 = 1. We see that o is a valuation ring, whose 
maximal ideal consists of those elements x with | x | < 1 and whose units consist 
of those elements x with | x | = 1. The reader will immediately verify that there is 
a bijection between valuation rings of K and equivalence classes of valuations. 

The extension theorem for places and valuation rings in Chapter VII now 
gives us immediately the extension theorem for valuations. 

Theorem 4.1. Let Kbe a subfield of a field L. Then a valuation on K has an 
extension to a valuation on L. 

Proof. Let o be the valuation ring on K corresponding to the given valua- 
tion. Let cp : o -► o/m be the canonical homomorphism on the residue class field, 
and extend cp to a homomorphism of a valuation ring O of L as in §3 of Chapter 
VII. Let 2T2 be the maximal ideal of O. Since 99Î D o contains m but does not 
contain 1, it follows that Tl D o = m. Let U' be the group of units of 0. Then 
U' n K = U is the group of units of o. Hence we have a canonical injection 

K*/U - L*/U' 

which is immediately verified to be order-preserving. Identifying K*/U in 
L*/U' we have obtained an extension of our valuation of K to a valuation of L. 

Of course, when we deal with absolute values, we require that the value group 
be a subgroup of the multiplicative reals. Thus we must still prove something 
about the nature of the value group L*/U\ whenever L is algebraic over K. 

Proposition 4.2. Let L be a finite extension of K , of degree n. Let w be a 
valuation of L with value group T'. Let T be the value group of K. Then 
(T': T) ^ n. 

Proof. Let ...,y r be elements of L whose values represent distinct 
cosets of T in F. We shall prove that the y } are linearly independent over K. In 
a relation a^y^ -f • • • -f a r y r = 0 with a^e K , # 0 two terms must have the 

same value, say | a { y t | = \ajyf with i # j\ and hence 

ly.l = l^r 1 «jllyjl- 

This contradicts the assumption that the values of y h y^ (i # j) represent distinct 
cosets of T in F, and proves our proposition. 

Corollary 4.3. There exists an integer e ^ 1 such that the map y y e 
induces an injective homomorphism of F into F 

Proof Take e to be the index (F : T). 
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Corollary 4.4. // K is a field with a valuation v whose value group is an 

ordered subgroup of the ordered group of positive real numbers , and if L is an 
algebraic extension of K, then there exists an extension of v to L whose value 
group is also an ordered subgroup of the positive reals. 

Proof We know that we can extend v to a valuation vv of L with some value 
group P, and the value group T of v can be identified with a subgroup of R + . 
By Corollary 4.3, every element of P has finite period modulo P Since every 
element of R + has a unique e-th root for every integer e ^ 1, we can find in an 
obvious way an order-preserving embedding of P into R + which induces the 
identity on P In this way we get our extension of v to an absolute value on L. 

Corollary 4.5. If L is finite over K, and ifT is infinite cyclic, then P is also 
infinite cyclic. 

Proof. Use Corollary 4.3 and the fact that a subgroup of a cyclic group is 
cyclic. 

We shall now strengthen our preceding proposition to a slightly stronger one. 
We call (P : P) the ramifìcation index. 

Proposition 4.6. Let Lbea finite extension of degree nof a field K , and let O 
be a valuation ring of L. Let be its maximal ideal, let o = O n K, and let m 
be the maximal ideal of o, i.e. m = 9JÌ n o. Then the residue class degree 
[0/9W : o/m] isfinite. Ifwe denote it by f, and if e is the ramification index , then 
ef ^ n. 

Proof. Let y í9 ..., y e be representatives in L* of distinct cosets of P/r and 
let z í9 ..., z s be elements of O whose residue classes mod 9JÏ are linearly inde- 
pendent over o/m. Consider a relation 

Z a u z jy< = 0 

i,j 

with ciij e K, not all a, 7 = 0. In an inner sum 

S 

X a îj z j* 

j= 1 

divide by the coefficient a iv having the biggest valuation. We obtain a linear 
combination of z l9 ..., z s with coefficients in o, and at least one coefficient equal 
to a unit. Since z í9 ... 9 z s are linearly independent mod 501 over o/m, it follows 
that our linear combination is a unit. Hence 


Z a U z J 


j= i 


= \a h 
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for some index v. In the sum 


i (i^)y = 0 

viewed as a sum on i, at least two terms have the same value. This contradicts 
the independence of | y t |,..., | y e | mod T just as in the proof of Proposition 4.2. 

Remark. Our proof also shows that the elements {zjyJ are linearly in- 
dependent over K. This will be used again later. 

If vv is an extension of a valuation v, then the ramification index will be 
denoted by e(w\v) and the residue class degree will be denoted by f(w\v). 

Proposition 4.7. Let K be a field with a valuation v, and let K cz E cz Lbe 
finite extensions of K. Let w be an extension ofvtoE and let u be an extension 
ofwtoL. Then 


e(u | w)e(w | v) = e(u\v), 
f(u\w)f(w\v) = f(u\v). 


Proof. Obvious. 

We can express the above proposition by saying that the ramification index 
and the residue class degree are multiplicative in towers. 

We conclude this section by relating valuation rings in a finite extension with 
the integral closure. 

Proposition 4.8. Let o be a valuation ring in a field K. Let L be a finite 
extension ofK. Let © be a valuation ring ofL lying above o, and its maximal 
ideal. Let B be the integral closure of o in L, and let 93 = 3)î D B. Then © is 
equal to the local ring B%. 

Proof. It is clear that B v is contained in ©. Conversely, let x be an element 
of ©. Then x satisfies an equation with coeíficients in K , not all 0, say 

a n x n -f •• • 4- a 0 = 0, a t e K. 

Suppose that a s is the coefficient having the biggest value among the a { for the 
valuation associated with the valuation ring o, and that it is the coefficient 
farthest to the left having this value. Let b t = ai/a s . Then all b^ e o and 
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Divide the equation by We get 

+ -.- + ft, + 1 x+ 1) 

Let y and z be the two quantities in 
that we can write 



parentheses in the preceding equation, so 


— y = z/x and —xy = z. 


To prove our proposition it will suffice to show that y and z lie in B and that y is 
not in 5p. 

We use Proposition 3.5 of Chapter VII. If a valuation ring of L above 
contains x 9 then it contains y because y is a polynomial in x with coefficients in 
Hence such a valuation ring also contains z = ~xy. If on the other hand the 
valuation ring of L above contains ì/x 9 then it contains z because z is a 
polynomial in ì/x with coefficients in . Hence this valuation ring also contains 
y. From this we conclude by Chapter VII, Proposition 3.5, that y 9 z lie in B. 

Furthermore, since xe O, and b n9 ... 9 b s+í are in 501 by construction, it 
follows that y cannot be in 9W, and hence cannot be in 5p. This concludes the 
proof. 


Corollary 4.9. Let the notation be as in the proposition. Then there is only 
a finite number of valuation rings of L lying above . 


Proof. This comes from the fact that there is only a finite number of 
maximal ideals of B lying above the maximal ideal of o (Corollary of Pro- 
position 2.1, Chapter VII). 

Corollary 4.10. Let the notation be as in the proposition. Assume in addition 
that L is Galois over K. IfC and O' are two valuation rings of L lying above o, 
with maximal ideals 50Î, 9JÌ' respectively 9 then there exists an automorphism a 
of L over K such that <jO = O' and «jîR = 901'. 

Proof. Let $ = On5 and 5p' = O ' n B. By Proposition 2.1 of Chapter 
VII, we know that there exists an automorphism cr of L over K such that 
cr^3 = From this our assertion is obvious. 

Example. Let k be a field, and let K be a finitely generated extension of 
transcendence degree 1. If t is a transcendence base of K over k 9 then K is finite 
algebraic over k(t). Let O be a valuation ring of K containing /c, and assume that 
O is # K. Let o = O n k(t). Then o is obviously a valuation ring of k(t) (the 
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condition about inverses is afortiori satisfied), and the corresponding valuation 
of k(t) cannot be trivial. Either t or t~ 1 e o. Say t e o. Then o n k[t\ cannot be 
the zero ideal, otherwise the canonical homomorphism o -> o/m of o modulo its 
maximal ideal would induce an isomorphism on /c[í] and hence an isomorphism 
on fc(í), contrary to hypothesis. Hence m n /c[í] is a prime ideal p, generated by 
an irreducible polynomial p(t). The local ring /c[í] p is obviously a valuation 
ring, which must be o because every element of k(t) has an expression of type p r u 
where u is a unit in /c[í] p . Thus we have determined all valuation rings of k(t) 
containing /c, and we see that the value group is cyclic. Such valuations will be 
called discrete and are studied in greater detail below. In view of Corollary 4.5, 
it follows that the valuation ring O of K is also discrete. 

The residue class field o/m is equal to /c[í]/p and is therefore a finite exten- 
sion of k. By Proposition 4.6, it follows that 0/SJÎ is finite over k (if denotes 
the maximal ideal of 0). 

Finally, we observe that there is only a finite number of valuation rings O 
of K containing k such that t lies in the maximal ideal of O. Indeed, such a 
valuation ring must lie above /c[í] p where p = (í) is the prime ideal generated by 
/, and we can apply Corollary 4.9. 


§5. COMPLETIONS AND VALUATIONS 

Throughout this section, we deal with a non-archimedean absolute value 
vona. field K. This absolute value is then a valuation, whose value group V K is a 
subgroup of the positive reals. We let o be its valuation ring, m the maximal ideal. 

Let us denote by K the completion of K at v, and let ô (resp. m) be the closure 
of o (resp. m) in K. By continuity, every element of ô has value ^ 1, and every 
element of K which is not in ô has value >1. If xsK then there exists an 
element ye K such that |x — y\ is very small, and hence |x| = \y \ for such an 
element y (by the non-archimedean property). Hence 6 is a valuation ring in 
K , and m is its maximal ideal. Furthermore, 

ô n K = o and m n K = m, 
and we have an isomorphism 

o/m A ô/m. 

Thus the residue class field o/m does not change under completion. 

Let E be an extension of K , and let o E be a valuation ring of E lying above o. 
Let m E be its maximal ideal. We assume that the valuation corresponding to o £ 
is in fact an absolute value, so that we can form the completion E. We then have 
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a commutative diagram: 


°£/ m £-♦ ô £ /m £ 


o/m —'—> ô/m 

the vertical arrows being injections, and the horizontal ones being isomorphisms. 
Thus the residue class field extension of our valuation can be studied over the 
completions E of K. 

We have a similar remark for the ramification index. Let T V (K) and r v (K) 
denote the value groups of our valuation on K and K respectively (i.e. the image 
of the map x i-> |x| for xe K* and xe K* respectively). We saw above that 
r„(/Q = T„(K); in other words, the value group is the same under completion, 
because ofthe non-archimedean property. (This is ofcourse false in the archime- 
dean case.) If E is again an extension of K and w is an absolute value of E 
extending v, then we have a commutative diagram 


r W (E) ■ 


r„(K) 


r w (£) 


r v (K) 


from which we see that the ramification index (r w (£): r v (K)) also does not 
change under completion. 


§6. DISCRETE VALUATIONS 

A valuation is called discrete if its value group is cyclic. In that case, the 
valuation is an absolute value (if we consider the value group as a subgroup of 
the positive reals). The p -adic valuation on the rational numbers is discrete for 
each prime number p. By Corollary 4.5, an extension ofa discrete valuation to a 
finite extension field is also discrete. Aside from the absolute values obtained 
by embedding a field into the reals or complex numbers, discrete valuations are 
the most important ones in practice. We shall make some remarks concerning 
them. 

Let v be a discrete valuation on a field K , and let o be its valuation ring. Let 
m be the maximal ideal. There exists an element n of m which is such that its 
value | n | generates the value group. (The other generator of the value group is 
| n~ 1 1.) Such an element n is called a local parameter for v (or for m). Every 
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element x of K can be written in the form 

x = un r 

with some unit u of o, and some integer r. Indeed, we have |x| = \n\ r = 1 7 r r | 
for some r e Z, whence x/n r is a unit in o. We call r the order of x at v. It is 
obviously independent of the choice of parameter selected. We also say that x 
has a zero of order r. (If r is negative, we say that x has a pole of order — r.) 

In particular, we see that m is a principal ideal, generated by n. As an exercise, 
we leave it to the reader to verify that every ideal of o is principal, and is a power 
of m. Furthermore, we observe that o is a factorial ring with exactly one prime 
element (up to units), namely n. 

If x, yeK, we shall write x~y if|x| = |jp|. Let n x ( i = 1, 2,...) be a 
sequence of elements of o such that n t — n'. Let R be a set of representatives of 
o/m in o. This means that the canonical map o -► o/m induces a bijection of R 
onto o/m. 

Assume that K is complete under our valuation. Then every element x of o can 

he written as a convergent series 


x = a 0 + a x n x + a 2 n 2 + • • • 

with a { e R, and the a x are uniquely determined hy x. 

This is easily proved by a recursive argument. Suppose we have written 

x = a 0 + • • • + a n n n (mod m " +1 ) 

then x — (a 0 + • • • + a n n n ) = n n+l y for some yeo. By hypothesis, we can 
write y = a n+1 + nz with some a n + 1 eR. From this we get 

x = a 0 + ••• + a n+1 n n+1 (mod m" + 2 ), 

and it is clear that the n-th term in our series tends to 0. Therefore our series 
converges(by the non-archimedean behavior!). The fact that R contains precisely 
one representative of each residue class mod m implies that the a x are uniquely 
determined. 

Examples. Consider first the case of the rational numbers with the /?-adic 
valuation v p . The completion is denoted by Q p . It is the field of p-adic numbers. 
The closure of Z in Q p is the ring of p- adic integers Z p . We note that the prime 
number p is a prime element in both Z and its closure Z p . We can select our set 
of representatives R to be the set of integers (0, 11). Thus every p- 
adic integer can be written uniquely as a convergent sum X ûìP' where a t is an 
integer, 0 ^ a x ^ p — 1. This sum is called its p -adic expansion. Such sums 
are added and multiplied in the ordinary manner for convergent series. 



XII, §6 


DISCRETE VALUATIONS 489 


For instance, we have the usual formalism of geometric series, and if we take 
p = 3, then 


-1 =^^3 = 2(1 + 3 + 3 2 + •••)• 

We note that the representatives (0, 1 ,..., p — 1 ) are by no means the only 
ones which can be used. In fact, it can be shown that Z p contains the (p — l)-th 
roots of unity, and it is often more convenient to select these roots of unity as 
representatives for the non-zero elements of the residue class field. 

Next consider the case of a rational field k(t\ where k is any field and t is 
transcendental over k. We have a valuation determined by the prime element t 
in the ring k[t]. This valuation is discrete, and the completion of k[í] under this 
valuation is the power series ring k[[í]]. In that case, we can take the elements 
of k itself as repersentatives of the residue class field, which is canonically 
isomorphic to k. The maximal ideal of /c[[í]] is the ideal generated by t. 

This situation amounts to an algebraization of the usual situation arising in 
the theory of complex variables. For instance, let z 0 be a point in the complex 
plane. Let o be the ring of functions which are holomorphic in some disc around 
z 0 . Then o is a discrete valuation ring, whose maximal ideal consists of those 
functions having a zero at z 0 . Every element of o has a power series expansion 

/(z) = £ a v (z - z 0 y. 

v = m 

The representatives of the residue class field can be taken to be complex numbers, 
a v . If a m / 0, then we say that / (z) has a zero of order m. The order is the same, 
whether viewed as order with respect to the discrete valuation in the algebraic 
sense, or the order in the sense ofthe theory ofcomplex variables. We can select a 
canonical uniformizing parameter namely z — z 0 , and 

f(z) = (z - z 0 ) m g(z) 

where g(z) is a power series beginning with a non-zero constant. Thus g(z) is 
invertible. 

Let K be again complete under a discrete valuation, and let E be a finite 
extension of K. Let o £ , m £ be the valuation ring and maximal ideal in E lying 
above o, m in K. Let m be a prime element in E. If T £ and V K are the value 
groups of the valuations in E and K respectively, and 

e = (r E :r K ) 


is the ramification index, then 


\n*\ = \n\ 
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and the elements 


U l n j 9 0 ^ i ^ e — ÌJ = 0, 1, 2,... 
have order je + i in E. 

Let o) í9 ..., (o f be elements of E such that their residue classes mod m £ from 
a basis of o £ /m £ . If R is as before a set of representatives of o/m in o, then the set 
consisting of all elements 


UìCOj + • • • + a f (D f 

with aj e R is a set of representatives of o £ /m £ in o £ . From this we see that every 
element of o £ admits a convergent expansion 

I Z Z 

i = 0 v = 1 j= 0 

Thus the elements {co v n 1 } form a set of generators of o £ as a module over o. 
On the other hand, we have seen in the proof of Proposition 4.6 that these 
elements are linearly independent over K. Hence we obtain: 

Proposition 6.1. Let K be complete under a discrete valuation. Let Ebe a 
finite extension of K , and let e , / be the ramification index and residue class 
degree respectively. Then 


ef = ÍE.Kl 

Coroiiary 6.2. Let cce E, a / 0. Let v be the valuation on K and w its 
extension to E. Then 


ord„ Nf(a) = f(w\v) ord w a. 
Proof. This is immediate from the formula 

\N E M\ = \0L\ ef 


and the definitions. 

Corollary 6.3. Let K be any field and v a discrete valuation on K. Let Ebea 
finite extension of K. Ifv is xvell behaved in E (for instance if E is separable 
over K\ then 


Z e(w | v)f (w | y) = [£:K]. 

w|f 


IfE is Galois over K , then all e w are equal to the same number e 9 all / w are 



XII, §7 


ZEROS OF POLYNOMIALS IN COMPLETE FIELDS 


491 


equal to the same number /, and so 

efr = ÍE:Kl 

xvhere r is the number of extensions of v to E. 

Proof Our first assertion comes from our assumption, and Proposition 3.3. 
If E is Galois over K , we know from Corollary 4.10 that any two valuations of E 
lying above v are conjugate. Hence all ramification indices are equal, and 
similarly for the residue class degrees. Our relation efr = [£: k] is then 
obvious. 


§7. ZEROS OF POLYNOMIALS IN 
COMPLETE FIELDS 


Let K be complete under a non-trivial absolute value. 
Let 


f(X) = F1 (X - a 


be a polynomial in K[X~\ having leading coefíìcient 1, and assume the roots a f 
are distinct, with multiplicities r f . Let d be the degree of /. Let g be another 
polynomial with coefficients in K a , and assume that the degree of g is also d , and 
that g has leading coefficient 1. We let | g | be the maximum of the absolute values 
of the coefficients of g. One sees easily that if \g\ is bounded, then the absolute 
values of the roots of g are also bounded. 

Suppose that g comes close to /, in the sense that |/ — g\ is small. If / is 
any root of g , then 

\m - gm = \m\ = n i«. - /*r 

is small, and hence /J must come close to some root of /. As / comes close to 
say a = a^, its distance from the other roots of / approaches the distance of a t 
from the other roots, and is therefore bounded from below. In that case, we say 
that P belongs to a. 

Propositíon 7.1. Ifg is'sufficiently close to /, and fi u ..., are the roots of g 

belonging to a ( counting multiplicities ), then s = r^ is the multiplicity of a in /. 

Proof Assume the contrary. Then we can find a sequence g v of poly- 
nomials approaching / with precisely 5 roots j8\ v) ,..., p[ v) belonging to a, but 
with 5 # r. (We can take the same multiplicity s since there is only a finite 
number ofchoices for such multiplicities.) Furthermore, the other roots of# also 
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belong to roots of/, and we may suppose that these roots are bunched together, 
according to which root of/they belong to. Since lim g v = /, we conclude that a 
must have multiplicity s in /, contradiction. 

Next we investigate conditions under which a polynomial has a root in a 
complete field. 

We assume that K is complete under a discrete valuation , mth valuation ring o, 
maximal ideal p. We let n be a fixed prime element of p. 

We shall deal with n-space over o. We denote a vector (a l9 ... 9 a n ) with 
e o by A. If f(X l9 ..., X n ) e o[A"] is a polynomial in n variables, with integral 
coefficients, we shall say that A is a zero of / if / (A) = 0, and we say that A is a 
zero of / mod p m if / (A) = 0 (mod p m ). 

Let C = (c 09 ... 9 c n ) be in o ( " +1} . Let m be an integer ^ 1. We consider the 
nature of the solutions of a congruence of type 

(*) n m (c 0 + CjX! + • • • + c n x n ) = 0 (mod p m+l ). 

This congruence is equivalent with the linear congruence 

(**) c 0 + c^x! + • • • + c n x n = 0 (mod p). 

If some coefficient c x (i = 1,..., n) is not = 0 (mod p), then the set of solutions is 
not empty, and has the usual structure of a solution of one inhomogeneous 
linear equation over the fìeld o/p. In particular, it has dimension n — 1. 
A congruence (*) or (**) with some c t # 0 (mod p) will be called a proper 
congruence. 

As a matter of notation, we write Z/ / for the formal partial derivative of / 
with respect to X t . We write 

grad f(X) = (DJ(X) 9 ... 9 D n f(X)). 

Proposition 7.2. Let f(X) e o\_X~\. Let r be an integer ^ 1 and let A e o (n) be 
such that 


f (A) = 0 (mod p 2r ~ l ) 9 

Díf(A) = 0 (modp'" 1 ), forall i = 1.n, 

DJ(A) ^ 0 (mod p r ), for some i = 1 9 ... 9 n. 

Let v be an integer ^ 0 and let B e o (n) be such that 

B = A (mod p r ) and f(B) = 0 (mod p 2r-1 + v ). 
A vector Y e o (n) satisfies 

Y = B (mod p r + v ) and f(Y) = 0 (mod p 2r+v ) 
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if and only if Y can be written in the form Y = B + n r+v C 9 with some C 6 o (n) 
satisfying the proper congruence 

f(B) + n r+v grad f(B) C = 0 (mod p 2r+v ). 

Proof The proof is shorter than the statement of the proposition. Write 
Y = B + n r+v C. By Taylor’s expansion, 

f(B + n r+v C) = f(B) + n r+v grad f(B) • C (mod p 2r+2v ). 

To solve this last congruence mod p 2r+v , we obtain a proper congruence by 
hypothesis, because grad f(B) = grad f(A) = 0 (mod p r ~ l ). 

Corollary 7.3. Assumptions being as in Proposition 7.2, there exists a zero 
of f in o in) which is congruent to A mod p r . 

Proof. We can write this zero as a convergent sum 

A + n r+ì C x + 7r r + 2 C 2 + ... 

solving for C ì9 C 2 ,... inductively as in the proposition. 

Corollary 7.4. Let f be a polynomial in one variable in o[X], and let a e o 
be such that f(a) = 0 (mod p) but f'(a) # 0 (mod p). Then there exists 
b e o, b = a (mod p) such that f(b) = 0. 

Proof Take n = 1 and r = 1 in the proposition, and apply Corollary 7.3. 

Coroiiary 7.5. Let mbe a positive integer not divisible by the characteristic 
of K. There exists an integer r such that for any a e o, a = 1 (mod p r ), the 
equation X m — a = 0 has a root in K. 

Proof. Apply the proposition. 

Example. In the 2-adic field Q 2 , there exists a square root of —7, i.e. 
sf—ï e Q 2 , because —7 = 1—8. 

When the absolute value is not discrete, it is still possible to formulate a 
criterion for a polynomial to have a zero by Newton approximation. (Cf. my 
paper, “On quasi-algebraic closure,” Annals ofMath. (1952) pp. 373-390. 

Proposition 7.6. Let K be a complete under a non-archimedean absolute 
value (nontrivial). Let o be the valuation ring and let f(X) e o[2Q be a poly - 
nomial in one variable. Let a 0 e o be such that 

l/(«o)l < I / ( a o) 2 1 

(here f' denotes the formal derivative of f). Then the sequence 


ai+1_ai /'(«,) 
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converges to a root a of f in o, and we have 

/(«o) 


|a - «ol ^ 


< 1 . 


| /'(«o ) 2 

Proof. Let c = |/(a 0 )//'(a 0 ) 2 1 < 1. We show inductively that: 

1. |«f| ^ 1, 

2. | a ( - a 0 | ^ c. 


3. 


/(«,•) 


/w 


< c 2 ‘. 


These three conditions obviously imply our proposition. If i = 0, they are 
hypotheses. By induction, assume them for i. Then: 

1. |/(a/)//'( a i) 2 1 á c 2 ‘ gives |a í+1 - a t -| ^ c 2 ' < 1, whence \<x i+ j | ^ 1. 

2. |a i+1 - a 0 | ^ max{|a i+1 - a f |, |a f - a 0 |} = c. 


3. By Taylor’s expansion, we have 


/(«,) ^ Jm \ 2 


/(a f+1 ) = /(a f )-/'(a f )^ + /?| 

for some /? g o, and this is less than or equal to 

/(«*) 12 


//(«.: 
\/'(«,)/ 


/'(«.) 

in absolute value. 

Using Taylor’s expansion on /'(a I + 0 we conclude that 

|/'(a í+1 )| = |/'(a/)|. 

From this we get 


/(*i+i) 


/'(«<•+ 1) 2 


< c 


2* + 1 


as desired. 


The technique of the proposition is also useful when dealing with rings, say a 
local ring o with maximal ideal m such that m r = 0 for some integer r > 0. 
If one has a polynomial / in o[A"] and an approximate root a 0 such that 

/'(a 0 ) # 0 mod m, 

then the Newton approximation sequence shows how to refine a 0 to a root of/. 


Example in several variables. Let K be complete under a non-archimedean 
absolute value. Let f(X { , ..., X„+.) E K[X\ be a polynomial with coefficients 
in K. Let (a h . .., a n , b) e K n+Ì . Assume that f(a, b) = 0. Let D n+Ì be the 
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partial derivative with respect to the (n + ì)-th variable, and assume that 
D n +\f(a, b) + 0. Let (â) e K n be sufficiently close to (a). Then there exists an 
element b of K close to b such that f(â, b) = 0. 

This statement is an immediate corollary of Proposition 7.6. By multiplying 
all a h b by a suitable non-zero element of K one can change them to elements 
of o. Changing the variables accordingly, one may assume without loss of gen- 
erality that a h b e o, and the condition on the partial derivative not vanishing 
is preserved. Hence Proposition 7.6 may be applied. After perturbing (a) to 
(â), the element b becomes an approximate solution of f(â, X). As (à) approaches 
(a), f(â, b) approaches 0 and D n+Ì f(â, b) approaches D n+Ì f(a, b) + 0. 
Hence for (â) sufficiently close to (a), the conditions of Proposition 7.6 are 
satisfied, and one may refine b to a root of f(â, X ), thus proving the assertion. 

The result was used in a key way in my paper “On Quasi Algebraic Closure”. 
It is the analogue of Theorem 3.6 of Chapter XI, for real fields. 

In the language of algebraic geometry (which we now assume), the result 
can be reformulated as follows. Let V be a variety defined over K. Let P be a 
simple point of V in K. Then there is a whole neighborhood of simple points of 
V in K. Especially, suppose that V is defined by a finite number of polynomial 
equations over a finitely generated field k over the prime field. After a suitable 
projection, one may assume that the variety is affine, and defined by one equa- 
tion f(X \,. . . , X n+Ì ) = 0 as in the above statement, and that the point is 
P = («!,..., a n , b) as above. One can then select = x, close to a t but such 
that (x\,. . . , x n ) are algebraically independent over k. Let y bç the refinement 
of b such that f(x, y) = 0. Then (x, y) is a generic point of V over k, and the 
coordinates of (x, y) lie in K. In geometric terms, this means that the function 
field of the variety can be embedded in K over k, just as Theorem 3.6 of Chapter 
XI gave the similar result for an embedding in a real closed field, e.g. the real 
numbers. 


EXERCISES 

1. (a) Let K be a field with a valuation. If 

/(X) = û 0 + û 1 X + ..- + a n X" 

is a polynomial in K[X~\, defìne \j \ to be the max on the values \a t \(i = 0,..., n). 
Show that this defines an extension of the valuation to K[X\, and also that the 
valuation can be extended to the rational fìeld K(X). How is Gauss’ lemma a 
special case of the above statement ? Generalize to polynomials in several variables. 
(b) Let f be a polynomial with complex coefficients. Define | /1 to be the maximum 
of the absolute values of the coefficients. Let d be an integer ^ 1. Show that 
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there exist constants C l9 C 2 (depending only on d) such that, if /, g are polynomials 
in C[X~\ of degrees ^ d , then 

CM\\g\^\fg\^C 2 \f\\gÌ 

[Hint: Induction on the number of factors of degree 1. Note that the right 
inequality is trivial.] 

2. Let M q be the set of absolute values consisting of the ordinary absolute value and all 
p-adic absolute values v p on the field of rational numbers Q. Show that for any rational 
number a e Q, a 0, we have 


n m»= i. 

vbMq 

If K is a finite extension of Q, and M K denotes the set of absolute values on K extending 
those of M q , and for each vv e M K we let N w be the local degree [K w : QJ, show that 
for a e K, a ^ 0, we have 


n i«iw w = i- 

we Mk 


3. Show that the p-adic numbers Q p have no automorphisms other than the identity. 
[Hint: Show that such automorphisms are continuous for the p-adic topology. Use 
Corollary 7.5 as an algebraic characterization of elements close to 1.] 

4. Let A be a principal entire ring, and let K be its quotient field. Let o be a valuation ring 
of K containing A , and assume o # K. Show that o is the local ring A {p) for some prime 
element p. [This applies both to the ring Z and to a polynomial ring k[X\ over a field /c.] 

5. Let A be an entire ring, and let K be its quotient field. Assume that every finitely 
generated ideal of A is principal. Let o be a discrete valuation ring of K containing A. 
Show that o = A ip) for some element p of A, and that p is a generator of the maximal 
ideal of o. 

6. Let Q p be a p-adic field. Show that Q p contains infinitely many quadratic fields of 
type Q(>/ —m), where m is a positive integer. 

7. Show that the ring of p-adic integers Z p is compact. Show that the group of units in Z p 
is compact. 

8. If K is a field complete with respect to a discrete valuation, with finite residue class field, 
and if o is the ring of elements of K whose orders are g 0, show that o is compact. Show 
that the group of units of o is closed in o and is compact. 

9. Let K be a field complete with respect to a discrete valuation, let o be the ring of integers 
of K , and assume that o is compact. Let f l9 / 2 , ... be a sequence of polynomials in n 
variables, with coefficients in o. Assume that all these polynomials have degree ^ d, 
and that they converge to a polynomial / (i.e. that | / — /] -► 0 as i -► oo). If each/ has 
a zero in o, show that / has a zero in o. If the polynomials / are homogeneous ofdegree 
d, and if each /, has a non-trivial zero in o, show that / has a non-trivial zero in o. [Hint : 
Use the compactness of o and of the units of o for the homogeneous case.] 

(For applications of this exercise, and also of Proposition 7.6, cf. my paper “On 
quasi-algebraic closure,” Annals ofMath ., 55 (1952), pp. 412-444.) 
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10. Show that if p, p' are two distinct prime numbers, then the fields Q p and Q p , are not 
isomorphic. 

11. ProvethatthefieldQ p containsall(p — l)-th roots ofunity. [ifmí:UseProposition7.6, 
applied to the polynomial X p ~ 1 — 1 which splits into factors of degree 1 in the residue 
class field.] Show that two distinct (p — l)-th roots of unity cannot be congruent mod p. 

12. (a) Let/(X) be a polynomial of degree 1 in Z[X]. Show that the values f(a) for 

a E Z are divisible by infinitely many primes. 

(b) Let F be a finite extension of Q. Show that there are infinitely many primes p 
such that all conjugates of F (in an algebraic closure of Q p ) actually are contained 
in Q p . [Hint : Use the irreducible polynomial of a generator for a Galois extension 
of Q containing F.] 

13. Let K be a field of characteristic 0, complete with respect to a non-archimedean absolute 
value. Show that the series 


x x 

exp(x) =1+x + 2 ! + 3 T + '" 
x 2 x 3 

l0g(l +X) = X- y + y- 

converge in some neighborhood of 0. (The main problem arises when the characteristic 
of the residue class field is p > 0, so that p divides the denominators nl and n. Get an 
expression which determines the power of p occurring in n !.) Prove that the exp and 
log give mappings inverse to each other, from a neighborhood of 0 to a neighborhood 
of 1. 

14. Let K be as in the preceding exercise, of characteristic 0, complete with respect to a non- 
archimedean absolute value. For every integer n > 0, show that the usual binomial 
expansion for (1 -l- x) 1/n converges in some neighborhood of 0. Do this first assuming 
that the characteristic of the residue class field does not divide n, in which case the asser- 
tion is much simpler to prove. 

15. Let F be a complete fìeld with respect to a discrete valuation, let o be the valuation ring, 
7i a prime element, and assume that o/(n) = k. Prove that if a, b e o and a = b (mod n r ) 
with r > 0 then a pn = b pn (mod n r+n ) for all integers n ^ 0. 

16. Let F be as above. Show that there exists a system of representatives R for o/(n) in o 
such that R p = R and that this system is unique (Teichmiiller). [Hint : Let a be a residue 
class in k. For each v ^ 0 let a v be a representative in o of a p and show that the 
sequence a pv converges for v -> oo, and in fact converges to a representative a of a, 
independent of the choices of a v .] Show that the system of representatives R thus 
obtained is closed under multiplication, and that if F has characteristic p, then R is 
closed under addition, and is isomorphic to k. 

17. (a) (Witt vectors again). Let Iç be a perfect field of characteristic p. We use the 

Witt vectors as described in the exercises of Chapter VI. One can define an 
absolute value on W(k ), namely \x\ = p~ r if x r is the first non-zero component 
of x. Show that this is an absolute value, obviously discrete, defined on the ring, 
and which can be extended at once to the quotient field. Show that this quotient 
field is complete, and note that W(k) is the valuation ring. The maximal ideal 
consists of those x such that x 0 = 0, i.e. is equal to pW(k). 



498 ABSOLUTE VALUES 


XII, Ex 


(b) Assume that F has characteristic 0. Map each vector x e W(k) on the element 

Iff'V 

where is a representative of x, in the special system of Exercise 15. Show that 
this map is an embedding of W(k) ìnto o. 

18. (Local uniformization). Let k be a fìeld, K a fìnitely generated extension of transcendence 
degree 1, and o a discrete valuation ring of K over /c, with maximal ideal m. Assume that 
o/m = k. Let x be a generator of m, and assume that K is separable over k(x). Show that 
there exists an element y e o such that K = /c(x, y), and also having the following 
property. Let cp be the place on K determined by o. Let a = <p(x), b = cp(y) (of course 
a = 0). Let f(X , Y) be the irreducible polynomial in k[X , 7] such that /(x, y) = 0. 
Then D 2 / (a , b) ^ 0. [Hint : Write fìrst K = /c(x, z) where z is integral over /c[x]. Let 
z = Zj,..., z„(n ^ 2) be the conjugates of z over /c(x), and extend o to a valuation 
ring O of /c(x, z l9 ..., z„). Let 

z = a 0 + a x x + ••• + a r x r + ••• 

be the power series expansion of z with a { e /c, and let P r (x) = a 0 + • • • + a r x r . For 
i = 1,..., n let 


- PÁx) 


Taking r large enough, show that yi has no pole at O but y 2 ,..., y n have poles at O. 
The elements y í9 ..., y n are conjugate over k(x). Let f(X , T) be the irreducible poly- 
nomial of (x, y) over k. Then /(x, Y) = i p n (x)Y n + • • • + ip 0 (x) with ^,(x)/c[x]. We 
may also assume </<,(0) / 0 (since / is irreducible). Write /(x, T) in the form 


/(x, y) = ip n (x)y 2 - y n (Y- yi )(y 2 1 y - i) • • • ( y ; 1 y - 1). 


Show that ip n (x)y 2 • • • y n = u does not have a pole at €5. If vv e O, let w denote its residue 
class modulo the maximal ideal of O. Then 

0*m Y) = (-iy- l ù(Y - yi ). 

Let y = y l9 ŷ = b. We fìnd that D 2 f(a , b) = (—1)"" l ù ^ 0.] 

19. Prove the converse of Exercise 17, i.e. if K = /c(x, y), f(X , T) is the irreducible poly- 
nomial of (x, y) over /c, and if a,bek are such that f(a , b) = 0, but D 2 f(a , b) ^ 0, 
then there exists a unique valuation ring o of K with maximal ideal m such that x = a 
and y = b (mod m). Furthermore, o/m = /c, and x — a is a generator of m. [Hint : 
If g(x , y) e /c[x, ý] is such that g(a , ò) = 0, show that gf(x, y) = (x — a)/l(x, y)/B(x , y) 
where ^4, B are polynomials such that £(a, /?) ^ 0. If A(a , b) = 0 repeat the process. 
Show that the process cannot be repeated indefìnitely, and leads to a proof of the desired 
assertion.] 

20. (Iss’sa-Hironaka Ann. ofMath 83 (1966), pp. 34-46). This exercise requires a good 
working knowledge of complex variables. Let K be the field of meromorphic functions 
on the complex plane C. Let O be a discrete valuation ring of K (containing the 
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constants C). Shovv that the function z is in ©. [Hint: Let a lf a 2ì .. . be a discrete 
sequence of complex numbers tending to infìnity, for instance the positive integers. 
Let v l9 v 2 , . •., be a sequence of integers, 0 ^ ^ p - 1, for some prime number 

p, such that 2 ViP' is not the p-adic expansion of a rational number. Let/be an entire 
function having a zero of order v t p' at a, for each i and no other zero. If z is not in 
o, consider the quotient 



fl (2 - a,) v,p ' 


i= 1 

From the Weierstrass factorization of an entire function, show that g(z) = /i(z) p " for 
some entire function h(z). Now analyze the zero of g at the discrete valuation of o in 
terms of that of / and f] ( z — ûi) VipI t0 g et a contradiction.] 

If U is a non-compact Riemann surface, and L is the field of meromorphic functions 
on U, and if o is a discrete valuation ring of L containing the constants, show that every 
holomorphic function cponU lies in o. \_Hint : Map cp : U -* C, and get a discrete valua- 
tion of K by composing cp with meromorphic functions on C. Apply the first part of the 
exercise.] Show that the valuation ring is the one associated with a complex number. 
\Further hint : If you don’t know about Riemann surfaces, do it for the complex plane. 
For each zeU, let f z be a function holomorphic on U and having only a zero of order 1 
at z. If for some z 0 the function f ZQ has order ^ 1 at o, then show that o is the valuation 
ring associated with z 0 . Otherwise, every function f z has order 0 at o. Conclude that the 
valuation of o is trivial on any holomorphic function by a limit trick analogous to that 
of the fìrst part of the exercise.] 




Part Three 

LINEAR ALGEBRA 

and 

REPRESENTATIONS 


We shall be concerned with modules and vector spaces, going into their 
structure under various points of view. The main theme here is to study a pair, 
consisting of a module, and an endomorphism, or a ring of endomorphisms, 
and try to decompose this pair into a direct sum of components whose structure 
can then be described explicitly. The direct sum theme recurs in every chapter. 
Sometimes, we use a duality to obtain our direct sum decomposition relative 
to a pairing, and sometimes we get our decomposition directly. If a module 
refuses to decompose into a direct sum of simple components, then there is no 
choice but to apply the Grothendieck construction and see what can be ob- 
tained from it. 

The extension theme occurs only once, in Witt’s theorem, in a brief counter- 
point to the decomposition theme. 
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CHAPTER XIII 

Matrices and Linear Maps 


Presumably readers of this chapter will have had some basic acquaintance 
with linear algebra in elementary courses. We go beyond such courses by pointing 
out that a lot of results hold for free modules over a commutative ring. This is 
useful when one wants to deal with families of linear maps, and reduction modulo 
an ideal. 

Note that §8 and §9 give examples of group theory in the context of linear 
groups. 

Throughout this chapter, we let R be a commutative ring, and we let 
E , F be R- modules. We suppress the prefix R in front of linear maps and 
modules. 


§1. MATRICES 

By an m x n matrix in R one means a doubly indexed family of elements 
of R 9 (dij), (i = 1,..., m and j = 1,..., n), usually written in the form 

( a u 

a ml 

We call the elements a^ the coefficients or components of the matrix. A 
1 x n matrix is called a row vector(of dimension, or size, n ) and a m x 1 matrix 
is called a column vector (of dimension, or size, m). In general, we say that 
(m, n) is the size of the matrix, or also m x n. 

We define addition for matrices of the same size by components. If A = (a tJ ) 
and B = (b 0 ) are matrices of the same size, we define A H- B to be the matrix 
whose ij- component is + b^. Addition is obviously associative. We define 
the multiplication of a matrix A by an element c e R to be the matrix (ca^), 
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whose (/-component is ca i7 . Then the set of m x n matrices in R is a module 
(i.e. an R-module). 

We define the product AB of two matrices only under certain conditions. 
Namely, when A has size (m, n) and B has size (n, r), i.e. only when the size of 
the rows of A is the same as the size of the columns of B. If that is the case, let 
A = (dij) and let B = ( b jk ). We define AB to be the m x r matrix whose ik- 
component is 

j = 1 

If A, B , C are matrices such that AB is defined and BC is defined , then so is 

(AB)C and A(BC) and we have 

(AB)C = A(BC). 

This is trivial to prove. If C = ( c kì ), then the reader will see at once that the 
//-component of either of the above products is equal to 

X X a ijbjk c ki • 
j k 

An m x n matrix is said to be a square matrix if m = n. For example, a 
1 x 1 matrix is a square matrix, and will sometimes be identified with the 
element of R occurring as its single component. 

For a given integer n ^ 1 the set of square n x n matrices forms a ring. 

This is again trivially verified and will be left to the reader. 

The unit element of the ring of n x n matrices is the matrix 



whose components are equal to 0 except on the diagonal, in which case they 
are equal to 1. We sometimes write / instead of /„. 

If A = (aij) is a square matrix, we define in general its diagonal components 
to be the elements a fi . 

We have a natural ring-homomorphism of R into the ring of n x n matrices, 
given by 

c\-+cl n . 

Thus cl n is the square n x n matrix having all its components equal to 0 except 
the diagonal components, which are equal to c. Let us denote the ring ofn x n 



XIII, §1 


MATRICES 505 


matrices in R by Mat„(K). Then Mat n (R) is an algebra over R (with respect to 
the above homomorphism). 

Let A = (aij) be an m x n matrix. We define its transpose 'A to be the matrix 
(aji) (j = 1and í = 1,. .. , m). Then 'A is an n x m matrix. The reader 
will verify at once that if A, B are of the same size, then 

\A + B) = r A + '£. 

If c g K then r (cA) = c M. If A, B can be multiplied, then f B Í4 is defined and we 
have 


\AB) = ^B'A. 

We note the operations on matrices commute with homomorphisms. More 
precisely, let cp: R -* R' be a ring-homomorphism. If A, B are matrices in R, 
we define cpA to be the matrix obtained by applying cp to all the components of 
A. Then 

cp(A + B) = cpA + cpB , cp(AB) = (cpA)(cpB\ (p(cA) = cp(c)cpA , 

<P('A) = V(A). 

A similar remark will hold throughout our discussion of matrices (for 
instance in the next section). 

Let A = (dij) be a square n x n matrix in a commutative ring R. We define 
the trace of A to be 


tr(^) = £ a u ; 

i= 1 


in other words, the trace is the sum of the diagonal elements. 
If A, B are n x n matrices, then 

tr(AB) = tr (BA). 


Indeed, if A = (a l7 ) and B = (b 0 ) then 

tr(AB) = XI a.vfcv, = tr(B/t). 

i V 

/4s an application, we observe that if B is an invertible n x n matrix, then 

tr(B~ ^AB) = 


Indeed, tr (B~ l AB) = tr (ABB~ l ) = tr(^). 
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§2. THE RANK OF A MATRIX 

Let k be a field and let A be an m x n matrix in k. By the row rank of A we 
shall mean the maximum number of linearly independent rows of A, and by the 
column rank of A we shall mean the maximum number of linearly independent 
columns of A. Thus these ranks are the dimensions of the vector spaces gen- 
erated respectively by the rows of A and the columns of A. We contend that 
these ranks are equal to the same number, and we define the rank of A to be 
that number. 

Let A 1 ,..., A n be the columns of A, and let A u ..., A m be the rows of A. 
Let l X = (x í9 ..., x m ) have components x { e k. We have a linear map 

X * * X\Ai + • • • + x m A m 

of /c (m) onto the space generated by the row vectors. Let W be its kernel. Then 
W is a subspace of k {m) and 

dim W + row rank = m. 

If Y is a column vector of dimension m, then the map 
(X, Y)y-+'XY = X- Y 

is a bilinear map into k, if we view the 1 x 1 matrix l XY as an element of k. 
We observe that W is the orthogonal space to the column vectors A 1 ,..., A n , 
i.e. it is the space of all X such that X • A J = 0 for ally = 1,..., n. By the duality 
theorem of Chapter III, we know that /c (m) is its own dual under the pairing 

{X, Y)^XY 

and that k (m) /W is dual to the space generated by A 1 ,..., A n . Hence 
dim k (m) /W = column rank, 


or 


dim W + column rank = m. 
From this we conclude that 


column rank = row rank, 

as desired. 

We note that W may be viewed as the space of solutions of the system of n 
linear equations 


x i^i + • * * + x mA m = 0 , 
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in m unknowns x í9 Indeed,ifwewriteouttheprecedingvectorequation 

in terms of all the coordinates, we get the usual system of n linear equations. 
We let the reader do this if he or she wishes. 


§3. MATRICES AND LINEAR MAPS 

Let E be a module, and assume that there exists a basis (B = {Ç ,..., Ç n } 
for E over R. This means that every element of E has a unique expression as a 
linear combination 


X "L * ‘ * "P Xn£n 


with Xi e R. We call (x^,..., x n ) the components of x with respect to the basis. 
We may view this n-tuple as a row vector. We shall denote by X the transpose 
of the row vector (x^,..., x„). We call X the column vector of x with respect to 
the basis. 

We observe that if {Ç\,..., Ç m } is another basis of E over R , then m = n. 
Indeed, let p be a maximal ideal of R. Then E/pE is a vector space over the 
field R/pR , and it is immediately clear that if we denote by Ç the residue class 
of Ç mod pE, then {Ç l9 ..., | w } is a basis for E/pE over R/pR. Hence n is also 
the dimension of this vector space, and we know the invariance of the cardinality 
for bases of vector spaces over fields. Thus m = n. We shall call n the dimension 
of the module E over R. 

We shall view R (n) as the module of column vectors of size n. It is a free 
module of dimension n over R. It has a basis consisting of the unit vectors 
e 1 ,..., e n such that 


V = (0,...,0, 1,0, ...,0) 

has components 0 except for its i-th component, which is equal to 1. 
An m x n matrix A gives rise to a linear map 

L a : R in) - R im) 


by the rule 


X^AX. 

Namely, we have A(X + Y) = AX + AY and A(cX) = cAX for column 
vectors X , Y and c e R. 
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The above considerations can be extended to a slightly more general 
context, which can be very useful. Let E be an abelian group and assume that 
R is a commutative subring of 

End z (£) = Hom z (£, E). 

Then E is an K-module. Furthermore, if A is an m x n matrix in R, then we get 
a linear map 


L 4: E {n) -► E im) 

defined by a rule similar to the above, namely X i—► AX. However, this has to 
be interpreted in the obvious way. If A = (a u ) and X is a column vector of 
elements of E , then 


AX = 




n 

where = X a u x j- 


j=i 

If A , B are matrices in whose product is defined, then for any c e R we 
have 


Lab — L a L b and L CÁ — cL A . 

Thus we have associativity, namely 

A(BX) = 04B)*. 

An arbitrary commutative ring R may be viewed as a module over itself. 
In this way we recover the special case of our map from R (n) into R {m) . Further- 
more, if E is a module over R , then R may be viewed as a ring of endomorphisms 
of E. 


Proposition 3.1. Let E be a free module over R , and let {x^,..., x„} be a 
basis. Let y u ..., y n be elements of E. Let A be the matrix in R such that 



Then {y í9 ..., y n } is a basis of E if and only if A is invertible. 


Proof. Let X , Y be the column vectors of our elements. Then AX = Y. 
Suppose Y is a basis. Then there exists a matrix C in R such that CY = X. 
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Then CAX = X , whence CA = I and A is invertible. Conversely, assume that 
A is invertible. Then X = A~ l Y and hence x u are in the module 

generated by y í9 . .., y n . Suppose that we have a relation 

biyi + ••• + b n y n = 0 

with bi e R. Let B be the row vector (b u . .., b n ). Then 


B7 = 0 


and hence BAX = 0. But {x t ,..., x„} is a basis. Hence BA = 0, and hence 
BAA ~ 1 = B = 0. This proves that the components of 7 are linearly indepen- 
dent over R , and proves our proposition. 


We return to our situation of modules over an arbitrary commutative 
ring R. 

Let E, F be modules. We shall see how we can associate a matrix with a 
linear map whenever bases of E and F are given. We assume that E, F are free. 
Welet (B = {fj,. . . , £ n }and <B' = {fi,..., f„Jbebasesof£andFrespectively. 
Let 


f.E^F 

be a linear map. There exist unique elements £ R such that 
f(ti) = a ll ? i +~- + a ml ? n9 

y (^n) = ^ln £ 1 + ’ * * + ^mn ^m» 


or in other words, 


/«j) = Z 

î — 1 

(Observe that the sum is over the /rsí index.) We define 

Mi<f) = 0fly)• 

If x = x^! + • • • + x„^„ is expressed in terms of the basis, let us denote the 
column vector X of components of x by M ( g(x). We see that 

MAfM) = M«(/)M,(x). 

In other words, if A'' is the column vector of f (x), and M is the matrix associated 
with/ then X' = MX. Thus the operation of the linear map is reflected by the 
matrix multiplication, and we have f = L M . 
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Proposition 3.2. Let E , F, D be modules , and let ®, ®', ®" be/niíe bases 
of E , F, D, respectively. Let 

£4 £ 4 D 


be linear maps. Then 


mUq o /) = MUg)Ml(f). 

Proof. Let /1 and B be the matrices associated with the maps /, g respec- 
tively, with respect to our given bases. If X is the column vector associated with 
x e E, the vector associated with g(f(x)) is B(AX) = (BA)X. Hence BA is the 
matrix associated with g ° f. This proves what we wanted. 

CoroIIary 3.3. Let E = F. Then 


M%. (id)MS (id) = Af8:(id) - I. 

Each matrix M% (id) is invertible (i.e. is a unit in the ring of matrices). 

Proof. Obvious. 

Corollary 3.4. Let N = M%-( id). Then 

Ml(f) = Ml(ìâ)M%(f)M%(iâ) = NM«(f)N-\ 

Proof. Obvious 

CoroIIary 3.5. Let E be a free module of dimension n over R. Let ® be a 
basis of E over R. The map 


f i—» M®(/) 

is a ring-isomorphism of the ring of endomorphisms of E onto the ring ofnxn 
matrices in R. Infact , the isomorphism is one of algebras over R. 

We shall call the matrix M®(/) the matrix associated with/with respect to 
the basis ®. 

Let E be a free module of dimension n over R. By GL(E) or Aut^F) one 
means the group of linear automorphisms of E. It is the group of units in 
Endj^F). By GL n (R) one means the group of invertible n x n matrices in R. 
Once a basis is selected for E over R , we have a group-isomorphism 

GL(E)<-+GL n (R) 


with respect to this basis. 
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Let £ be as above. If 


/:£-►£ 

is a linear map, we select a basis ® and let M be the matrix associated with / 
relative to ®. We define the trace of / to be the trace of Aí, thus 

tr (/) = tr(M). 

If M' is the matrix of / with respect to another basis, then there exists an in- 
vertible matrix N such that M' = N~ l MN , and hence the trace is independent 
of the choice of basis. 


§4. DETERMINANTS 

Let fij, F be modules. A map 

/:£j x •• x £„-►£ 

is said to be F-muItilinear (or simply multilinear) if it is linear in each variable, 
i.e. if for every index i and elements x í9 ..., x,-_ l5 x í+15 ..., x„, Xj e Ej, the map 

x*->f(x u ..., x f _ j, x, x í + j,..., x„) 

is a linear map of E { into F. 

A multilinear map defined on an n-fold product is also called n-multilinear. 
If = • • • = E n = £, we also say that / is a multilinear map on £, instead of 
saying that it is multilinear on £ (n) . 

Let / be an n-multilinear map. If we take two indices i, j and i # j then 
fixing all the variables except the i-th and j -th variable, we can view / as a 
bilinear map on £ f x Ej. 

Assume that E x = ••=£„ = £. We say that the multilinear map / is 
alternating if/(xj,..., x n ) — 0 whenever there exists an index i, 1 ^ i á n — 1, 
such that x t = x i+1 (in other words, when two adjacent elements are equal). 

Proposition 4.1. Let f be an n-multilinear alternating map on £. Let 
x 1 ,...,x w e£. Then 

/(. . . , X,-, X,- + l5 . . .) = /(• • • > X l + l5 Xj, . . .). 

/n oí/ier vvords, vv/ien vve interchange two adjacent arguments of f, the value 
of f changes by a sign. If x, = x ; /or i # j then f(x í9 ..., x n ) = 0. 
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Proof Restricting our attention to the factors in the i-th and j -th place, with 
j = i + 1, we may assume/is bilinear for the first statement. Then for all x, 
y b E we have 


o = f(x + y, x + y) = f(x, y) + f(y, x). 

This proves what we want, namely /(y, x) = — /(x, y). For the second asser- 
tion, we can interchange successively adjacent arguments of/ until we obtain 
an n-tuple of elements of E having two equal adjacent arguments. This shows 
that when x f = x jy i # / then f(x u ..., x„) = 0. 

Corollary 4.2. Let f be an n-multilinear alternating map on E. Let 
Xj,..., x„ e E. Let i # j and let ae R. Then the value of f on (x^,..., x„) 
does not change if we replace x f by x, + axj and leave all other components 
fixed. 

Proof. Obvious. 

A multilinear alternating map taking its value in R is called a multilinear 
alternating form. 

On repeated occasions we shall evaluate multilinear alternating maps on 
linear combinations of elements of E. Let 

vvj = a xl v x + ••• + a ín v n , 

= <* n iVi + ••• + a nn v n . 

Let f be n-multilinear alternating on E. Then 

f(w u ...,w n ) = ffauV! + ■■■ + a ln v n ,..., a ni Vi +■■■ + a nn v n ). 

We expand this by multilinearity, and get a sum of terms of type 


a l,<r(l) * * * a n,<T(n)f( V <T( 1 )> • • • » V a{n j), 

where <x ranges over arbitrary maps of {1,..., n} into itself. If <x is not a bijection 
(i.e. a permutation), then two arguments v a{i) and v aU) are equal for i ^ / and 
the term is equal to 0. Hence we may restrict our sum to permutations <x. 
Shuffling back the elements (i^d, ..., v a{n) ) to their standard ordering and using 
Proposition 4.1, we see that we have obtained the following expansion: 

Lemma 4.3. // Wj,..., w w are as above , then 

f(w u ..., w„) = £ t(a)a Ua(1) ■ ■ ■ ...,v n ) 

<T 

xvhere the sum is taken over all permutations g of {1,..., n} and e(a) is the 
sign of the permutation. 
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For determinants, I shall follow Artin’s treatment in Galois Theory. 

By an n x n determinant we shall mean a mapping 

det: Mat n (R) -► R 

also written 

D : Mat„(K) - R 

which, when viewed as a function of the column vectors A 1 ,..., A n of a matrix 
A , is multilinear alternating, and such that D(I) = 1. In this chapter, we use 
mostly the letter D to denote determinants. 

We shall prove later that determinants exist. For the moment, we derive 
properties. 

Theorem 4.4. (Cramer’s Rule). Leî A 1 ,... ,A n be column vectors of dimen - 
sion n. Let x ls ..., x n e R be such that 

x^A 1 + • • • + x n A n = B 

for some column vector B. Thenfor each i we have 

x i D(A\...,A n ) = D(A l ,...,B,...,A n ), 

xvhere B in this last line occurs in the i-th place. 

Proof. Say i = í. We expand 

D(B, A 2 ,...,A")= £ XjD(A j , A 2 ,..., A"), 
j= 1 

and use Proposition 4.1 to get what we want (all terms on the right are equal 
to 0 except the one having x^ in it). 

Corollary 4.5. Assume that R is a field. Then A l 9 ...,A n are linearly 
dependent if and only if D(A X ,..., A n ) = 0. 

Proof. Assume we have a relation 


x^A 1 + ••• + x n A n = 0 

with x f e R. Then x t D(A) = 0 for all i. If some x f # 0 then D(A) = 0. Con- 
versely, assume that A 1 ,..., A n are linearly independent. Then we can express 
the unit vectors e 1 ,..., e n as linear combinations 

e 1 = bnA 1 + ••• + bi„A n , 
e n = b^A 1 H- • • • H- b nn A n 
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with bij e R. But 

1 =D(e\...,e n ). 

Using a previous lemma, we know that this can be expanded into a sum of 
terms involving D(A l ,..., A n \ and hence D(A) cannot be 0. 

Proposition 4.6. If determinants exist , they are unique. If A 1 ,..., A n are 
the column vectors of dimension n, of the matrix A = (a 0 ), then 

D(A \..., A") = X e(ff)a.(i),i • • • 

<x 

where the sum is taken over all permutations o of n}, and e(o) is the 

sign of the permutation. 

Proof. Let e 1 ,..., e n be the unit vectors as usual. We can write 
A 1 = a ! i e 1 + • • • -f a nl e n , 

A n = a ln e n + ••• + a nn e n . 

Therefore 

D(A\..., A n ) = X D ,1 • • • a<T(n), n 

<T 

by the lemma. This proves that the value of the determinant is uniquely deter- 
mined and is given by the expected formula. 

Corollary 4.7. Let cp : R -► R' be a ring-homomorphism into a commutative 
ring. If A is a square matrix in R , define cpA to be the matrix obtained by 
applying cp to each component of A. Then 

<P(D(A)) = D((pA). 

Proof. Apply cp to the expression of Proposition 4.6. 

Proposition 4.8. If A is a square matrix in R then 

D(A) = D('A). 


Proof. In a product 


a o(l),l ' * * a <r(n),n 

each integer k from 1 to n occurs precisely once among the integers (j(l),..., o(n). 
Hence we can rewrite this product in the form 

a l,<r~ l (í) * * ’ a n, <r ~ ^(n) • 
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Since e(a) = <(a '), we can rewrite the sum in Proposition 4.6 in the form 
£e(<r l )a u „-,(!) ■ ■ ■(!„'„ Hn) . 

a 

In this sum, each term corresponds to a permutation o. However, as <7 ranges 
over all permutations, so does o ~*. Hence our sum is equal to 

X/ 1, a( 1) * * ’ &n, a(n) > 

a 

which is none other than DÇA), as was to be shown. 

Corollary 4.9. The determinant is multilinear and alternating mth respect 
to the rows of a matrix. 

We shall now prove existence, and prove simultaneously one additional 
important property of determinants. 

When n = 1, we define D(a) = a for any ae R. 

Assume that we have proved the existence of determinants for all integers 
< n (n ^ 2). Let A be an n x n matrix in R, A = (a 0 ). We let A^ be the 
(n — 1) x (n — 1) matrix obtained from A by deleting the z-th row and j-th 
column. Let i be a fixed integer, 1 ^ i ^ n. We define inductively 

D(A) = (-1 ) i+í a n D(A n ) + ... + (-1 ) i+n a in D(A in ). 

(This is known as the expansion of D according to the i'-th row.) We shall prove 
that D satisfies the definition of a determinant. 

Consider D as a function of the /c-th column, and consider any term 

(~i y+^jD^j). 

If/ # k then a {j does not depend on the /c-th column, and D(Aìj) depends linearly 
on the /c-th column. If j = /c, then a tJ depends linearly on the /c-th column, and 
D(Aìj) does not depend on the /c-th column. In any case our term depends 
linearly on the /c-th column. Since D(A) is a sum ofsuch terms, it depends linearly 
on the /c-th column, and thus D is multilinear. 

Next, suppose that two adjacent columns of A are equal, say A k = A k+1 . 
Let j be an index ^ k and # k + 1. Then the matrix A i} has two adjacent equal 
columns, and hence its determinant is equal to 0. Thus the term corresponding 
to an index j ^ k or k + 1 gives a zero contribution to D(A). The other two 
terms can be written 


(-1 r k a ik D(A ik ) + (-1 ) i+k+ \ k+l D(A itk+ì ). 

The two matrices A ik and A i k+1 are equal because of our assumption that the 
/c-th column of A is equal to the (k + l)-th column. Similarly, a ik = a itk+1 . 
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Hence these two terms cancel since they occur with opposite signs. This proves 
that our form is alternating, and gives: 

Proposition 4.10. Determinants exist and satisfy the rule of expansion 
according to rows and columns. 

(For columns, we use the fact that D(A) = D(*A).) 

Example. We mention explicity one of the most important determinants. 
Let jcj, . . ., x n be elements of a commutative ring. The Vandermonde deter- 
minant V = V(x Ì9 ..., x n ) of these elements is defined to be 

1 1 ••• 1 

*\ *2 * * * 

v= : : . , 

v «-1 v / i -1 ... v «-1 

x \ x 2 x n 

whose value can be determined explicitly to be 

V=Yl (xj - x, ). 

i<j 

If the ring is entire and x t Xj for i j, it follows that V =£ 0. The proof for 
the stated value is done by multiplying the next to the last row by x x and subtracting 
from the last row. Then repeat this step going up the rows, thus making the 
elements of the fìrst column equal to 0, except for 1 in the upper left-hand corner. 
One can then expand according to the first column, and use the homogeneity 
property and induction to conclude the proof of the evaluation of V. 

Theorem 4.11. Let Ebe a module over R , and let v í9 ..., v n be elements of E. 
Let A = (aij) be a matrix in R , and let 



Let A be an n-multilinear alternating map on E. Then 

A(wj,..., w„) = D(A) A(v í9 ..., v n ). 

Proof. We expand 

^(« 11 ^! + ••• 4- a ln v n9 ... 9 a nì v x + ••• + a nn v n \ 
and find precisely what we want, taking into account D(A) = D(*A). 
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Let E, F be modules, and let L”(£, F) denote the set of n-multilinear alter- 
nating maps of E into F. If F = £, we also write L”(£, R ) = L£(L). It is clear 
that L£(L, F) is a module over R , i.e. is closed under addition and multiplication 
by elements of R. 

Corollary 4.12. Let Ebea free module over £, and let {v u ...,v n }bea basis. 
Let F be any module, and let weF. There exists a unique n-multilinear 
alternating map 

À w : E x •• • x E -► F 
such that AyXv ìy ...,v n ) = w. 

Proof. Without loss of generality, we may assume that E = R (n \ and then, 
if A*,..., A n are column vectors, we define 

A w (A\...,A n ) = D(A)w. 

Then A w obviously has the required properties. 

Corollary 4.13. IfE is free over R, and has a basis consisting of n elements, 
then L n (E) is free over R, and has a basis consisting of 1 element. 

Proof. We let A t be the multilinear alternating map taking the value 1 on a 
basis {v u ..., v n }. Any element cp e L n (E) can then be written in a unique way 
as cAj, with some ce R, namely c = (p(v u ..., v n ). This proves what we wanted. 

Any two bases of L£(L) in the preceding corollary differ by a unit in R. In 
other words, if A is a basis of L n (E), then À = cA x = A c for some ceR, and c 
must be a unit. Our À t depends of course on the choice of a basis for E. When 
we consider R {n) , our determinant D is precisely Àj, relative to the standard 
basis consisting of the unit vectors e 1 ,..., e n . 

It is sometimes convenient terminology to say that any basis of L n (E) is a 
determinant on E. In that case, the corollary to Cramer’s rule can be stated as 
follows. 

Corollary 4.14. Let Rbe a field. Let Ebe a vector space of dimension n. 
Let À be any determinant on E. Let v l9 ...,v n eE. In order that {v 1 ,...,v n } 
be a basis of E it is necessary and sujficient that 

A(v u ...,v n ) / 0. 

Proposition 4.15. Let A, B be n x n matrices in R. Then 


D(AB) = D(A)D(B). 
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Proof. This is actually a corollary of Theorem 4.11. We take v ì9 ... 9 v n 
to be the unit vectors e\ ..., e n , and consider 



We obtain 

D( Wl , ..., w n ) = D(AB)D(e \.... e n ). 

On the other hand, by associativity, applying Theorem 4.11 twice, 

D(w„..., w n ) = D(A)D(B)D(e\..., e n ). 

Since D(e l , ..., e") = 1, our proposition follows. 

Let A = (íj ( j ) be an n x n matrix in R. We let 

À = (b u ) 

be the matrix such that 

(Note the reversal of indices!) 

Proposition 4.16. Let d = D(A). Then AÂ = ÂA = dl. The determinant 
D(A) is invertible in R if and only if A is invertible , and then 



Proof For any pair of indices i 9 k the //c-component of AÂ is 

a n b lk + a i2 b 2k + ■■■ + a in b„ k = a n (-l) k+1 D(A kl ) + ■■■ + a in (- l) k+n D(A kn ). 

If i = /c, then this sum is simply the expansion of the determinant according 
to the i-th row, and hence this sum is equal to d. If i # fc, let Â be the matrix 
obtained from A by replacing the /c-th row by the i-th row, and leaving all other 
rows unchanged. Ifwe delete the /c-th row and the j -th column from Â, we obtain 
the same matrix as by deleting the /c-th row and j-th column from A. Thus 

^kj = Akj 9 

and hence our sum above can be written 


a,i(- 1)* + x D(A ki ) + ■■■ + a in (-l) k+n D(A kn ). 
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This is the expansion of the determinant of Â according to the z-th row. Hence 
D(Â) = 0, and our sum is 0. We have therefore proved that the z/c-component 
of AÂ is equal to d if z = k (i.e. if it is a diagonal component), and is equal to 0 
otherwise. This proves that AÂ = dl. On the other hand, we see at once from 
the definitions that 54 = 54. Then 


\ÂA) = 54 54 = Î4Î4 = dl y 


and consequently, ÂA = dl also, since \dl) = dl. When d is a unit in R , then A 
isinvertible,itsinverse being d~ l Â. Conversely,if/1 isinvertible,and44~ 1 = /, 
then D(A)D(A~ l ) = 1, and hence D(A) is invertible, as was to be shown. 

CoroIIary 4.17. Let F be any R-module, and let w ìt ..., w n be elements of 
F. Let A - (a t j) be an n x n matrix in R. Let 

a u w x + •• • + a ìn w n = v x 

a n\W\ + • • • + a nn W n = v n . 

Then one can solve explicitly 


lD{A)w x \ 


!w x \ 


/v A 

\ 

= D(A) 

: 

= A 

. 

\D(A)w„J 


\ Wn ì 


w 


In particular, ifv t = 0 for all z, then D(A)wì = 0 for all i. Ifv^ = 0 for all i 
and F is generated by w ìt ..., w n , then D(A)F = 0. 

Proof. This is immediate from the relation AA = D(A)I , using the remarks 
in §3 about applying matrices to column vectors whose components lie in the 
module. 

Proposition 4.18. Let E y F be free modules of dimension n over R. Let 
f'.E^Fbea linear map. Let CB, (B' be bases of £, F respectively over R. 
Then f is an isomorphism if and only if the determinant of its associated 
matrix M%(f) is a unit in R. 

Proof. Let A = M| (/). By definition, / is an isomorphism if and only 
if there exists a linear map g : F -> E such that g °f = id and /° g = id. If / is 
anisomorphism,and B = M^'(g\thtnAB = BA = I. Takingthedeterminant 
of the product, we conclude that D(A) is invertible in R. Conversely, if D(A) 
is a unit, then we can define A~ l by Proposition 4.16. This A~ l is the associated 
matrix of a linear map g:F —> E which is an inverse for/, as desired. 

Finally, we shall define the determinant of an endomorphism. 
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Let E be a free module over R , and let ® be a basis. Let /:£-►£ be an 
endomorphism of E. Let 


M = M«(/). 

If is another basis of E , and M' = M®'(/), then there exists an invertible 
matrix iV such that 


M' = NMN~\ 

Taking the determinant, we see that D(M') = D(M). Hence the determinant 
does not depend on the choice of basis, and will be called the determinant of the 
linear map/. We shall give below a characterization of this determinant which 
does not depend on the choice of a basis. 

Let E be any module. Then we can view L n a (E) as a functor in the variable E 
(contravariant). In fact, we can view L”(£, F) as a functor of two variables, 
contravariant in the first, and covariant in the second. Indeed, suppose that 

E' 4 E 

is a linear map. To each multilinear map q> : E {n) -► F we can associate the 
composite map q> ° f {n \ 

E' x • •. x E' fin) > E x • • • x £ 4 £ 

where f {n) is the product of / with itself n times. The map 

L n a (f) : L n a (E , F) -> L n a (E\ F) 

given by 

(py-.Cpo / <n >, 

is obviously a linear map, which defines our functor. We shall sometimes write 
/* instead of L%(f). 

In particular, consider the case when E = E' and F = R. We get an induced 
map 

f*:L n a (E)^L n a (E). 

Proposition4.19. Let Ebe afree module over R, of dimensionn. Let{N\bea 
basis of L a (E). Let f:E^>Ebean endomorphism of E. Then 

f* A = D(f) A. 

Proof. This is an immediate consequence of Theorem 4.11. Namely, we 
let {v i,..., v n } be a basis of £, and then take A (or f A) to be a matrix of / relative 
to this basis. By definition, 

f*A(v u ... 9 v„) = A(f(v í ),...,f(v n )l 
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and by Theorem 4.11, this is equal to 

D(A) A(vv n ). 

By Corollary 4.12, we conclude that /* A = D(A)A since both of these forms take 
on the same value on(v u ... 9 v n ). 

The above considerations have dealt with the determinant as a function on 
all endomorphisms of a free module. One can also view it multiplicatively, as 
a homomorphism. 

det: GL n (R) -> R * 

from the group of invertible n x n matrices over R into the group of units of R. 
The kernel of this homomorphism, consisting of those matrices with deter- 
minant 1, is called the special linear group, and is denoted by SL n (R). 

We now give an application of determinants to the situation of a free module 
and a submodule considered in Chapter III, Theorem 7.8. 

Proposition 4.20. Let R be a principal entire ring. Let F be a free module 
over R and let M be a fìnitely generated submodule. Let {e lt ..., e m , ... } be 
a basis ofF such that there exist non-zero elements a íf ..., a m e R such that: 

(i) The elements a x e x , ..., a m e m forrn a basis of M over R. 

(ii) We have a { | a i+x for i = 1 ,... 9 m - 1. 

Let L s a be the set of all s-multilinear alternating forms on F. Let J s be the ideal 
generated by all elements f(y\,---,y s )> / e L s a andy x , ..., y s e M. Then 

J s (a x ti s ). 

Proof We first show that J s <= (a x •• • a s ). Indeed, an element yeM can be 
written in the form 


y = c x a x e x H- • • • + c r a r e r . 

Hence if y ìt ... ,y s eM, and/is multilinear alternatingonF, then f(y x ,. . . , y s ) 
is equal to a sum in terms of type 

Cii ' • * c is a h * * * a isf( e h* ■ • •. e û- 

This is non-zero only when e it ,..., e is are distinct, in which case the product 
a x • • • a s divides this term, and hence J s is contained in the stated ideal. 

Conversely, we show that there exists an s-multilinear alternating form which 
gives precisely this product. We deduce this from determinants. We can write 
F as a direct sum 


F = (e u ...,e r )® F r 
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with some submodule F r . Let /, (i = 1,..., r) be the linear map F -» R such 
that/(ej) = ôij, and such that/j has value 0 on F r . For v u ..., v s e F we define 

f(v i, ...,v s ) = det (f(vj)). 

Then / is multilinear alternating and takes on the value 


f(e 2 ,...,e s ) = 1, 

as well as the value 

f(a t e u ..., a s e s ) = a, - a,. 

This proves the proposition. 

The uniqueness of Chapter III, Theorem 7.8 is now obvious, since first (a^) 
is unique, then ( a x a 2 ) is unique and the quotient ( a 2 ) is unique, and so forth by 
induction. 


Remark. Compare the above theorem with Theorem 2.9 of Chapter XIX, 
in the theory of Fitting ideals, which gives a fancier context for the result. 


§5. DUALITY 

Let R be a commutative ring, and let E , F be modules over R. An R - 
bilinear form on E x F is a map 

f:E x F-+ R 

having the following properties: For each xg£, the map 

y^f(x,y) 

is R-linear, and for each yeF, the map 

xh/(x, y ) 

is /?-linear. We shall omit the prefix R- in the rest of this section, and write 
<x, y} f or <x, y) instead of /(x, y). If xeF, we write x ly if <x, y> = 0. 
Similarly, if S is a subset of F, we define x _L 5 if x 1 y for all y e S. We then say 
that x is perpendicular to S. We let S x consist of all elements of E which are 
perpendicular to S. It is obviously a submodule of E. We define perpendicu- 
larity on the other side in the same way. We define the kernel of/ on the left 
to be F 1 and the kernel on the right to be E 1 . We say that /is non-degenerate 
on the left if its kernel on the left is 0. We say that /is non-degenerate on the 
right if its kernel on the right is 0. If E 0 is the kernel of/ on the left, then we 
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get an induced bilinear map 

E/E 0 x F -> R 

which is non-degenerate on the left, as one verifies trivially from the definitions. 
Similarly, if F 0 is the kernel of / on the right, we get an induced bilinear map 

E/E 0 x F/F 0 -> R 

which is non-degenerate on either side. This map arises from the fact that the 
value <x, y> depends only on the coset of x modulo E 0 and the coset of y 
modulo F 0 . 

We shall denote by L 2 (E , F ; R ) the set of all bilinear maps of £ x F into R. 
It is clear that this set is a module (i.e. an R-module), addition of maps being the 
usual one, and also multiplication of maps by elements of R. 

The form / gives rise to a homomorphism 

^p f :E -► Hom r (F 9 R) 

such that 

(P f (x)(y) = f(x, y) = <x, y), 

for all x g E and y e F. We shall call Hom R (F, R) the dual module of F , and denote 
it by F v . We have an isomorphism 


L\E , F;R) <- Hom R (E, Hom R (F, R)) 

given by / h— its inverse being defined in the obvious way: If 

q> :E-> Hom R (F 9 R) 

is a homomorphism, we let / be such that 

/(x, y) = <p(x)(y). 

We shall say that /is non-singular on the left if cp f is an isomorphism, in 
other words if our form can be used to identify E with the dual module of F. 
We define non-singular on the right in a similar way, and say that / is non- 
singular if it is non-singular on the left and on the right. 

Warning: Non-degeneracy does not necessarily imply non-singularity. 

We shall now obtain an isomorphism 


End r (E)^L 2 (E 9 F\R) 


depending on a fixed non-singular bilinear map f:E x F ■— R. 
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Let A e End K (£) be a linear map of F into itself. Then the map 
(x, y) H+ (Ax, y} = (Ax, y} f 

is bilinear, and in this way, we associate linearly with each A e End R (E) a bilinear 
map in L 2 (£, F; R). 

Conversely, let h : E x F — R be bilinear. Given x e £, the map h x : F -> R 
such that h x (y) = h(x 9 y) is linear, and is in the dual space F v . By assumption, 
there exists a unique element x' e E such that for all y e F we have 

/i(x, y) = <x', y>. 

It is clear that the association x i—► x' is a linear map of E into itself. Thus with 
each bilinear map F x F -► F we have associated a linear map E -► E. 

It is immediate that the mappings described in the last two paragraphs are 
inverse isomorphisms between Ená R (E) and L 2 (F, F; R). We emphasize of 
course that they depend on our form f. 

Of course, we could also have worked on the right, and thus we have a 
similar isomorphism 


L 2 (F,F;R)<-End*(F) 


depending also on our fixed non-singular form f. 

As an application, let A : E —> F be linear, and let (x, y) i—► (Ax, y ) be its 
associated bilinear map. There exists a unique linear map 

l A : F - F 

such that 

(Ax, y) = <x, Uy> 

for all x e F and y e F. We call '/í the transpose of with respect to /. 

It is immediately clear that if, A, B are linear maps of F into itself, then for 

ce R, 

\cA) = c'A, \A + B) = l A + l B, and \AB) = 'BU. 

More generally, let F, F be modules with non-singular bilinear forms denoted 
by ( , ) E and ( , ) F respectively. Let A: E —> F be a linear map. Then by the 
non-singularity of ( , ) E there exists a unique linear map l A: F —> E such that 

(Ax, y) F - (x, r Ay) £ for all x e F and y e F. 

We also call l A the transpose with respect to these forms. 

Examples. For a nice classical example of a transpose, see Exercise 33. 
For the systematic study when a linear map is equal to its transpose, see the 
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spectral theorems of Chapter XV. Next I give another example of a transpose 
from analysis as follows. Let E be the (infinite dimensional) vector space of 
C x functions on R, having compact support, i.e. equal to 0 outside some finite 
interval. We define the scalar product 

00 

</. g) = I f(x)g(x)dx. 

— oc 

Let D:E —> E be the derivative. Then one has the formula 

(Df, g) = -</, Dg). 

Thus one says that l D = -D , even though the scalar product is not “non-singular”, 
but much of the formalism of non-singular forms goes over. Also in analysis, 
one puts various norms on the spaces and one extends the bilinear form by 
continuity to the completions, thus leaving the domain of algebra to enter the 
domain of estimates (analysis). Then the spectral theorems become more com- 
plicated in such analytic contexts. 

Let us assume that E = F. Let f:Ex E R be bilinear. By an auto- 
morphism of the pair (£■,/), or simply of f we shall mean a linear automorphism 
A : E -► E such that 


(Ax, Ay > = <x, y} 

for all x, yeE. The group of automorphisms of / is denoted by Aut(/). 

Proposition 5.1. Let f:Ex E -► R be a non-singular bilinear form. Let 
A : E -► E be a linear map. Then A is an automorphism of f if and only if 
l AA = id, and A is invertible. 

Proof. From the equality 

<x, y > = <^4x, Ay > = <x, x AAy > 

holding for all x,yeE , we conclude that l AA = id if A is an automorphism of f. 
The converse is equally clear. 

Note. If E is free and finite dimensional, then the condition l AA = id 
implies that A is invertible. 

Let f :E x E -> R be a bilinear form. We say that / is symmetric if 
/(x, y) = f(y , x) for all x, y e E. The set of symmetric bilinear forms on E will 
be denoted by L s 2 (£). Let us take a fixed symmetric non-singular bilinear form 
f on E, denoted by (x, y) <x, y). An endomorphism A : E -> E will be said 
to be symmetric with respect to / if l A = A. It is clear that the set of sym- 
metric endomorphisms of E is a module, which we shall denote by Sym(£). 
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Depending on our fixed symmetric non-singular fi we have an isomorphism 


L s 2 (£)~Sym(£) 


which we describe as follows. If g is symmetric bilinear on L, then there exists 
a unique linear map A such that 

9(x, y) = ( Ax , y) 

for all x, y e L. Using the fact that both /, g are symmetric, we obtain 

(Ax, y} = (Ay, x> = <y, 'Ax > = <Ux, y>. 

Hence /1 = 'A The association g\-+ A gives us a homomorphism from L 2 (£) 
into Sym(L). Conversely, given a symmetric endomorphism >1 of L, we can 
define a symmetric form by the rule (x, y) i—► <Ax , y>, and the association of 
this form to A clearly gives a homomorphism of Sym(L) into L S (L) which is 
inverse to the preceding homomorphism. Hence Sym(L) and L 2 (L) are iso- 
morphic. 

We recall that a bilinear form g : L x L -> R is said to be alternating if 
g(x 9 x) = 0 for all xeL, and consequently g(x , y) = --g(y, x) for all x, y e L. 
The set of bilinear alternating forms on L is a module, denoted by L 2 (L). 

Let / be a fixed symmetric non-singular bilinear form on L. An endo- 
morphism A :E ^ E will be said to be skew-symmetric or alternating with 
respect to / if 'A = — A, and also < Ax , x> = 0 for all x e L. If for all a e R, 
2a = 0 implies a = 0, then this second condition <^x, x> = 0 is redundant, 
because <4x, x> = — (Ax, x> implies < Ax , x> = 0. It is clear that the set of 
alternating endomorphisms of L is a module, denoted by Alt(L). Depending 
on our fixed symmetric non-singular fiorm /, we have an isomorphism 


Z2(£)~Alt(£) 


described as usual. If g is an alternating bilinear form on L, its corresponding 
lineâr map A is the one such that 

g(x 9 y) = (Ax, y} 

for all x, y e L. One verifies trivially in a manner similar to the one used in the 
symmetric case that the correspondence g <-> A gives us our desired iso- 
morphism. 

Examples. Let k be a field and let L be a finite-dimensional vector space 
over k. Let /:LxL—»Lbea bilinear map, denoted by (x, y) i-» xy. To each 
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x e E, we associate the linear map X x : £ i —»E such that 

4(y) = xy. 

Then the map obtained by taking the trace, namely 

(x, y) t-+ tr(A xv ) 

is a bilinear form on E. If xy = yx , then this bilinear form is symmetric. 

Next, let E be the space of continuous functions on the interval [0, 1]. Let 
K(s, t) be a continuous function of two real variables defined on the square 
0 ^ s ^ 1 and 0 ^ t :§ 1. For (p, i/f e E we define 

<<P, <A> = JJ <P(s)K(s, t)il/(t) ds dt, 

the double integral being taken on the square. Then we obtain a bilinear form 
on E. If K(s 9 t ) = K(t 9 s), then the bilinear form is symmetric. When we discuss 
matrices and bilinear forms in the next section, the reader will note the similarity 
between the preceding formula and the bilinear form defined by a matrix. 

Thirdly, let U be an open subset of a real Banach space E (or a finite-dimen- 
sional Euclidean space, if the reader insists), and let /: U -► R be a map which 
is twice continuously differentiable. For each x e U 9 the derivative 
Df(x): E -► R is a continuous linear map, and the second derivative D 2 f(x ) 
can be viewed as a continuous symmetric bilinear map of E x E into R. 


§6. MATRICES AND BILINEAR FORMS 

We shall investigate the relation between the concepts introduced above and 
matrices. Let f:E x F -> R be bilinear. Assume that E, F are free over R. Let 
CB = {v l9 ..., v m } be a basis for E over R 9 and let = { w l9 ..., vv w } be a basis 
for F over R. Let = (v i9 w ; ). If 

X = x x v x + ••• + X m v m 

and 

y = y 1*1 + ••• + y n w n 

are elements of E and F respectively, with coordinates x i9 e R 9 then 

m rt 

<x,y>= £ XdijXiyj. 

i= 1 j =1 
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Let X, Y be the column vectors of coordinates for x, y respectively, with respect 
to our bases. Then 


<*, y> = ‘XGY 


where G is the matrix (g 0 ). We could write G = M|-(/). We call G the matrix 
associated with the form f relative to the bases ®, (B'. 

Conversely, given a matrix G (of size m x n) 9 we get a bilinear form from 
the map 

(X, Y)h-'XGy. 


In this way, we get a correspondence from bilinear forms to matrices and back, 
and it is clear that this correspondence induces an isomorphism (of R- modules) 


L 2 (E, F, R)*-> Mat mxn (R) 


given by 

f^Mun 

The two maps between these two modules which we described above are clearly 
inverse to each other. 

If we have bases ® = {v í9 ..., v n } and ®' = {wj,..., w„} such that 
(v i9 Wj > = ô ij9 then we say that these bases are dual to each other. In that case, 
if X is the coordinate vector of an element of £, and Y the coordinate vector of 
an element of F, then the bilinear map on X, Y has the value 

X Y= Xyy^ + ••• -f x n y n 


given by the usual dot product. 

It is easy to derive in general how the matrix G changes when we change 
bases in E and F. However, we shall write down the explicit formula only when 
E = F and ® = ®'. Thus we have a bilinear form/: E x E -► R. Let C be 
another basis of E and write X^ and X e for the column vectors belonging to 
an element x of E , relative to the two bases. Let C be the invertible matrix 
id), so that 

x® = cx e . 


Then our form is given by 


We see that 

( 1 ) 


<x, y} = ^Xq'CGCYq. 


M%{f) = 'CM|(/)C. 


In other words, the matrix of the bilinear form changes by the transpose. 
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IfF is free over R , with a basis {r] u , r] n }, then Hom^F, R) is also free , 
and we have a dual basis { 77 I,... , r/'} such that 

Vi(Vj) = Sìj. 

This has already been mentioned in Chapter III, Theorem 6.1. 

Proposition 6.1. Let E, F be free modules of dimension n over R and let 
f:ExF-*Rbea bilinear form. Then the following conditions are equiv- 
alent : 

/ is non-singular on the left. 
f is non-singular on the right. 
f is non-singular. 

The determinant of the matrix of f relative to any bases is invertible in R. 

Proof Assume that / is non-singular on the left. Fix bases of E and F 
relative to which we write elements of these modules as column vectors, and 
giving rise to the matrix G for/. Then our form is given by 

(X, Y) 'XGY 

where X, Y are column vectors with coefficients in R. By assumption the map 

X^ l XG 

gives an isomorphism between the module of column vectors, and the module 
of row vectors of length n over R. Hence G is invertible, and hence its deter- 
minant is a unit in R. The converse is equally clear, and if det(G) is a unit, we 
see that the map 

y- GT 

must also be an isomorphism between the module of column vectors and itself. 
This proves our assertion. 

We shall now investigate how the transpose behaves in terms of matrices. 
Let E, F be free over R, of dimension n. 

Let/ : E x F — R be a non-singular bilinear form, and assume given a basis 
CB of E and (B' of F. Let G be the matrix of / relative to these bases. Let 
A : E — E be a linear map. If x e E, y e F, let X, Y be their column vectors 
relative to (B, (B'. Let M be the matrix of A relative to (B. Then for x e E and 
yeF we have 

(Ax, y> = \MX)GY = 'X^MGY. 

Let N be the matrix of l A relative to the basis (B'. Then NY is the column vector 
of l Ay relative to (B'. Hence 


<x, 'Ay) = l XGNY. 
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From this we conclude that X MG = GN , and since G is invertible, we can solve 
for N in terms of M. We get: 

Proposition 6.2. Let E , F befree over R , of dimension n. Let f:E x F -> R 
be a non-singular bilinear form. Let ®, ®' be bases of E and F respectively 
over R , and let G be the matrix off relative to these bases. Let A : E -► Ebe a 
linear map 9 and let M be its matrix relative to ®. Then the matrix of l A 
relative to ®' is 


(G~ l yMG. 

Corollary 6.3. If G is the unit matrix , then the matrix of the transpose is 
equal to the transpose of the matrix. 


In terms of matrices and bases, we obtain the following characterization 
for a matrix to induce an automorphism of the form. 

Corollary 6.4. Let the notation be as in Proposition 6.2, and let E = F, 

® = ®'. An n x n matrix M is the matrix of an automorphism of the form 

f ( relative to our basis) if and only if 

l MGM = G. 

If this condition is satisfied , then in particular, M is invertible. 

Proof. We use the defìnitions, together with the formula given in 
Proposition 6.2. We note that M is invertible, for instance because its deter- 
minant is a unit in R. 

A matrix M is said to be symmetric (resp. alternating) if l M = M (resp. 
l M = — M and the diagonal elements of M are 0). 

Let f.ExE^R be a bilinear form. We say that / is symmetric if 
/(x, y) = f(y , x) for all x, y e E. We say that / is alternating if/(x, x) = 0 for 
all x e E. 


Proposition 6.5. Let Ebe a free module of dimension n over R , and let ® 
be a fixed basis. Themap 


induces an isomorphism betxveen the module of symmetric bilinear forms on 
E x E (resp. the module of alternating forms on E x E) and the module of 
symmetric n x n matrices over R (resp. the module of alternating n x n 
matrices over R). 
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Proof. Consider first the symmetric case. Assume that / is symmetric. In 
terms of coordinates, let G = M$(f). Our form is given by l XGY which must 
be equal to 'YGX by symmetry. However, l XGY may be viewed as a 1 x 1 
matrix, and is equal to its transpose, namely 1 Y l GX. Thus 

l YGX = 'Y'GX 

for all vectors X , Y. It follows that G = r G. Conversely, it is clear that any 
symmetric matrix defines a symmetric form. 

As for the alternating case, replacing x by x + y in the relation <x, x> = 0 
we obtain 


<*,y> = 0 ; ,*> = 0 . 

In terms of the coordinate vectors X , Y and the matrix G, this yields 

l XGY + l YGX = 0. 

Taking the transpose of, say, the second of the 1 x 1 matrices entering in this 
relation, yields (for all X 9 Y): 

l XGY + 'X'GY = 0. 

Hence G + r G = 0. Furthermore, letting X be any one of the unit vectors 

r (0,...,0, 1,0,...,0) 

and using the relation X XGX = 0, we see that the diagonal elements of G 
must be equal to 0. Conversely, if G is an n x n matrix such that r G H- G = 0, 
and such that g u = 0 for i = 1 ,..., n then one verifies immediately that the 
map 

C X , Y)hVXGY 

defines an alternating form. This proves our proposition. 

Of course, if as is usually the case, 2 is invertible in R f then our condition 
l M = — M implies that the diagonal elements of M must be 0. Thus in that 
case, showing that G + r G = 0 implies that G is alternating. 


§7. SESQUILINEAR DUALITY 

There exist forms which are not quite bilinear, and for which the results 
described above hold almost without change, but which must be handled 
separately for the sake of clarity in the notation involved. 
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Let R have an automorphism of period 2. We write this automorphism as 
ay-+â (and think of complex conjugation). 

Following Bourbaki, we say that a map 

f.ExF^R 

is a sesquilinear form if it is Z-bilinear, and if for x e E, y e F, and a e R we 
have 

f(ax,y) = af(x, y) 

and 

f(x, ay) = âf(x, y). 

(Sesquilinear means l^ times linear, so the terminology is rather good.) 

Let E, E' be modules. A map cp: E -► E f is said to be anti-linear (or semi- 
linear) if it is Z-linear, and cp(ax) = âcp(x) for all xe E. Thus we may say that 
a sesquilinear form is linear in its first variable, and anti-linear in its second 
variable. We let Hom R (E, E') denote the module of anti-linear maps of E 
into E'. 

We shall now go systematically through the same remarks that we made 
previously for bilinear forms. 

We define perpendicularity as before, and also the kernel on the right and 
on the left for any sesquilinear form /. These kernels are submodules, say E 0 
and F 0 , and we get an induced sesquilinear form 

E/E 0 x F/F 0 -+ R, 

which is non-degenerate on either side. 

Let F be an R- module. We defìne its anti-module F to be the module whose 
additive group is the same as F, and such that the operation R x F ^ F is 
given by 

(a, y) ay. 

Then F is a module. We have a natural isomorphism 
Hom^F, R) <-> Hom^F, R), 

as R- modules. 

The sesquilinear form f\ExF-+R induces a linear map 
cp f \E^ Hom^(F, R). 

We say that/is non-singular on the left if cp f is an isomorphism. Similarly, we 
have a corresponding linear map 

cp'f'-F -► Hom*(£, R) 
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from F into the dual space of £, and we say that / is non-singular on the right 
if <p' f is an isomorphism. We say that / is non-singular if it is non-singular on 
the left and on the right. 

We observe that our sesquilinear form/can be viewed as a bilinear form 

f:E x F^R, 

and that our notions of non-singularity are then compatible with those defined 
previously for bilinear forms. 

If we have a fixed non-singular sesquilinear form on E x F, then depending 
on this form, we obtain an isomorphism between the module of sesquilinear 
forms on E x F and the module of endomorphisms of E. We also obtain an 
anti-isomorphism between these modules and the module of endomorphisms 
of F. In particular, we can define the analogue of the transpose, which in the 
present case we shall call the adjoint. Thus, let/: E x F -> R be a non-singular 
sesquilinear form. Let A : E -► E be a linear map. There exists a unique linear 
map 

A* : F -► F 


such that 


(Ax,y) = <x, A*y} 

for all x e E and y e F. Note that A* is linear, not anti-linear. We call A* the 
adjoint of A with respect to our form f. We have the rules 

(cA)* = cA*, (A -f B)* = A* + B*, (AB)* = B*A* 

for all linear maps A, B of E into itself, and ce R. 

Let us assume that E = F. Let f:ExE-+R be sesquilinear. By an 
automorphism of/we shall mean a linear automorphism A : E -► E such that 

(Ax, Ay} = <x, y> 

just as we did for bilinear forms. 

Proposition 7.1. Let f:E x E^R be a non-singular sesquilinear form. 
Let A : E -> E be a linear map. Then A is an automorphism off if and only 
if A*A = id, and A is invertible. 

The proof, and also the proofs of subsequent propositions, which are 
completely similar to those of the bilinear case, will be omitted. 

A sesquilinear form g : E x E -> R is said to be hermitian if 


g(x, y) = g(y, x) 

for all x, y e E. The set of hermitian forms on E will be denoted by L 2 h (E). Let 
R 0 be the subring of R consisting of all elements fixed under our automorphism 
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a -> a (i.e. consisting of all elements ae R such that a = a). Then L 2 h (E) is an 
R 0 - module. 

Let us take a fixed hermitian non-singular form / on E , denoted by 
(x, ỳ)\- ><x, y ). An endomorphism A : E-+ E will be said to be hermitian 
with respect to/if A* = A. It is clear that the set of hermitian endomorphisms 
is an R 0 - module, which we shall denote by Herm(£). Depending on our fixed 
hermitian non-singular form /, we have an R 0 -isomorphism 


L 2 h (E) «-* Herm(£) 


described in the usual way. A hermitian form g corresponds to a hermitian 
map A if and only if 


g(x, y ) = <Ax, y) 

for all x, y e E. 

We can now describe the relation between our concepts and matrices, just 
as we did with bilinear forms. 

We start with a sesquilinear form /: E x F -> R. 

If E , F are free, and we have selected bases as before, then we can again 
associate a matrix G with the form, and in terms of coordinate vectors X , Y 
our sesquilinear form is given by 

(X, y)h- r XGF, 

where Y is obtained from Y by applying the automorphism to each component 
of y. 

If E = F and we use the same basis on the right and on the left, then with 
the same notation as that used in formula (1), if / is sesquilinear, the formula 
now reads 

(1S) M|(/) = 'CM%{f)C. 

The automorphism appears. 

Proposition 7.2. Let E , F be free modules of dimension n over R , and let 
f:E x F-*R be a sesquilinear form. Then the following conditions are 
equivalent. 

f is non-singular on the left. 
f is non-singular on the right. 
f is non-singular. 

The determinant of the matrix of f relative to any bases is invertible in R. 
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Proposition 7.3. Let E , F be free over R , of dimension n. Letf: E x F —► R 
be a non-singular sesquilinearform. Let ®, ®' be bases ofE and F respectively 
over R , and let G be the matrix of f relative to these bases. Let A.E^Ebe 
a linear map , and let M be its matrix relative to ®. Then the matrix of A* 
relative to ®' is 


(G~ l yMG. 


Corollary 7.4. If G is the unit matrix , then the matrix of A* is equal to l M. 

Corollary 7.5. Let the notation be as in the proposition , and let ® = ®' 
be a basis of E. An n x n matrix M is the matrix of an automorphism of f 
(relative to our basis ) if and only if 

l MGM = G. 

A matrix M is said to be hermitian if 'M = M. 

Let R 0 be as before the subring of R consisting of all elements fixed under 
our automorphism a i—► a (i.e. consisting of all elements ae R such that a = a). 

Proposition 7.6. Let E be a free module of dimension n over R , and let ® 
beabasis. Themap 


f\~* M|(/) 

induces an R 0 -isomorphism betxveen the R 0 -module of hermitian forms on E 

and the R 0 -module of n x n hermitian matrices in R. 

Remark. If we had assumed at the beginning that our automorphism 
a i * d has period 2 or 1 (i.e. if we allow it to be the identity), then the results 
on bilinear and symmetric forms become special cases of the results of this 
section. However, the notational differences are sufficiently disturbing to warrant 
a repetition of the results as we have done. 

Terminology 

For some confusing reason, the group of automorphisms of a symmetric 
(resp. alternating, resp. hermitian) form on a vector space is called the orthogonal 
(resp. symplectic, resp. unitary) group of the form. The word orthogonal is 
especially unfortunate, because an orthogonal map preserves more than 
orthogonality: It also preserves the scalar product, i.e. length. Furthermore, 
the word symplectic is also unfortunate. It turns out that one can carry out a 
discussion of hermitian forms over certain division rings (having automorphisms 
of order 2), and their group of automorphisms have also been called symplectic, 
thereby creating genuine confusion with the use of the word relative to alter- 
nating forms. 
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In order to unify and improve the terminology, I have discussed the matter 
with several persons, and it seems that one could adopt the following con- 
ventions. 

As said in the text, the group of automorphisms of any form/is denoted by 
Aut (/). 

On the other hand, there is a standard form, described over the real numbers 
in terms of coordinates by 


f(x, x) = x\ + • • • + x 2 , 
over the complex numbers by 

/(X, X) = X^Xj -f ••• -f x„x„, 

and over the quaternions by the same formula as in the complex case. The 
group of automorphisms of this form would be called the unitary group, and 
be denoted by U n . The points of this group in the reals (resp. complex, resp. 
quaternions) would be denoted by 


UJL R), U „(C), ujt K), 


and these three groups would be called the real unitary group (resp. complex 
unitary group, resp. quaternion unitary group). Similarly, the group of points 
of U n in any subfield or subring k of the quaternions would be denoted by U n (k). 
Finally, if / is the standard alternating form, whose matrix is 



one would denote its group of automorphisms by A 2n , and call it the alternating 
form group, or simply the alternating group, if there is no danger of confusion 
with the permutation group. The group of points of the alternating form 
group in a field k would then be denoted by A 2n (k). 

As usual, the subgroup of Aut(/) consisting of those elements whose 
determinant is 1 would be denoted by adding the letter S in front, and would 
still be called the special group. In the four standard cases, this yields 

SU n ( R), SU n ( C), SU n ( K), SA 2n (k). 


§8. THE SIMPLICITY OF SL 2 (F)/± 1 

Let F be a field. Let n be a positive integer. By GL n (F) we mean the group 
ofn x n invertible matrices over F. By SL n (F) we mean the subgroup of those 
matrices whose determinant is equal to 1. By PGL n (F) we mean the factor 
group of GL n (F) by the subgroup of scalar matrices (which are in the center). 
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Similarly for PSL„(F). In this section, we are interested in giving an application 
of matrices to the group theoretic structure of SL 2 . The analogous statements 
for SL„ with n ^ 3 will be proved in the next section. 

The standard Borel subgroup B of GL 2 is the group of all matrices 

a b 
0 d 

with a, b, d e F and ad # 0. For the Borel subgroup of SL 2 , we require in 
addition that ad = 1. By a Borel subgroup we mean a subgroup which is 
conjugate to the standard Borel subgroup (whether in GL 2 or SL 2 ). We let 
U be the group of matrices 


/1 b\ 

u(b) =1 \, with b e F. 


We let A be the group of diagonal matrices 



For the rest of this section, we let 

G = GL 2 (F) or SL 2 (F). 
Lemma 8.1. The matrices 


- (' J) and V(c) = (‘ J) 


generate SL 2 (F). 

Proof. Multiplying an arbitrary element of SL 2 (F) by matrices of the 
above type on the right and on the left corresponds to elementary row and 
column operations, that is adding a scalar multiple of a row to the other, etc. 
Thus a given matrix can always be brought into a form 
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by such multiplications. We want to express this matrix with a =é 1 in the form 

c x x x:) 

Matrix multiplication will show that we can solve this equation, by selecting x 
arbitrarily =é 0, then solving for b , c, and d successively so that 


1 + bx = a, c — 


—x 


d = 


1 + bx ’ " 1 -f bc ' 

Then one finds 1 4- bc = (1 4 xb)~ l and the two symmetric conditions 

b + bcd + d = 0 
c 4 bcx 4 * = 0, 

so we get what we want, and thereby prove the lemma. 

Let U be the group of lower matrices 


ct) 


Then we see that 

wUw~ 

Also note the commutation relation 


= U. 


a 0\ 

0 df 


d 0 
0 aì' 


so vv normalizes A. Similarly, 


wBw 1 = B 


is the group of lower triangular matrices. 

We note that 

B = AU = UA , 

and also that A normalizes U. 

There is a decomposition of G into disjoint subsets 


G = B u BwB. 


Indeed, view G as operating on the left of column vectors. The isotropy group of 



is obviously U. The orbit Be l consists of all column vectors whose second 
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component is 0. On the other hand, 



and therefore the orbit Bwe l consists of all vectors whose second component 
is # 0, and whose first component is arbitrary. Since these two orbits of B and 
BxvB cover the orbit Ge 1 , it follows that the union of B and BwB is equal to G 
(because the isotropy group U is contained in B\ and they are obviously 
disjoint. This decomposition is called the Bruhat decomposition 

Proposition 8.2. The Borel subgroup B is a maximal proper subgroup. 

Proof. By the Bruhat decomposition, any element not in B lies in BwB, 
so the assertion follows since B, BwB cover G. 

Theorem 8.3. IfF has at least four elements , then SL 2 (F) is equal to its own 

commutator group. 

Proof. We have the commutator relation (by matrix multiplication) 

s(a)u(b)s(a)~ l u(b )~ 1 = u(ba 2 — b) = u(b(a 2 - 1)). 

Let G = SL 2 (F) for this proof. We let G' be the commutator subgroup, and 
similarly let B' be the commutator subgroup of B. We prove the first assertion 
that G = G'. From the hypothesis that F has at least four elements, we can 
find an element a / 0 in F such that a 2 / 1, whence the commutator relation 
shows that B' = U. It follows that G' => G, and since G' is normal, we get 

G' => wUw~^. 

From Lemma 8.1, we conclude that G' = G. 

Let Z denote the center of G. It consists of ±/, that is ± the identity 2x2 
matrix if G = SL 2 (F); and Z is the subgroup of scalar matrices if G = GL 2 (F). 

Theorem 8.4. IfF has at least four elements, then SL 2 (F)/Z is simple. 

The proof will result from two lemmas. 

Lemma 8.5. The intersection of all conjugates of B in G is equal to Z. 

Proof. We leave this to the reader, as a simple fact using conjugation 
with vv. 

Lemma 8.6. Let G = SL 2 (F). If H is normal in G, then either H a Z or 

H => G'. 

Proof. By the maximality of B we must have 


HB = B or HB = G. 
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If HB = B then H <= B. Since H is normal, we conclude that H is contained in 
every conjugate of B , whence in the center by Lemma 8.5. On the other hand, 
suppose that HB = G. Write 


w = hb 

with he H and b e B. Then 

wUw ~ 1 = V = hbUb-^h' 1 = hUh ~ 1 <= HU 

because H is normal. Since U <= HU and £/, U generate SL 2 (F\ it follows that 
HU = G. Hence 


G/H = HU/H % U/(U n H) 

is abelian, whence H => G', as was to be shown. 

The simplicity of Theorem 8.4 is an immediate consequence of Lemma 8.6. 


§9. THE GROUP SL n (F). n ^ 3. 

In this section we look at the case with n ^ 3, and follow parts of Artin’s 
Geometric Algebra , Chapter IV. (Artin even treats the case of a non-commuta» 
tive division algebra as the group ring, but we omit this for simplicity.) 

For ij = 1,..., n and i / j and c e F, we let 



be the matrix which difîers from the unit matrix by having c in the ij -component 
instead of 0. We call such EJc) an elementary matrix. Note that 

det EJc) = 1. 

If A is any n x n matrix, then multiplication E i} (c)A on the left adds c times the 
7 *-th row to the i-th row of A. Multiplication AEJc) on the right adds c times 
the i-th column to the j-th column. We shall mostly multiply on the left. 

For fixed i # j the map 


C ^ £,/c) 
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is a homomorphism of F into the multiplicative group of n x n matrices 
GL n {F). 

Proposition 9.1. The group SL n (F) is generated by the elementary matrices. 
If A e GL n (F), then A can be xvritten in theform 

A = SD, 

where S e SL n (F) and D is a diagonal matrix of theform 



so D has 1 on the diagonal except on the loxver right corner, where the com - 

ponent is d = det(/l). 

Proof. Let A e GL n (F). Since A is non-singular, the first component of 
some row is not zero, and by an elementary row operation, we can make 
a n / 0. Adding a suitable multiple of the first row to the second row, we make 
a 2ì /0, and then adding a suitable multiple of the second row to the first we 
make a n = 1. Then we subtract multiples of the first row from the others to 
make a xl = 0 for i / 1. 

We now repeat the procedure with the second row and column, to make 
a 22 = 1 and a i2 = 0 if i > 2. But then we can also make a l2 = 0 by sub- 
tracting a suitable multiple of the second row from the first, so we can get 
a i2 = 0 for i / 2. 

We repeat this procedure until we are stopped at a nn = d / 0, and a nj = 0 
for j # n. Subtracting a suitable multiple of the last row from the preceding 
ones yields a matrix D of the form indicated in the statement of the theorem, 
and concludes the proof. 

Theorem 9.2. For n ^ 3, SL n (F) is equal to its own commutator group. 

Proof. It suffices to prove that E xj (c) is a commutator. Using n ^ 3, let 
k # i,j. Then by direct computation, 

expresses L,/c) as a commutator. This proves the theorem. 

We note that if a matrix M commutes with every element of SL n (F), then 
it must be a scalar matrix. Indeed, just the commutation with the elementary 
matrices 


Eijd) = / + l tj 
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shows that M commutes with all matrices l^- (having 1 in the ij- component, 
0 otherwise), so M commutes with all matrices, and is a scalar matrix. Taking 
the determinant shows that the center consists of jJt w (F)7, where it„(F) is the 
group of n-th roots of unity in F. 

We let Z be the center of SL n (F), so we have just seen that Z is the group 
of scalar matrices such that the scalar is an n- th root of unity. Then we define 

PSL n (F) = SL n (F)/Z. 

Theorem 9.3. For n ^ 3, PSL n (F) is simple. 

The rest of this section is devoted to the proof. We view GL n (F) as operating 
on the vector space E = F n . If X is a non-zero functional on E , we let 

H x = Ker X, 

and call H x (or simply H) the hyperplane associated with X. Then dim H = n — 1, 
and conversely, if H is a subspace of codimension 1, then E/H has dimension 
1, and is the kernel of a functional. 

An element T e GL n (F) is called a transvection if it keeps every element of 
some hyperplane H fixed, and for all xe E, we have 

Tx = x + h for some he H. 

Given any element u e H x we define a transvection T u by 

T u x = x + X(x)u. 

Every transvection is of this type. If u 9 v e H À , it is immediate that 

T u + V — T u o T v . 

If T is a transvection and A e GL n (F ), then the conjugate ATA~ l is ob- 
viously a transvection. 

The elementary matrices £,/c) are transvections, and it will be useful to 
use them with this geometric interpretations, rather than formally as we did 
before. Indeed, let e l9 ..., e n be the standard unit vectors which form a basis 
of F (n) . Then E i} (c) leaves e k fixed if k # j, and the remaining vector e^ is moved 
by a multiple of e x . We let H be the hyperplane generated by e k with k # j, 
and thus see that E i} (c) is a transvection. 

Lemma 9.4. For n ^ 3, the transvections # / form a single conjugacy class 

in SL n (F). 

Proof. First, by picking a basis of a hyperplane H = H k and using one 
more element to form a basis of F (n) , one sees from the matrix of a transvection 
T that det T = 1, i.e. transvections are in SL n (F). 
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Let T' be another transvection relative to a hyperplane H'. Say 

Tx = x + A(x)u and T'x = x -f X'(x)u' 

with ue H and u e H'. Let z and z' be vectors such that A(z) = 1 and Â'(z') = 1. 
Since a basis for H together with z is a basis for F (n \ and similarly a basis for 
H' together with z' is a basis for F (n \ there exists an element A e GL n (F) such 
that 


Au = u\ AH = H\ Az = z'. 

It is then immediately verified that 

ATA~ l = T\ 

so T, T' are conjugate in GL n (F). But in fact, using n ^ 3, the hyperplanes JF/, 
H' contain vectors which are independent. We can change the image of a basis 
vector in H' which is independent of u' by some factor in F so as to make 
det A = 1, so A e SL n (F). This proves the lemma. 

We now want to show that certain subgroups of GL n (F) are either con- 
tained in the center, or contain SL n (F). Let G be a subgroup of GL n (F). We 
say that G is 5L n -invariant if 

AGA ~ 1 c= G for all A e SL n (F). 

Lemma 9.5. Let n ^ 3. Let G be SL n invariant , and suppose that G contains 

a transvection T / /. Then SL n (F) c G. 

Proof. By Lemma 9.4, all transvections are conjugate, and the set of 
transvections contains the elementary matrices which generate SL n (F) by 
Proposition 9.1, so the lemma follows. 

Theorem 9.6. Let n ^ 3. IfG is a subgroup of GL n (F) which is SL„-invariant 

and which is not contained in the center of GL n (F ), then SL n (F) cz G. 

Proof. By the preceding lemma, it suffices to prove that G contains a 
transvection, and this is the key step in the proof of Theorem 9.3. 

We start with an element A e G which moves some line. This is possible 
since G is not contained in the center. So there exists a vector u # 0 such that 
Au is not a scalar multiple of w, say Au = v. Then u, v are contained in some 
hyperplane H = Ker À. Let T = T u and let 

B = AT A~ x T~ l . 


Then 


ATA~ l # T and B = ATA~ l T~ l # /. 
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This is easily seen by applying say B to an arbitrary vector x, and using the 
definition of T u . In each case, for some x the left-hand side cannot equal the 
right-hand side. 

For any vector x e F {n) we have 

Bx — X £ (u, v), 

where (n, v) is the plane generated by u, v. It follows that BH <= H, so 
BH = H and Bx — x e H. 

We now distinguish two cases to conclude the proof. First assume that B 
commutes with all transvections with respect to H. Let vv e H. Then from the 
definitions, we find for any vector x: 

BT w x = Bx + A.(x)Bw 

T w Bx = Bx + X(Bx)w = Bx + A(x)w. 

Since we are in the case BT W = T W B , it follows that Bw = w. Theretore B 
leaves every vector of H fixed. Since we have seen that Bx — x e H for all x, 
it follows that B is a transvection and is in G, thus proving the theorem in this 
case. 

Second, suppose there is a transvection T w with w e H such that B does not 
commute with T w . Let 

C = BT w B~ l T~ l . 

Then C / / and C e G. Furthermore C is a product of T w 1 and BT w B~ l 
whose hyperplanes are H and BH , which is also H by what we have already 
proved. Therefore C is a transvection, since it is a product of transvections 
with the same hyperplane. And C e G. This concludes the proof in the second 
case, and also concludes the proof of Theorem 9.6. 

We now return to the main theorem, that PSL n (F) is simple. Let G be a 
normal subgroup of PSL n (F ), and let G be its inverse image in SL n (F). Then G 
is SL„-invariant, and if G # 1, then G is not equal to the center of SL n (F). 
Therefore G contains SL n (F) by Theorem 9.6, and therefore G = PSL n (F), thus 
proving that PSL n (F) is simple. 

Example. By Exercise 41 of Chapter I, or whatever other means, one sees 
that PSL 2 ( F 5 ) » A 5 (where F 5 is the finite field with 5 elements). While you are 
in the mood, show also that 


PGL 2 ( F 3 ) « S 4 but SL 2 ( F 3 ) i* S 4 , PSL 2 (F 3 ) « A 4 . 
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EXERCISES 

1. Interpret the rank of a matrix A in terms of the dimensions of the image and kernel 
of the linear map L Á . 

2. (a) Let A be an invertible matrix in a commutative ring R. Show that CA)" 1 = '(A -1 ). 
(b) Let / be a non-singular bilinear form on the module E over R. Let A be an 

/?-automorphism of E. Show that ('A) -1 = '(A -1 ). Prove the same thing in the 
hermitian case, i.e. (A*) _1 = (A' 1 )*. 

3. Let V, W be finite dimensional vector spaces over a field k. Suppose given 

non-degenerate bilinear forms on V and W respectively, denoted both by ( , ). 

Let L: V —> W be a surjective linear map and let 'L be its transpose; that is, 

( Lv , vv) = ( v , 'Lvv) for v E V and vv E W. 

(a) Show that 'L is injective. 

(b) Assume in addition that if v E V, v 0 then ( v, v) + 0. Show that 

V = Ker L ® Im 'L, 

and that the two summands are orthogonal. (Cf. Exercise 33 for an example.) 

4. Let A, ..., A r be row vectors of dimension n, over a field k. Let X = (x 1? ..., x n ). Let 
b l9 ..., b r e k. By a system of linear equations in k one means a system of type 

Aj X = b l ,...,A r X = b r . 

If b { = • • = b r = 0, one says the system is homogeneous. We call n the number of 
variables, and r the number of equations. A solution X of the homogeneous system 
is called trivial if jc, = 0, / = 1,...,«. 

(a) Show that a homogeneous system of r linear equations in* n unknowns with 
n > r always has a non-trivial solution. 

(b) Let L be a system of homogeneous linear equations over a field k. Let k be a 
subfield of k!. If L has a non-trivial solution in k\ show that it has a non-trivial 
solution in k. 

5. LetA/beann x n matrix over a field k. Assumethat ir(MX) = Oforalln x nmatrices 
X ìn k. Show that M = 0. 

6. Let S be a set of n x n matrices over a fìeld k. Show that there exists a column vector 
X^Oof dimension n in k, such that MX = X for all MeS if and only if there exists 
such a vector in some extension fìeld k' of k. 

. 7. Let H be the division ring over the reals generated by elements /, j, k such that 
i 2 = j 2 = k 2 = — 1, and 

ij = - ji = k, jk = — kj = i, ki = -ik= j. 

Then H has an automorphism of order 2, given by 

a 0 + a x i + a 2 j + a 3 k\-> a 0 - a { i - a 2 j - a 3 k. 

Denote this automorphism by a á. What is aâ? Show that the theory of hermitian 
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forms can be carried out over H, which is called the division ring of quaternions (or by 
abuse of language, the non-commutative field of quaternions). 

8. Let N be a strictly upper triangular n x n matrix, that ìs N = (a tJ ) and a u = 0 ìf i ^ j. 
Show that N n = 0. 

9. Let E be a vector space over /c, of dimension n. Let T: E -+ E be a linear map such 
that T is nilpotent, that is T m = 0 for some positive integer m. Show that there exists 
a basis of E over k such that the matrix of T with respect to this basis is strictly 
upper triangular. 

10. If N ìs a nilpotent n x n matrix, show that I + N is invertible. 

11. Let R be the set of all upper triangular n x n matrices (a u ) with a u in some field /c, so 
a u = 0 if i > j. Let J be the set of all strictly upper triangular matrices. Show that J 
is a two-sided ìdeal in R. How would you describe the factor ring R/Jl 

12. Let G be the group of upper triangular matrices with non-zero diagonal elements. 
Let H be the subgroup consisting of those matrices whose diagonal element ìs 1. 
(Actually prove that H ìs a subgroup). How would you describe the factor group G/Hl 

13. Let R be the ring of n x n matrices over a field k. Let L be the subset of matrices 
which are 0 except on the first column. 

(a) Show that L is a left ideal. 

(b) Show that L is a minimal left ideal; that is, if L' C L is a left ideal and 
L' + 0, then L' = L. (For more on this situation, see Chapter VII, §5.) 

14. Let F be any field. Let D be the subgroup of diagonal matrices ìn GL n (F). Let N be 
the normalizer of D in GL n (F). Show that N/D is isomorphic to the symmetric group 
on n elements. 

15. Let F be a fìnite field with q elements. Show that the order of GL n (F) ìs 

(<?" - 1 X <?" -</) •(<?"- <?"'') = <?"'"' 1)12 fl (<?■' - 1 ). 

i = 1 

[Hint: Let Xj,..., x„ be a basis of F n . Any element of GL n (F) ìs uniquely determined 
by its effect on this basis, and thus the order of GL n (F) ìs equal to the number of all 
possible bases. If A e GL n (F\ let Ax^ = y u For y { we can select any of the q n - 1 
non-zero vectors in F n . Suppose mductively that we have already chosen y,,..., y r 
with r < n. These vectors span a subspace of dimension r which contains q r elements. 
For y í+1 we can select any of the q n — q r elements outside of this subspace. The 
formula drops out.] 

16. Again let F be a finite field with q elements. Show that the order of SL n (F) is 

n 

q n(n ~ D/2 Y[ (qi - 1); 

i=2 

and that the order of PSL n (F) ìs 

V - ,,/2 n'tf -1). 

a i = 2 

where d is the greatest common divisor of n and q — 1. 
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17. Let F be a fimte field vvith q elements. Show that the group of all upper triangular 
matrices with 1 on the diagonal ìs a Sylow subgroup of GL n (F) and of SL n (F). 

18. The reduction map Z —► Z/NZ, where N is a positive integer defines a homomorphism 

SL 2 ( Z) - SL 2 (Z/NZ). 

Show that this homomorphism is surjective. [Hint : Use elementary divisors, i.e. the 
structure of submodules of rank 2 over the principal ring Z.] 

19. Show that the order of SL 2 (Z/NZ) is equal to 



where the product is taken over all primes dividing N. 

20. Show that one has an exact sequence 

1 -► SL 2 (Z/NZ) -> GL 2 (Z/NZ) ^ (Z/NZ)* - 1. 

In fact, show that 

GL 2 (Z/NZ) = SL 2 (Z/NZ)G n , 
where G N is the group of matrices 


with de(Z/NZ)*. 

21. Show that SL 2 ( Z) is generated by the matrices 

c i) - c ■:)■ 

22. Let p be a prime ^ 5. Let G be a subgroup of SL 2 (Z/p n Z) with n ^ 1. Assume that 
the image of G in SL 2 (Z/pZ) under the natural homomorphism is all of SL 2 (Z/pZ). 
Prove that G = SL 2 (Z/p n Z). 

Note. Exercise 22 is a generalization by Serre of a result of Shimura; see Serre’s Abelian 

t-adic Representations and elliptic curves , Benjamin, 1968, IV, §3, Lemma 3. See also 

my exposition in Elliptic Functions , Springer Verlag, reprinted from Addison-Wesley, 

1973, Chapter 17, §4. 

23. Let k be a field in which every quadratic polynomial has a root. Let B be the Borel 
subgroup of GL 2 (k). Show that G is the union of all the conjugates of B. (This cannot 
happen for finite groups!) 

24. Let A, B be square matrices of the same size over a field k. Assume that B is non- 
singular. If t is a variable, show that det(/l 4- tB) is a polynomial in r, whose leading 
coefficient is det(B), and whose constant term is det(4). 

25. Let û 11 ,...,û ln be elements from a principal ideal ring, and assume that they generate 
the unit ideal. Suppose n > 1. Show that there exists a matrix (a^) with this given 
first row, and whose determinant is equal to 1. 


C 
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26. Let A be a commutative ring, and / = (x,,..., x r ) an ideal. Let c 0 e A and let 

r 

yi= I c U x r 
j= 1 

Let /' = (y„ ..., y r ). Let D = det(c (j ). Show that DI c: /'. 

27. Let L be a free module over Z with basis c,,..., c„. Let M be a free submodule of the 
same rank, with basis u l9 ..., u n . Let u, = ]T Show that the index (L: M) is 
given by the determinant: 

(L:M) = |det(c 0 )|. 

28. (The Dedekind determinant). Let G be a finite commutative group and let F be the 
vector space of functions of G into C. Show that the characters of G (homomorphisms 
of G into the roots of unity) form a basis for this space. If /: G —> C is a function, 
show that for a, b E G. 

det (f(ab~ 1 )) = f[ Z *(«)/(«)> 

X aeG 

where the product is taken over all characters. [Hint: Use both the characters and 
the char^cteristic functions of elements of G as bases for F, and consider the linear map 

r = I/(a)r 0 , 

where T a is translation by a.] Also show that 

det (f(ab~ ‘)) = ( X f(a)) det‘) - J{b ~')), 

\a 6 G / 


where the determinant on the left is taken for all a 9 b eG, and the determinant on 
the right is taken only for a, b =t= 1. 

29. Let g be a module over the commutative ring R. A bilinear map g x g -► g, written 
(x, y) [x, ý] 9 is said to make g a Lie algebra if [x, x] = 0 and 

[[*, xL 2 ] + [[y, z], x] 4- [[z, x], y~\ = 0 


for all x, y, z e g. 

(a) Let M n (R) be the ring of matrices over R. If x, y e M n (R\ show that the 
product 

(x, y) i-> [x, ý] = xy - yx 
makes M n (R) into a Lie algebra. 

(b) Let g be a Lie algebra. Let x e g, and let L x , L(x) or Lie x be the linear map 
given by L x (y) = [x, y\ . Show that L x is a derivation of g into itself (i.e. 
satisfies the rule D([y,z\) = [Dy,z\ + [y, Dz]). 

(c) Show that the map x i—► L x is a Lie homomorphism of g into the module of 
derivations of g into itself. 

30. Given a set of polynomials (P V (A' 0 )} in the polynomial ring F[2f 0 ] (1 ^ i,j ^ n), a 
zero of this set in R is a matrix x = (x 0 ) such that x 0 e R and P v (x 0 ) = 0 for all v. 
We use vector notation, and write (2Q = (X u ). We let G(R) denote the set of zeros 
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of our set of polynomials {P v }. Thus G(R) c= M n (R\ and if R' is any commutative 
associative /?-algebra we have G(R ) c= M n (R'). We say that the set {P v } defines an 
algebraic group over R if G(/?') is a subgroup of the group GL n (R') for all R' (where 
GL n (R') is the multiplicative group of invertible matrices in R'). 

As an example, the group of matrices satisfying the equation { XX = I n ìs an alge- 
braic group. 

Let R' be the R- algebra which is free, with a basis {1, t} such that t 2 = 0. Thus 
R' = /?[t]. Let g be the set of matrices x 6 M n (R) such that /„ + rx 6 G(/?[í]). Show 
that g is a Lie algebra. [Hinf. Note that 

PMn + tX) = P V (I„) + grad P v (/„)íX. 

Use the algebra R[t, u ] where t 2 = w 2 = 0 to show that if I n + txe G(R[t~\) and 
/„ 4- uy e G(P[u]) then [x, y\ e g.] 

(I have taken the above from the first four pages of [Se 65]. For more information 
on Lie algebras and Lie Groups, see [Bo 82] and [Ja 79]. 

[Bo 82] N. Bourbaki, Lie Algebras and Lie Groups , Masson, 1982 
[Ja 79] N. Jacobson, Lie Algebras , Dover, 1979 (reprinted from Interscience, 
1962) 

[Se 65] J. P. Serre, Lie Algebras and Lie Groups , Benjamin, 1965. Reprinted 
Springer Lecture Notes 1500. Springer/Verlag 1992 

Non-commutative cocycles 

Let Kbea fìnite Galois extension of a field k. Let T = GL n (K\ and G = Ga\(K/k). 
Then G operates on T. By a cocycle of G in T we mean a family of elements {A(o)} 
satisfying the relation 

A(o)(tA(t) = A(ot). 

We say that the cocycle splits ìf there exists B eV such that 
A(o) = B~ l oB for all cr g G. 

In this non-commutative case, cocycles do not form a group, but one could define an 
equivalence relation to define cohomology classes. For our purposes here, we care 
only whether a cocycle splits or not. When every cocycle splits, we also say that 
H l (G, T) = 0(or 1). 

31. Prove that H l (G , GL n (K)) = 1. [Hint: Let {e l5 ..., e N } be a basis of Mat n (k) over /c, 
say the matrices with 1 in some component and 0 elsewhere. Let 

N 

X = X X i e i 

i- 1 

with variables Xj. There exists a polynomial P(X) such that x is invertible if and only 
if P,(x {ì ..., x N ) ^ 0. Instead of P(x { , ..., x N ) we also write P(x). Let {A(o)} be a 
cocycle. Let {t a } be algebraically independent variables over k. Then 
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because the polynomial does not vanish when one t y is replaced by 1 and the others 
are replaced by 0. By the algebraic independence of automorphisms from Galois 
theory, there exists an element y e K such that if we put 

B = £ (yy)A(y) 


32. 


then P(B) # 0, so B is invertible. It is then immediately verified that A(o) = BaB l . 
But when k is finite, cf. my Algebraic Groups over Finite Fields , Am. J. Vol 78 No. 
3, 1956.] 

Invaríant bases. (Kolchin-Lang, Proc. AMS Vol 11 No. 1, 1960). Let K be a finite 
Galois extension of /c, G = Gal(A7/c) as in the preceding exercise. Let V be a 
finite-dimensional vector space over K , and suppose G operates on V in such a 
way that cr(av) = cr(a)cr(v) for a E K and v E V. Prove that there exists a basis 
{vvj,. . . , w n } such that oWj = vv, for all i = 1,.. ., n and all cr e G (an invariant 
basis). Hint\ Let {v u .. ., v n } be any basis, and let 



where A(o) is a matrix in GL n (K). Solve for B in the equation (oB)A((r) = fî, and let 



The next exercises on harmonic polynomials have their source in Whittaker, Math. 
Ann. 1902; see also Whittaker and Watson, Modern Analysis , Chapter XIII. 

33. Harmonic polvnomials. Let Pol(n, d) denote the vector space of homogeneous poly- 
nomials of degree d in n variables X lt ... 9 X n over a field k of characteristic 0. 


For an n-tuple of integers ( v x ,. 
monomial 


v n ) with v { ^ 0 we denote by M (v) as usual the 


M (V) (X) =X\ 1 --- X v n \ 


Prove: 

(a) 


/n — l + d\ 

The number of monomials of degree d is I I, so this number is 

\ n — 1 / 

the dimension of Pol(n, d). 

(b) Let (D) = (D x ,..., D n ) where D, is the partial derivative with respect to the 
/-th variable. Then we can defìne P(D) as usual. For P,Q e Pol(«, d), define 

<P, Q) = P(D)Q( 0). 

Prove that this defines a symmetric non-degenerate scalar product on 
Pol(«, d). If k is not real, it may happen that P ± 0 but (P, P) = 0. However, 
if the ground field is real, then (P, P) > 0 for P 0. Show also that the 
monomials of degree d form an orthogonal basis. What is (M {v)ì M (l/) )? 

(c) The map P I-» P(D) is an isomorphism of Pol(n, d) onto its dual. 
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(d) Let A = Df + • • • + D 2 . Note that A: Pol(n, d) —> Pol(n, d - 2) is a linear 
map. Prove that A is surjective. 

(e) Define Har(n, d) = KerA = vector space of harmonic homogeneous poly- 
nomials of degree d. Prove that 

dim Har (n, d) = (n + d - 3)! (n + 2d - 2)/(n - 2 )\d\. 

In particular, if n = 3, then dim Har(3, d) = 2d + 1. 

(f) Let r 2 = X 2 + • • • + X 2 . Let S denote multiplication by r 2 . Show that 

<AP, Q) = <P, SQ) for P e Pol(w, d) and Q e Pol(n, d - 2), 

so 'A = S. More generally, for R e Pol(n, m) and Q e Pol(n, d - m) we 
have 

(R(D)P , Q) = <P, RQ). 

(g) Show that [A, S] = 4d + 2n on Pol(n, d). Here [A, S] = A °5 - 5° A. 
Actually, [A, 5] = 4E + 2 n, where E is the Euler operator E = ^X^D^ 
which is, however, the degree operator on homogeneous polynomials. 

(h) Prove that Pol(«, d) = Har (n, d) ® r 2 Pol(«, d — 2) and that the two summands 
are orthogonal. This is a classical theorem used in the theory of the Laplace 
operator. 

(i) Let (cj,. . . , c n ) e k n be such that 'Zcf = 0. Let 

H d c (X) = (c x X x + • • • + c n X n ) d . 

Show that H d is harmonic, i.e. lies in Har (n, d). 

(j) For any Q E Pol(n, d ), and a positive integer m, show that 

Q(D)H?(X) = m(m - 1) • • • (m - d + 1 )Q(c)H^~ d (X). 

34. (Continuation of Exercise 33). Prove: 

Theorem. Let k be algebraically closed of characteristic 0. Let n ^ 3. Then 
Har(«, d) as a vector space over k is generated by all polynomials H d with (c) E k n 
such that 2c 2 = 0. 

[Hint : Let Q E Har(«, d) be orthogonal to all polynomials H d with (c) E k n . By 
Exercise 33(h), it suffices to prove that r 2 \Q. But if 2c 2 = 0, then by Exercise 
33(j) we conclude that Q(c) = 0. By the Hilbert Nullstellensatz, it follows that there 
exists a polynomial F(X) such that 

Q(X) S = r 2 (X)F(X) for some positive integer s. 

But n ^ 3 implies that r 2 (X) is irreducible, so r 2 (X) divides Q(X).] 

35. (Continuation of Exercise 34). Prove that the representation of 0(n) — C/ W (R) on 
Har(«, d) is irreducible. 

Readers will find a proof in the following: 

S. Helgason, Topics in Harmonic Analysis on Homogeneous Spaces , Birkháuser, 1981 
(see especially §3, Theorem 3.1 (ii)) 

N. Vilenkin, Special Functions and the Theory of Group Representations , AMS Trans- 
lations of mathematical monographs Vol. 22, 1968 (Russian original, 1965), Chapter 
IX, §2. 
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R. Howe and E. C. Tan, Non-Abelian Harmonic Analysis, Universitext, Springer Verlag, 
New York, 1992. 

The Howe-Tan proof runs as follows. We now use the hermitian product 
(P, Q) = J P(x) Q(x) da(x), 

S n~ì 

where a is the rotation invariant measure on the (n-l)-sphere S” -1 . Let 
e\,...,e n be the unit vectors in R\ We can identify 0(n - 1) as the subgroup of 
0(n) leaving e n fixed. Observe that 0(n) operates on Har(/i,í/), say on the right by 
composition P i-> P o A, A e 0(n), and this operation commutes with A. Let 

X\ Har(M) C 

be the functional such that X(P) = P(e n ). Then X is 0(n - l)-invariant, and since the 
hermitian product is non-degenerate, there exists a harmonic polynomial Q n such 
that 


X(P) = (P,Q n y for all P e Har(«, d). 

Let M <= Har (n,d) be an 0(n)-submodule. Then the restriction X\ / of X to M is 
nontrivial because 0(n) acts transitively on S w_1 . Let Qjf be the orthogonal pro- 
jection of Q n on M. Then Q„ is 0(n - l)-invariant, and so is a linear combination 

Qn(x) = 2 Cj x{ rf_ x . 

j+2k=d 

Furthermore Q„ is harmonic. From this you can show that Q% is uniquely determined, 
by showing the existence of recursive relations among the coefficients Cj. Thus the 
submodule M is uniquely determined, and must be all of Har (n, d). 


Irreducibility of sl w (/ r ). 

36. Let F be a field of characteristic 0. Let g = s!„(F) be the vector space of matrices 
with trace 0, with its Lie algebra structure [ X, Y ] = XY - YX. Let Ey be the matrix 
having (/, y)-component 1 and all other components 0. Let G = SL n (F). Let A be 
the multiplicative group of diagonal matrices over F. 

(a) Let Hi = Eu — E i+ \ t i + \ for i=\,...,n-\. Show that the elements Ey 
(i # j), H\,..., H n -1 form a basis of g over F. 

(b) For g e G let c(g) be the conjugation action on g, that is c (g)X = gXg~ l . 
Show that each Ey is an eigenvector for this action restricted to the group A. 

(c) Show that the conjugation representation of G on g is irreducible, that is, if 
V # 0 is a subspace of g which is c(G)-stable, then V = g. Hint: Look up 
the sketch of the proof in [JoL 01], Chapter VII, Theorem 1.5, and put in all 
the details. Note that for i # j the matrix Ey is mlpotent, so for variable t, 
the exponential series exp (tEy) is actually a polynomial. The derivative with 
respect to t can be taken in the formal power series F[[t]}, not using limits. If 
X is a matrix, and x(t) = exp(rY), show that 

= XY - YX = [X, Y]. 

t =o 


j [X (t)Yx(,r' 



CHAPTER XIV 

Representation of One 
Endomorphism 


We deal here with one endomorphism of a module, actually a free module, 
and especially a finite dimensional vector space over a field k. We obtain the 
Jordan canonical form for a representing matrix, which has a particularly simple 
shape when k is algebraically closed. This leads to a discussion of eigenvalues 
and the characteristic polynomial. The main theorem can be viewed as giving 
an example for the general structure theorem of modules over a principal ring. 
In the present case, the principal ring is the polynomial ring k[X] in one variable. 


§1. REPRESENTATIONS 

Let k be a commutative ring and E a module over k. As usual, we denote by 
End k (E) the ring of /c-endomorphisms of £, i.e. the ring of fc-linear maps of E into 
itself. 

Let R be a k -algebra (given by a ring-homomorphism k -► R which allows 
us to consider R as a fc-module). By a representation of R in E one means a k- 
algebra homomorphism R -> End k (£), that is a ring-homomorphism 

p.R^ End k (E) 

which makes the following diagram commutative: 

R ->End k (E) 


k 
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[As usual, we view End k (E) as a /c-algebra; if / denotes the identity map of £, 
we have the homomorphism of k into End k (E) given by a t—► al. We shall also 
use / to denote the unit matrix if bases have been chosen. The context will 
always make our meaning clear.] 

We shall meet several examples of representations in the sequel, with various 
types of rings (both commutative and non-commutative). In this chapter, the 
rings will be commutative. 

We observe that E may be viewed as an End k (E) module. Hence E may be 
viewed as an K-module, defining the operation of R on E by letting 

(x, v) i ► p(x)v 

for x 6 R and vsE. We usually write xv instead of p(x)v. 

A subgroup F of E such that RF c= F will be said to be an invariant sub- 
module of E. (It is both /?-invariant and /:-invariant.) We also say that it is 
invariant under the representation. 

We say that the representation is irreducible, or simple, if E / 0, and if the 
only invariant submodules are 0 and E itself. 

The purpose of representation theories is to determine the structure of all 
representations of various interesting rings, and to classify their irreducible 
representations. In most cases, we take k to be a field, which may or may not 
be algebraically closed. The difficulties in proving theorems about representa- 
tions may therefore lie in the complication of the ring R , or the complication of 
the field /c, or the complication of the module £, or all three. 

A representation p as above is said to be completely reducible or semi-simple 
if E is an R-direct sum of /^-submodules E i9 

E = E x ® • • • ® E m 

such that each E t is irreducible. We also say that E is completely reducible. 
It is not true that all representations are completely reducible, and in fact those 
considered in this chapter will not be in general. Certain types of completely 
reducible representations will be studied later. 

There is a special type of representation which will occur very frequently. 
Let ve E and assume that E = Rv. We shall also write E = (v). We then say 
that E is principal (over R\ and that the representation is principal. If that is 
the case, the set of elements x e R such that xv = 0 is a left ideal a of R (obvious). 
The map of R onto E given by 


xi —>xv 

induces an isomorphism of R-modules, 

R/a -► E 


(viewing R as a left module over itself, and R/a as the factor module). In this 
map, the unit element 1 of R corresponds to the generator v of E. 
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As a matter of notation, if v u ..., v n e £, we let (v u ..., v n ) denote the sub- 
module of E generated by v u ... ,v n . 

Assume that E has a decomposition into a direct sum of R-submodules 

E = E x © • • • © E s . 

Assume that each E t is free and of dimension ^ 1 over k. Let ®j,... , ® s be 
bases for E x ,.. ., E s respectively over k. Then { (B j,... , ® J is a basis for E. 
L et <p e R, and let <p t be the endomorphism induced by <p on E r Let Aí f be the 
matrix of <Pi with respect to the basis ® r Then the matrix M of <p with respect 
to {® j, • • • , ® 5 } looks like 



A matrix of this type is said to be decomposed into blocks, M u ... M s . When 
we have such a decomposition, the study of cp or its matrix is completely reduced 
(so to speak) to the study of the blocks. 

It does not always happen that we have such a reduction, but frequently 
something almost as good happens. Let E' be a submodule of E , invariant 
under R. Assume that there exists a basis of E' over k, say {v u ..., v m }, and that 
this basis can be completed to a basis of £, 

{l^l, ..., v m9 v m + 1 , • • •, v n }. 

This is always the case if k is a field. 

Let <pe R. Then the matrix of <p with respect to this basis has the form 



Indeed, since E' is mapped into itself by cp , it is clear that we get M' in the upper 
left, and a zero matrix below it. Furthermore, for each j = m + 1,..., n we can 
write 

<pV Cj\V\ + . . . + Vj m V m + Cj m -f. i V m -f- i + . . . + Cj n V n . 

The transpose of the matrix (c Jt ) then becomes the matrix 

♦ 

M" 


occurring on the right in the matrix representing <p. 
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Furthermore, consider an exact sequence 
0 - E - E - E" - 0. 

Let v m+ j,..., v n be the images of i? OT+ ì9 ..., v„ under the canonical map E -► E". 
We can define a linear map 

(/?" : E" -► E" 

in a natural way so that ( cpi ) = cp"{v) for all veE. Then it is clear that the 
matrix of q>" with respect to the basis {v u .. ., v n } is M". 


§2. DECOMPOSITION OVER ONE 
ENDOMORPHISM 

Let k be a field and E a finite-dimensional vector space over /c, E # 0. Let 
A e End k (E) be a linear map of E into itself. Let t be transcendental over k. We 
shall define a representation of the polynomial ring fc[r] in E. Namely, we have 
a homomorphism 

k[t] -► k[A~] c= End k (E) 

which is obtained by substituting A for t in polynomials. The ring k[A] is the 
subring of End*(£) generated by A, and is commutative because powers of A 
commute with each other. Thus if f(t) is a polynomial and v e E, then 

f(t)v = f(A)v. 

The kernel of the homomorphism f(t) \-+f(A) is a principal ideal of fc[í], 
which is 0 because k[A~\ is finite dimensional over k. It is generated by a 
unique polynomial of degree > 0, having leading coefficient 1. This polynomial 
will be called the minimal polynomial of A over k, and will be denoted by q A (t). 
It is of course not necessarily irreducible. 

Assume that there exists an element ve E such that E = k[t]v = k[A]v. 
This means that E is generated over k by the elements 

v , Av 9 A 2 v ,.... 

We called such a module principal, and if R = k[t~\ we may write E = Rv = (v). 
If q A (t) = t d + a d _ x t d ~ 1 + • • • -F a 0 then the elements 

v , Av ,..., A d ~ l v 

constitute a basis for E over k. This is proved in the same way as the analogous 
statement for finite field extensions. First we note that they are linearly inde 
pendent, because any relation of linear dependence over k would yield a poly- 
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nomial g(t) of degree less than deg q A and such that g(A) = 0. Second, they 
generate E because any polynomial f(t) can be written f(t) = g(t)q A (t) + r(t) 
with deg r < deg q A . Hence f(A) = r(A). 

With respect to this basis, it is clear that the matrix of A is of the following 
type: 



/° 

0 

0 

... 0 

— a 0 



f i 

0 

0 

... 0 

-<*i 



0 

1 

0 

... 0 

~a 2 



0 

0 

0 

... 0 

— a d- 2 



0 

l 

0 

0 

... 1 

~ ai - 1 ì 


If E = (v) is principal, then E is 

isomorphic to k[t]/(q A (t)) under the map 

f(t)ï-+f(A)v. The polynomial q A is 

uniquely determined by A, and does not 


depend on the choice of generator v for E. This is essentially obvious, because 
if f\ , f 2 are two polynomials with leading coefficient 1, then k[t]/(fi(0) is iso- 
morphic to k[f]Kf 2 (t)) if and only if f x =f 2 . (Decompose each polynomial into 
prime powers and apply the structure theorem for modules over principal rings.) 

If E is principal then we shall call the polynomial q A above the polynomial 
invariant of £, with respect to A, or simply its invariant. 


Theorem 2.1. Let Ebe a non-zero finite-dimensional space over the field /c, 
and let A e End k (E). Then E admits a direct sum decomposition 

E = E\ © • • • © £ r , 

where each E t is a principal k[A]-submodule 9 xvith invariant q { / 0 such that 

The sequence (q l9 ..., q r ) is uniquely determined by E and A , and q r is the 
minimal polynomial of A. 

Proof The first statement is simply a rephrasing in the present language 
for the structure theorem for modules over principal rings. Furthermore, it is 
clear that q r (A) = 0 since q^q,. for each i. No polynomial of lower degree than 
q r can annihilate E, because in particular, such a polynomial does not annihilate 
E r . Thus q r is the minimal polynomial. 

We shall call (q l9 ..., q r ) the invariants of the pair (E, +). Let E = k {n \ and 
let A be an n x n matrix, which we view as a linear map of E into itself. The 
invariants (q l9 .-.,q r ) will be called the invaríants of + (over k). 


CoroIIary 2.2. Let k' be an extension field of k and let A beann x n matrix 
in k. The invariants of A over k are the same as its invariants over k'. 
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Proof Let {v l9 ... 9 v n } be a basis of k {n) over k. Then we may view it also 
as a basis of k ,{n) over k\ (The unit vectors are in the fc-space generated by 
v l9 ..., v n ; hence v í9 ... , v n generate the n-dimensional space k ,{n) over k'.) Let 
E = k {n) . Let L a be the linear map of E determined by A. Let L' A be the linear 
map of k ,(n) determined by A. The matrix of L Á with respect to our given basis is 
the same as the matrix of L' Á . We can select the basis corresponding to the 
decomposition 

E = E^ © • • • 0 E r 

determined by the invariants q l9 ..., q r . It follows that the invariants don’t 
change when we lift the basis to one of k ,{n) . 

CoroIIary 2.3. Let A, B be n x n matrices over a field k and let k! be an 
extension field ofk. Assume that there is an invertible matrix C in k ' such that 
B = CAC~ X . ThenthereisaninvertiblematrixC inksuchthat B = C/IC -1 . 

Proof Exercise. 

The structure theorem for modules over principal rings gives us two kinds 
of decompositions. One is according to the invariants of the preceding theorem. 
The other is according to prime powers. 

Let E / 0 be a finite dimensional space over the field k, and let A : E -► E 
beinEnd k (E). Letg = ^beitsminimalpolynomial. Then q has a factorization, 

q = pV ■pî’ (e,èi) 

into prime powers (distinct). Hence E is a direct sum of submodules 


E = E(pj)® •••© £(p s ), 


such that each E(p t ) is annihilated by pf‘. Furthermore, each such submodule 
can be expressed as a direct sum of submodules isomorphic to k[f]/(p e ) for 
some irreducible polynomial p and some integer e ^ 1. 

Theorem2.4. Let q A (t) = (t — ct) e for some ae/c, e ^ 1. Assume that E 
is isomorphic to k[t]/(q). Then E has a basis over k such that the matrix of A 
relative to this basis is of type 
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Proof. Since E is isomorphic to there exists an element veE 

such that k[t]v = E. This element corresponds to the unit element of k[t] in the 
isomorphism 


k[t]/(q) - E. 


We contend that the elements 


v 9 (t - ot)v 9 ... 9 (t - ot) e l v , 

or equivalently, 

v 9 (A — ol)v 9 .. ., (A — ot) e ~ l v, 

îorm a basis for E over k. They are linearly independent over k because any 
relation of linear dependence would yield a relation of linear dependence between 

v, Av ,..., A e ~ ^Vy 

and hence would yield a polynomial g(t) of degree less than deg q such that 
g(A) = 0. Since dim E = e 9 it follows that our elements form a basis for E 
over k. But (A - ct) e = 0. It is then clear from the definitions that the matrix of 
A with respect to this basis has the shape stated in our theorem. 

Corollary 2.5. Let k be algebraically closed , and let Ebeafinite-dimensional 
non-zero vector space over k. Let A e End fc (£). Then there exists a basis of 
E over k such that the matrix of A mth respect to this basis consists ofblocks 9 
and each b!ock is of the type described in the theorem. 

A matrix having the form described in the preceding corollary is said to be in 

Jordan canonical form. 

Remark 1. A matrix (or an endomorphism) N is said to be nilpotent if 
there exists an integer d > 0 such that N d = 0. We see that in the decomposition 
of Theorem 2.4 or Corollary 2.5, the matrix M is written in the form 

M = B + N 

where N is nilpotent. In fact, N is a triangular matrix (i.e. it has zero coefficients 
on and above the diagonal), and B is a diagonal matrix, whose diagonal elements 
are the roots of the minimal polynomial. Such a decomposition can always be 
achieved whenever the field k is such that all the roots of the minimal polynomial 
lie in k. We observe also that the only case when the matrix N is 0 is when all 
the roots of the minimal polynomial have multiplicity 1. In this case, if 
n = dim £, then the matrix M is a diagonal matrix, with n distinct elements on 
the diagonal. 
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Remark 2. The main theorem of this section can also be viewed as falling 
under the general pattern of decomposing a module into a direct sum as far as 
possible, and also giving normalized bases for vector spaces with respect to 
various structures, so that one can tell in a simple way the effect of an endo- 
morphism. More formally, consider the category of pairs ( E,A ), consisting 
of a finite dimensional vector space E over a field k , and an endomorphism 
A : E E. By a morphism of such pairs 

/: (E, A) -> (E\ A) 


we mean a /:-homomorphism /: E —> E’ such that the following diagram is 
commutative: 



It is then immediate that such pairs form a category, so we have the notion of 
isomorphism. One can reformulate Theorem 2.1 by stating: 


Theorem 2.6. Two pairs (E , A) and (.F , B) are isomorphic if and only if they 
have the same invariants. 

You can prove this as Exercise 19. The Jordan basis gives a normalized form 
for the matrix associated with such a pair and an appropriate basis. 

In the next chapter, we shall find conditions under which a normalized matrix 
is actually diagonal, for hermitian, symmetric, and unitary operators over the 
complex numbers. 

As an example and application of Theorem 2.6, we prove: 


CoroIIary 2.7. Let kbe a field and let K be a fìnite separable extension of 
degree n. Let V be a finite dimensional vector space of dimension n over k, and 
let p, p': —> End*(V) be two representations of K on V; that is, embeddings 
of K in End*(V). Then p, p’ are conjugate; that is, there exists B e Aut*(V) 
such that 

pU) = Bpi&B-'for allÇeK. 

Proof. By the primitive element theorem of field theory, there exists an 
element a e K such that K = k[a]. Let p(t) be the irreducible polynomial of a 
over k. Then (V, p(a)) and (V, p’(a)) have the same invariant, namely p(t). 
Hence these pairs are isomorphic by Theorem 2.6, which means that there exists 
B e Aut*(V) such that 

p’(a) = Bp(a)B~ l . 

But all elements of K are linear combinations of powers of a with coefficients 
in k , so it follows immediately that p'(Ç) = Bp(Ç)B~ l for all £ e K, as desired. 
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To get a representation of K as in corollary 2.7, one may of course select a 
basis of K , and represent multiplication of elements of K on K by matrices with 
respect to this basis. In some sense, Corollary 2.7 tells us that this is the only 
way to get such representations. We shall return to this point of view when 
considering Cartan subgroups of GL n in Chapter XVIII, §12. 


§3. THE CHARACTERISTIC POLYNOMIAL 

Let k be a commutative ring and E a free module of dimension n over k. 
We consider the polynomial ring fc[t], and a linear map A\E -► E. We have a 
homomorphism 

k[t] - k[A ] 

as before, mapping a polynomial /(í) on f(A ), and E becomes a module over 
the ring R = k[t]. Let M be any n x n matrix in k (for instance the matrix of A 
relative to a basis of E). We define the characteristic polynomial P M (t) to be the 
determinant 

det(í/„ - M) 

where /„ is the unit n x n matrix. It is an element of k[t]. Furthermore, if N 
is an invertible matrix in R, then 

det(í/„ - N~ l MN) = det(ATHíJ, - M)N) = det(í/„ - M). 

Hence the characteristic polynomial of N~ l MN is the same as that of M. We 
may therefore define the characteristic polynomial of A, and denote by P A , the 
characteristic polynomial of any matrix M associated with A with respect to 
some basis. (If E = 0, we defìne the characteristic polynomial to be 1.) 

If (p: k -► k! is a homomorphism of commutative rings, and M is an n x n 
matrix in k, then it is clear that 

P (pM(t) = <pp m( 0 

where (pP M is obtained from P M by applying cp to the coefficients of P M . 

Theorem 3.1. (Cayley-Hamilton). We have P a (A) = 0. 

Proof. Let {v l9 ..., v n } be a basis of E over k. Then 

n 

tV j= Y. a u v i 

i= 1 

where (a i; ) = M is the matrix of A with respect to the basis. Let B(t) be the 
matrix with coefficients in fe[r], defined in Chapter XIII, such that 

B(t)B(t) = P A (t)I n . 
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Then 


because 


/ v i\ 

B(í)fl(r)l ! 




Hence P A (t)E = 0, and therefore P a (A)E = 0. This means that P A (Á) = 0, 
as was to be shown. 


Assume now that k is a field. Let E be a finite-dimensional vector space over 
/c, and let A e End k (E). By an eigenvector w of A in E one means an element 
we£, such that there existsan element X e k for which Aw = Xw. If vv / 0, then 
X is determined uniquely, and is called an eigenvalue of A. Of course, distinct 
eigenvectors may have the same eigenvalue. 


Theorem 3.2. The eigenvalues of A are precisely the roots of the character- 
istic polynomial of A. 

Proof. Let X be an eigenvalue. Then A — XI is not invertible in End k (E\ 
and hence det(,4 — XI) = 0. Hence X is a root of P A . The arguments are re- 
versible, so we also get the converse. 

For simplicity of notation, we often write A — X instead of A — XI. 


Theorem 3.3. Let w u . .., w m he non-zero eigenvectors of A, having distinct 
eigenvalues. Then they are linearly independent. 

Proof Suppose that we have 

+ a m w m = 0 

with a t e /c, and let this be a shortest relation with not all a, = 0 (assuming such 
exists). Then a x # 0 for all i. Let X u ..., X m be the eigenvalues of our vectors. 
Apply A — X x to the above relation. We get 


a 2 (A 2 - Aj)w 2 + • • • + a m (À m - = 0, 

which shortens our relation, contradiction. 

Corollary 3.4. If A has n distinct eigenvalues X u ..., X„ belonging to eigen- 
vectors v í9 ..., v n9 and dim E = n, then {v í9 ..., v n } is a basisfor E. The matrix 
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of A with respect to this basis is the diagonal matrix : 



VVarning. It is not always true that there exists a basis of E consisting of 
eigenvectors! 

Remark. Let k be a subfield of k'. If M is a matrix in k, we can define its 
characteristic polynomial with respect to /c, and also with respect to /c'. It is 
clear that the characteristic polynomials thus obtained are equal. If E is a vector 
space over /c, we shall see later how to extend it to a vector space over k'. A 
linear map A extends to a linear map of the extended space, and the character- 
istic polynomial of the linear map does not change either. Actually, if we select 
a basis for E over /c, then E % k (n \ and k {n) c= k' {n) in a natural way. Thus selecting 
a basis allows us to extend the vector space, but this seems to depend on the 
choice of basis. We shall give an invariant definition later. 

Let E = ® • • • ® E r be an expression of E as a direct sum of vector 

spaces over /c. Let A e End k (E\ and assume that AE t c= for all i = 1,..., r. 
Then A induces a linear map on We can select a basis for E consisting of 
bases for E u ..., E r , and then the matrix for A consists of blocks. Hence we see 
that 

P A (t) = fi^xo. 

i — 1 

Thus the characteristic polynomial is multiplicative on direct sums. 

Our condition above that AE { c= E t can also be formulated by saying that 
E is expressed as a /c[^]-direct sum of /c[/l]-submodules, or also a /c[í]-direct 
sum of /c[í]-submodules. We shall apply this to the decomposition of E given 
in Theorem 2.1. 

Theorem 3.5. Let E be a finite-dimensional vector space over a field /c, let 

A 6 End k (£), and let q í9 ..., q r be the invariants of (£, A). Then 


Pa (0 = <h(t) - - - q r (t). 


Proof We assume that E = k {n) and that A is represented by a matrix M. 
We have seen that the invariants do not change when we extend k to a larger 
field, and neither does the characteristic polynomial. Hence we may assume that 
k is algebraically closed. In view of Theorem 2.1 we may assume that M has a 
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single invariant q. Write 

q(t) = (í - aj* 1 • • • (í - ct s ) e * 

with distinct ct l9 ... , a s . We view M as a linear map, and split out vector space 
further into a direct sum of submodules (over /c[í]) having invariants 

(t - aO' 1 ,..., (í - a s ) es 

respectively (this is the prime power decomposition). For each one of these 
submodules, we can select a basis so that the matrix of the induced linear map has 
the shape described in Theorem 2.4. From this it is immediately clear that the 
characteristic polynomial of the map having invariant (í — oì) e is precisely 
(í — a) e , and our theorem is proved. 


Corollary 3.6. The minimal polynomial of A and its characteristic poly- 
nomial have the same irreducihle factors. 

Proof Because q r is the minimal polynomial, by Theorem 2.1. 

We shall generalize our remark concerning the multiplicativity of the 
characteristic polynomial over direct sums. 

Theorem 3.7. Let khe a commutative ring , and in the folloxving diagram , 


0 

0 


E' 

i' 

E' 



0 

0 


let the rows be exact sequences of free modules over k, of finite dimension , and 
let the vertical maps be k-linear maps making the diagram commutative. Then 

P A (t) = P A (t)PAt). 

Proof. We may assume that E' is a submodule of E. We select a basis 
{v l9 ..., v m } for E'. Let {v m+19 ... , v} be a basis for E", and let v m+l9 ..., v n 
be elements of E mapping on v m+19 .. ., v n respectively. Then 

{v l ,...,v m ,v m+1 ,...,v n } 

is a basis for E (same proof as Theorem 5.2 of Chapter III), and we are in the 
situation discussed in §1. The matrix for A has the shape 
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where M' is the matrix for A’ and M" is the matrix for A". Taking the character- 
istic polynomial with respect to this matrix obviously yields our multiplicative 
property. 

Theorem 3.8. Let kbe a commutative ring , and E afree module of dimension 
n over k. Let A e End k (E). Let 

P A (t) = f + c n - l t”- 1 + ... + c 0 . 

Then 

tr(/l) = and det(/l) = (-l)”c 0 . 

Proof For the determinant, we observe that P A ( 0) = c 0 . Substituting 
t = 0 in the definition of the characteristic polynomial by the determinant shows 
thatco = (-l)"det(>4). 

For the trace, let M be the matrix representing A with respect to some basis, 
M = (, Qìj ). We consider the determinant det (tl n - a^. In its expansion as a sum 
over permutations, it will contain a diagonal term 

(í - fln) ••• (í - o, 

which will give a contribution to the coefficient of t n ~ 1 equal to 

— (fln + • • • + a nn ). 

No other term in this expansion will give a contribution to the coefficient of 
í"” 1 , because the power of t occurring in another term will be at most t n ~ 2 . 
This proves our assertion concerning the trace. 

Corollary 3.9. Let the notation be as in Theorem 3.7. Then 

tr(y4) = tr(y4') -f tr(y4") and det(/l) = det(^4') det(,4"). 

Proof Clear. 

We shall now interpret our results in the Euler-Grothendieck group. 

Let k be a commutative ring. We consider the category whose objects are 
pairs ( E , A), where E is a /c-module, and A e End k (E). We define a morphism 

(F, A) - (£, A) 

to be a fc-linear map E’ ^ E making the following diagram commutative: 
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Then we can define the kernel of such a morphism to be again a pair. Indeed, 
let E' 0 be the kernel of/: E' -> E. Then A' maps E' 0 into itself because 

fA'E' 0 = AfE' 0 = 0. 


We let A' 0 be the restriction of A' on E' 0 . The pair (E' 0 , A' 0 ) is defined to be the 
kernel of our morphism. 

We shall denote by/again the morphism of the pair (E', A') -> (E, A). We 
can speak of an exact sequence 

(F, A') (E, A) (E", A"), 

meaning that the induced sequence 

e" 

is exact. We also write 0 instead of (0, 0), according to our universal convention 
to use the symbol 0 for all things which behave like a zero element. 

We observe that our pairs now behave formally like modules, and they in 
fact form an abelian category. 

Assume that k is a field. Let d consist of all pairs (E, A) where E is finite 
dimensional over k. 

Then Theorem 3.7 asserts that the characteristic polynomial is an Euler- 
Poincaré map defined for each object in our category Q, with values into the 
multiplicative monoid of polynomials xvith leading coefficient 1. 

Since the values of the map are in a monoid, this generalizes slightly the notion 
of Chapter III, §8, when we took the values in a group. Of course when k is a 
field, which is the most frequent application, we can view the values of our map 
to be in the multiplicative group of non-zero rational functions, so our previous 
situation applies. 

A similar remark holds now for the trace and the determinant. If k is a 
field, the trace is an Euier map into the additive group of the field, and the deter- 
minant is an Euler map into the multiplicative group of the field. We note also that 
all these maps (like all Euler maps) are defined on the isomorphism classes of 
pairs, and are defined on the Euler-Grothendieck group. 

Theorem 3.10. Let kbe a commutative ring, M an n x n matrix in k, andf 
a polynomial in k[t]. Assume that P M (t) has a factorization, 

P M (t) = fl (í - a,) 

i = 1 

into linear factors over k. Then the characteristic polynomial of f(M) is 
given by 

P/m(t) = fï(í ~/(«;)), 

i = 1 
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and 


tr(/(M)) = X /(«,), det(/(M)) = [] /(«,)• 

i= 1 i= 1 

Proof. Assume first that k is a field. Then using the canonical decomposi- 
tion in terms of matrices given in Theorem 2.4, we find that our assertion is 
immediately obvious. When k is a ring, we use a substitution argument. It is 
however necessary to know that if X = (* /y ) is a matrix with algebraically 
independent coefficients over Z, then P x (t ) has n distinct roots y^, ..., y n [in 
an algebraic closure of Q(X)] and that we have a homomorphism 

Z[x l7 ,y yj -► k 

mapping X onM and y ^,..., y n on a x ,..., a„ . This is obvious to the reader who 
read the chapter on integral ring extensions, and the reader who has not can 
forget about this part of the theorem. 


EXERCISES 


1. Let T be an upper triangular square matrix over a commutative ring (i.e. all the ele- 
ments belovv and on the diagonal are 0). Show that T is nilpotent. 

2. Carry out explicitly the proof that the determinant of a matrix 



where each M, is a square matrix, is equal to the product of the determinants of the 
matrices M u , M s . 

3. Let k be a commutative ring, and let M, M' be square n x n matrices in k. Show that 
the characteristic polynomials of MM' and M'M are equal. 

4. Show that the eigenvalues of the matrix 


( 0 1 0 0 \ 
0 0 1 Ol 

0 0 0 1 I 
1 0 0 0 / 


in the complex numbers are + 1, + i. 
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5. Let M, M' be square matrices over a field k. Let q, q ' be their respective minimal 
polynomials. Show that the minimal polynomial of 

/M 0 \ 

\0 M'j 

is the least common multiple of q, q'. 

6. Let A be a nilpotent endomorphism of a fìnite dimensional vector space E over the field 
k. Show that tr(^4) = 0. 

7. Let R be a principal entire ring. LqíE be a free module over R, and let/s v = Hom^fE, R) 
be its dual module. Then £ v is free of dimension n. Let F be a submodule of E. 
Show that £ v /F 1 can be viewed as a submodule of F v , and that its invariants are 
the same as the invariants of F in E. 

8. Let E be a finite-dimensional vector space over a field k. Let A e Aut k (E). Show that 
the following conditions are equivalent: 

(a) A = I + N, with N nilpotent. 

(b) There exists a basis of E such that the matrix of A with respect to this basis has 
all its diagonal elements equal to 1 and all elements above the diagonal equal 
to 0. 

(c) All roots of the characteristic polynomial of A (in the algebraic closure of k) 
are equal to 1. 

9. Let k be a field of characteristic 0, and let M be an n x n matrix in k. Show that M is 
nilpotent if and only if tr(M v ) = 0 for 1 ^ v ^ n. 

10. Generalize Theorem 3.10 to rational functions (instead of polynomials), assuming 
that k is a field. 

11. Let E be a fìnite-dimensional space over the field k. Let a e k. Let E a be the subspace 
of E generated by all eigenvectors of a given endomorphism A of E, having a as an 
eigenvalue. Show that every non-zero element of E a is an eigenvector of A having a as 
an eigenvalue. 

12. Let E be finite dimensional over the field k. Let A e End k (E). Let v be an eigenvector 
for A. Let B e End k (E) be such that AB = BA. Show that Bv is also an eigenvector 
for A (if Bv 0), with the same eigenvalue. 

Diagonalizable endomorphisms 

Let E be a finite-dimensional vector space over a field /c, and let S e End k (E). We say 
that S is diagonalizable if there exists a basis of E consisting of eigenvectors of S. The 
matrix of S with respect to this basis is then a diagonal matrix. 

13. (a) If S is diagonalizable, then its minimal polynomial over k is of type 

m = fî o - 

i = 1 

where ..., X m are distinct elements of k. 

(b) Conversely, if the minimal polynomial of S is of the preceding type, then S is 
diagonalizable. [Hint : The space can be decomposed as a direct sum of the 
subspaces E Xi annihilated by S — A, .] 
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(c) If S is diagonalizable, and if F is a subspace of F such that SF c= F, show that S 
is diagonalizable as an endomorphism of F, ì.e. that F has a basis consisting of 
eigenvectors of S. 

(d) Let 5, T be endomorphisms of F, and assume that S, T commute. Assume that 
both 5, T are diagonalizable. Show that they are simultaneously diagonalizable, 
i.e. there exists a basis of E consisting of eigenvectors for both S and T. [ Hint: 
If X is an eigenvalue of S, and E x is the subspace of E consisting of all vectors v 
such that Sv = Xv , then TE x cz E x .~\ 

14. Let E be a finite-dimensional vector space over an algebraically closed field k. Let 
A e End k (E). Show that A can be written ìn a unique way as a sum 

A = S + N 

where S is diagonalizable, N ìs mlpotent, and SN = NS. Show that S, N can be ex- 
pressed as polynomials in A. [ Hint : Let P^r) = (í — A,) m ' t> e the factorization 
of P A (t) with distinct A f . Let F, be the kernel of (A — X) mi . Then F is the direct sum of 
the F,. Define S on F so that on F,, St? = X t v for all v e F f . Let N = /I - S. Show 
that S, N satisfy our requirements. To get S as a polynomial in /4, let g be a polynomial 
such that gf(r) = A, mod (r - A,) m ' for all i, and ^(r) = 0 mod r. Then S = g(A) 
and N = A - g(Á).\ 

15. After you have read the section on the tensor product of vector spaces, you can easily 
do the following exercise. Let F, F be finite-dimensional vector spaces over an alge- 
braically closed field /c, and let A : F -► F and B : F -+ F be fc-endomorphisms of F, F, 
respectively. Let 

^(0 = n 0 and P B (r) = n (í - ftr 

be the factorizations of their respectively characteristic polynomials, into distinct 
linear factors. Then 

pauo = n <í - 

ij 

[Hint \ Decompose F into the direct sum of subspaces F,, where F, is the subspace of 
F annihilated by some power of A — a, . Do the same for F, getting a decomposition 
into a direct sum of subspaces F } . Then show that some power of A ® B — (Xipj 
annihilates E t <g) F } . Use the fact that F (x) F is the direct sum of the subspaces F, ® F^, 
and that dim*(F, <g) F^) = njmj.] 

16. Let T be a free abelian group of dimension n ^ 1. Let F be a subgroup of dimension n 
also. Let {v ^..., v n } be a basis of T, and let {\Vj,..., w„} be a basis of F. Write 

Show that the index (T: F) is equal to the absolute value of the determinant of the 
matrix (a, v ). 

17. Prove the normal basis theorem for fìnite extensions of a finite field. 

18. LetA = (a^) be a square n x n matrix over a commutative ring/c. Let/I, 7 bethematrix 
obtained by deleting the í-th row and ;-th column from A. Let = ( - Ì) l+J áet(Aj), 
and let B be the matrix (ò 0 ). Show that det(P) = det(/4)"" 1 , by reducing the problem to 
the case when A is a matrix with variable coeíficients over the integers. Use this same 
method to give an alternative proof of the Cayley-Hamilton theorem, that P a (A) = 0. 
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19. Let (£, A) and (£', A') be pairs consisting of a finite-dimensional vector space over a 
field k, and a /c-endomorphism. Show that these pairs are isomorphic if and only if 
their invariants are equal. 

20. (a) How many non-conjugate elements of GL 2 (C) are there with characteristic poly- 
nomial t 3 (t -I- 1 ) 2 (t — 1)? 

(b) How many with characteristic polynomial t 3 — 1001 /? 

21. Let V be a finite dimensional vector space over Q and let A : V — > V be a Q-linear 
map such that A 5 = Id. Assume that if v E V is such that Av = v, then v = 0. Prove 
that dim V is divisible by 4. 

22. Let V be a finite dimensional vector space over R, and let A: V —> V be an R-linear 
map such that A 2 = -Id. Show that dim V is even, and that V is a direct sum of 2- 
dimensional A-invariant subspaces. 

23. Let £ be a fìnite-dimensional vector space over an algebraically closed field /c. Let 
/4, B be /c-endomorphisms of £ which commute, i.e. AB = BA. Show that A and B have 
a common eigenvector. [Hint : Consider a subspace consisting of all vectors having 
a fixed element of k as eigenvalue.] 

24. Let V be a finite dimensional vector space over a field k. Let A be an endomorphism 
of V. Let Tr (A m ) be the trace of A m as an endomorphism of V. Show that the following 
power series in the variable t are equal: 

exp( X ~Tr(A m )— ) = det(/ - tA) or log det(/ - tA) = X Tr (A m )t m . 

\m= i m / at m =i 

Compare with Exercise 23 of Chapter XVIII. 

25. Let V, W be finite dimensional vector spaces over k, of dimension n. Let ( v, w) l-> 

( v , vv) be a non-singular bilinear form on V x W. Let c G k, and let A: V —> V and 
V: W W be endomorphisms such that 


Show that 
and 


(Av, Bw) = c(v, w) for all v E V and w e W. 

det(A)det(í/ - B) = (-l) w det(c/ - M) 


det(A)det(£) = c”. 

For an application of Exercises 24 and 25 to a context of topology or algebraic 
geometry, see Hartshorne’s Algebraic Geometry, Appendix C, §4. 

26. Let G = SL n ( C) and let K be the complex unitary group. Let A be the group of di- 
agonal matrices with positive real components on the diagonal. 

(a) Show that if g e Nor^(A) (normalizer of A in G), then c (g) (conjugation by 
g) permutes the diagonal components of A, thus giving rise to a homo- 
morphism Norc(A) —> W to the group W of permutations of the diagonal 
coordinates. 

By definition, the kemel of the above homomorphism is the centralizer Cen G (A). 

(b) Show that actually all permutations of the coordinates can be achieved by 
elements of K , so we get an isomorphism 

W « Nor G (A)/Cen G (A) « Nor*(A)/Cen*(/1). 

In fact, the K on the right can be taken to be the real unitary group, because 
permutation matrices can be taken to have real components (0 or ±1). 



CHAPTER XV 

Structure of Bilinear Forms 


There are three major types of bilinear forms: hermitian (or symmetric), 
unitary, and alternating (skew-symmetric). In this chapter, we give structure 
theorems giving normalized expressions for these forms with respect to suitable 
bases. The chapter also follows the standard pattern of decomposing an object 
into a direct sum of simple objects, insofar as possible. 


§1. PRELIMINARIES, ORTHOGONAL SUMS 

The purpose of this chapter is to go somewhat deeper into the structure 
theory for our three types of forms. To do this we shall assume most of the time 
that our ground ring is a field, and in fact a field of characteristic # 2 in the 
symmetric case. 

We recall our three definitions. Let E be a module over a commutative 
ring R. Let g : E x E -► R be a map. If g is bilinear, we call g asymmetric form 
if g(x, y) = g(y\ x ) for all x, y e E. We call g alternating if g(x, x) = 0, and hence 
g(x, y) = —g()\x) for all x, yeE. If R has an automorphism of order 2, 
written a »-> â, we say that g is a hermitian form if it is linear in its first variable, 
antilinear in its second, and 

g(x, y) = g(y , x). 

We shall write g(x, y) = <x, y) if the reference to g is clear. We also oc- 
casionally write g(x, y) = x • y or g(x, x) = x 2 . We sometimes call g a scalar 
product. 
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£, we denote by (v u ..., v m ) the submodule of E generated by 

v i,...,v m • 

Let g be symmetric, alternating, or hermitian. Then it is clear that the left 
kernel of g is equal to its right kernel, and it will simply be called the kernel of g. 

In any one of these cases, we say that g is non-degenerate if its kernel is 0. 
Assume that E is finite dimensional over the field k. The form is non-degenerate 
if and only if it is non-singular, i.e., induces an isomorphism of E with its dual 
space (anti-dual in the case of hermitian forms). 

Except for the few remarks on the anti-linearity made in the previous 
chapter, we don’t use the results of the duality in that chapter. We need only 
the duality over fields, given in Chapter III. Furthermore, we don’t essentially 
meet matrices again, except for the remarks on the pfaffian in §10. 

We introduce one more notation. In the study of forms on vector spaces, 
we shall frequently decompose the vector space into direct sums of orthogonal 
subspaces. If E is a vector space with a form g as above, and F, F' are subspaces, 
we shall write 


E = F 1 F' 

to mean that E is the direct sum of F and F', and that F is orthogonal (or 
perpendicular) to F', in other words, x 1 y (or <x, y) = 0) for all xe F and 
y e F'. We then say that E is the orthogonal sum of F and F'. There will be no 
confusion with the use of the symbol ± when we write F 1 F' to mean simply that 
F is perpendicular to F'. The context always makes our meaning clear. 

Most of this chapter is devoted to giving certain orthogonal decompositions 
of a vector space mth one of our three types of forms, so that each factor in the sum 
is an easily recognizable type. 

In the symmetric and hermitian case, we shall be especially concerned with 
direct sum decompositions into factors which are 1-dimensional. Thus if 
< , > is symmetric or hermitian, we shall say that [v { ,.. ., v n } is an orthogonal 
basis (with respect to the form) if (v t , Vj > = 0 whenever i # j. We see that an 
orthogonal basis gives such a decomposition. If the form is nondegenerate, 
and if {v u ..., v n } is an orthogonal basis, then we see at once that <u,-, vj) ^ 0 
for all i. 

Proposition 1.1. Let Ebe a vector space over the field k , and let gbe a form 
of one of the three above types. Suppose that E is expressed as an orthogonal 
sum , 


E = E x 1 '±E m . 

Then g is non-degenerate on E if and only if it is non-degenerate on each . 
IfEi is the kernel of the restriction of g to £,, then the kernel of g in E is the 
orthogonal sum 


E° = £? 1 •••!££. 
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Proof. Elements v, w of E can be written uniquely 


with v h e Ei. Then 


v = 


I’.. 

; = i 



m 

V ‘W = Y, V i' W i' 


and v • vv = 0 if v t • w t = 0 for each i = 1, ..., m. From this our assertion is 
obvious. 

Observe that if E l9 ..., E m are vector spaces over k, and are forms 

on these spaces respectively, then we can define a form g = g x © • • • © g m on the 
direct sum E = E x © • • • © E m \ namely if v, w are written as above, then we let 

m 

g(v,w)= w,). 

i= 1 

It is then clear that, in fact, we have E = E x 1 • • • 1 E m . We could also write 

g *= g\ 1 ••• -Lffm- 

Proposition 1.2. Let E be a fìnite-dimensional space over the field k , and let 
gbe a form of the preceding type on E. Assume that g is non-degenerate. Let 
F be a subspace of E. The form is non-degenerate on F if and only if 
F + F 1 = E, and also if and only if it is non-degenerate on F 1 . 

Proof. We have (as a trivial consequence of Chapter III, §5) 

dim F -f dim F 1 = dim E = dim (F -f F 1 ) -f dim(F n F 1 ). 

Hence F -f F 1 = E if and only if dim(F n F 1 ) = 0. Our first assertion follows 
at once. Since F, F 1 enter symmetrically in the dimension condition, our second 
assertion also follows. 

Instead of saying that a form is non-degenerate on £, we shall sometimes say, 
by abuse of language, that E is non-degenerate. 

Let E be a finite-dimensional space over the field k, and let g be a form of 
the preceding type. Let E 0 be the kernel of the form. Then we get an induced 
form of the same type 

g 0 : E/E 0 x E/E 0 —► /c, 

because g(x , y) depends only on the coset of x and the coset of y modulo E 0 . 
Furthermore, g Q is non-degenerate since its kernel on both sides is 0. 

Let £, E' be finite-dimensional vector spaces, with forms g, g' as above, 
respectively. A linear map o is said to be metric if 


g'(ox, oy) = g(x, y) 
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or in the dot notation, ox • oy = x • y for all x, y e E. If o is a linear isomorphism, 
and is metric, then we say that o is an isometry. 

Let E, E 0 be as above. Then we have an induced form on the factor space 
E/E 0 . If W is a complementary subspace of £ 0 , in other words, E = E 0 ® W, 
and if we let o: E -> E/E 0 be the canonical map, then o is metric, and induces 
an isometry of W on E/E 0 . This assertion is obvious, and shows that if 

E = E 0 ®W' 

is another direct sum decomposition of £, then W' is isometric to W. We know 
that W % E/E 0 is nondegenerate. Hence our form determines a unique non- 
degenerate form, up to isometry, on complementary subspaces of the kernel. 


§2. QUADRATIC MAPS 

Let R be a commutative ring and let E, F be £-modules. We suppress the 
prefix R- as usual. We recall that a bilinear map /: E x E -► F is said to be 
symmetric if f (x, y) = f (y, x) for all x, y e E. 

We say that F is vvithout 2-torsion if for all y e F such that 2 y = 0 we have 
y = 0. (This holds if 2 is invertible in R.) 

Let/ : E -+ F be a mapping. We shall say that/is quadratic (i.e. £-quadratic) 
if there exists a symmetric bilinear map g: E x E -+ F and a linear map h: E -► F 
such that for all x e E we have 

/(x) = g(x , x) -F h(x). 

Proposition 2.1. Assume that F is mthout 2 -torsion. Let /: E -> F be 
quadratic, expressed as above in terms of a symmetric bilinear map and a 
linear map. Then g, h are uniquely determined byf For all x, ye E we have 

2 g(x, y)=f(x + y)-f(x)-f(y). 

Proof. If we compute f(x + y) —f(x)— f(y), then we obtain 2g(x, y). 
If g ] is symmetric bilinear, h { is linear, and f(x) = gfx,x) -F hfx), then 
2g(x, y) = 2gfx, y). Since F is assumed to be without 2-torsion, it follows that 
g(x, y) = g\(x, y) for all x, y e E, and thus that g is uniquely determined. But 
then h is determined by the relation 

h(x) = f(x) - g(x, x). 

We call g, h the bilinear and linear maps associated with/ 

If/: E -> F is a map, we define 


A f:E x E -> F 
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by 


A/(x, y) = f(x + y) -f(x) -f(y). 


We say that / is homogeneous quadratic if it is quadratic, and if its associated 
linear map is 0. We shall say that F is uniquely divisible by 2 if for each zeF 
there exists a unique ue F such that 2 u = z. (Again this holds if 2 is invertible 
in R.) 


Proposition 2.2. Let f:E->F be a map such that A f is bilinear. Assume 
that F is uniquely divisible by 2. Then the map xy-+f(x) — ^À/(x, x) is 
Z-linear. If f satisfies the condition f (2x) = 4/(x), then f is homogeneous 
quadratic. 

Proof Obvious. 

By a quadratic form on E, one means a homogeneous quadratic map 
/:£-►/?, with values in R. 

In what follows, we are principally concerned with symmetric bilinear 
forms. The quadratic forms play a secondary role. 


§3. SYMMETRIC FORMS, ORTHOGONAL BASES 

Let k be a field of characteristic 2. 

Let E be a vector space over k , with the symmetric form g. We say that g 
is a null form or that E is a null space if <x, y} = 0 for all x, ye E. Since we 
assumed that the characteristic of k is # 2, the condition x 2 = 0 for all x e E 
implies that g is a null form. Indeed, 

4x ■ y = (x + y) 1 - (x - y) 2 . 


Theorem 3.1. Let E be =k 0 and finite dimensional over k. Let g be a sym- 
metric form on E. Then there exists an orthogonal basis. 

Proof. We assume first that g is non-degenerate, and prove our assertion by 
induction in that case. If the dimension n is 1, then our assertion is obvious. 

Assume n > 1. Let ^eEbe such that v\ # 0 (such an element exists since 
g is assumed non-degenerate). Let F = (v x ) be the subspace generated by v^. 
Then F is non-degenerate, and by Proposition 1.2, we have 

E = F + F 1 . 

Furthermore, dim F 1 = n - 1. Let {v 2 ,..., v n } be an orthogonal basis of F 1 . 
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Then {v x ,..., v n } are pairvvise orthogonal. Furthermore, they are linearly 
independent, for if 


a x v x -F •• • -F a n v n = 0 

with a { e k then we take the scalar product with v t to get a t vf = 0 whence a { = 0 
for all /. 

Remark. We have shown in fact that if g is non-degenerate, and v e E is such 
that v 2 # 0 then we can complete v to an orthogonal basis of E. 

Suppose that the form g is degenerate. Let E 0 be its kernel. We can write 
E as a direct sum 


E = E 0 @W 

for some subspace W. The restriction of g to W is non-degenerate; otherwise 
there would be an element of W which is in the kernel of E, and ^ 0. Hence if 
{v u ..., v r } isa basis of E 0 , and {wj,..., w„_ r } is an orthogonal basis of W, then 

{v l9 ...,v r ,w j,..., w„_ r } 

is an orthogonal basis of E, as was to be shown. 

Corollary 3.2. Let {v x ,.. ., v n } be an orthogonal basis of E. Assume that 
vf 0 jor i ^ r and vf = 0 for i > r. Then the kernel of E is equal to 
(v r+l ,..., v n ). 

Proof. Obvious. 

If {v u ..., v n } is an orthogonal basis of E and if we write 
X = x x v x -F ••• -F x n v n 


with x, £ k, then 


X 2 = a x x\ -F • • • -F a n xf 

where a { = (v i9 vf). In this representation of the form, we say that it is diagonal- 
ized. With respect to an orthogonal basis, we see at once that the associated 
matrix of the form is a diagonal matrix, namely 

/"■ i 

a 2 0 

«r 

0 0 



XV, §4 


SYMMETRIC FORMS OVER ORDERED FIELDS 577 


Example. Note that Exercise 33 of Chapter XIII gave an interesting example 
of an orthogonal decomposition involving harmonic polynomials. 


§4. SYMMETRIC FORMS OVER ORDERED FIELDS 

Theorem 4.1. (Sylvester) Let k be an ordered field and let E be a finite 
dimensional vector space over k y with a non-degenerate symmetric form g. There 
exists an integer r ^ 0 such that, if {v x ,. . . ,v n } is an orthogonal basis of E, 
then precisely r among the n elements v\,..., v 2 n are > 0, and n — r among 
these elements are < 0. 

Proof. Let a t = vf, for i = 1,..., n. After renumbering the basis elements, 
say a { ,... ,a r > Oand n, < Ofor i > r. Let {vv^,..., w n } beany orthogonalbasis, 
and let h, = xvf. Say b u ..., b s > 0 and bj < 0 for j > s. We shall prove that 
r = s. Indeed, it will suffice to prove that 

v u ...,v r9 w s+ j,..., w„ 

are linearly independent, for then we get r + n - s < n, whence r < s, and 
r = s by symmetry. Suppose that 

+ ••• + x r v r -F y s+í w s+í + ••• -F y n w n = 0. 


Then 


*i^i + ••• + x r v r = - y s+1 w s + l - y n w„. 

Squaring both sides yields 

a^x] -F • • • -F a r xf = b s+1 yf + !+••• + b n yf. 

The left-hand side is > 0, and the right-hand side is ^ 0. Hence both sides are 
equal to 0, and it follows that x, = y^ = 0, in other words that our vectors are 
linearly independent. 

CoroIIary 4.2. Assume that every positive element of k is a square. Then 
there exists an orthogonai basis {v x , ..., v n } of E such that vf = 1 for i < r 
and vf = — 1 for i > r, and r is uniquely determined. 

Proof. We divide each vector in an orthogonal basis by the square root of 
the absolute value of its square. 

A basis having the property of the corollary is called orthonormal. If X is an 
element of E having coordinates (x 1? ..., x n ) with respect to this basis, then 

X 2 = x\ + • • • + xf - xf + ! - • • • - xf. 
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We say that a symmetric form g is positive definite if X 2 > 0 for all 
X e E, X ± 0. This is the case if and only if r = n in Theorem 4.1. We say 
that g is negative definite if X 2 < 0 for all X e E, X ± 0. 


Corollary 4.3. The vector space E admits an orthogonai decomposition 
E = E + 1 E~ such that g is positive definite on E + and negative definite on 
E~. The dimension of E + (or E~) is the same in all such decompositions. 


Let us now assume that the form g is positive definite and that every positive 
element of k is a square. 

We define the norm of an element v e E by 

\v\ = yjv • v. 

Then we have \v\ > 0 if v # 0. We also have the Schwarz inequality 

\v • w| < \ v\ | w| 

for all v, vv e E. This is proved in the usual way, expanding 
0 < (av ± bw) 2 = (av ± bw) • (av ± bw) 


by bilinearity, and letting b = \ v\ and a = |w|. One then gets 

+ 2abv-w < 21 1 ;| 2 1w| 2 . 

If | v | or | vv | = 0 our inequality is trivial. If neither is 0 we divide by | v \ | vv | to get 
what we want. 

From the Schwarz inequality, we deduce the triangle inequality 

\v ± w\ ^ \v\ ± | w\. 

We leave it to the reader as a routine exercise. 

When we have a positive definite form, there is a canonical way of getting an 
orthonormal basis, starting with an arbitrary basis {v u ..., v n } and proceeding 
inductively. Let 

v ' = M 1 ’ 1 ' 

Then v^ has norm 1. Let 

W 2 = v 2 - (v 2 • v\)v\. 


v 2 


|w 2 | 


XV 7 


and then 
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Inductively, we let 


w r = v r - (v r • v\)v\ - (v r v' r -. j 


and then 


" r iw r r- 


The {v \,..., v' n } is an orthonormal basis. The inductive process just described 
is known as the Gram-Schmidt orthogonalization. 


§5. HERMITIAN FORMS 

Let k 0 be an ordered field (a subfield of the reals, if you wish) and let k = /c o (0» 
where i = yf— T. Then k has an automorphism of order 2, whose fixed field 
is k 0 . 

Let E be a finite-dimensional vector space over k. We shall deal with a hermi- 
tian form on E , i.e. a map 


E x £-/c 


written 


(x, y) <x, y> 

which is /c-linear in its first variable, /c-anti-linear in its second variable, and such 
that 


<*> y> = <y, *> 


for all x, ye E. 

We observe that <x, x> e /c 0 for all xe E. This is essentially the reason why 
the proofs of statements concerning symmetric forms hold essentially without 
change in the hermitian case. We shall now make the list ofthe properties which 
apply to this case. 

Theorem 5.1. There exists an orthogonal basis. Iftheform is non-degenerate , 
there exists an integer r having the follomng property. If {v Xì .. . , v n } is an 
orthogonal basis y then precisely r among the n elements 

(v U Vi) . (v n , v n ) 

are > 0 and n — r among these elements are < 0. 
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An orthogonal basis {v u . .., v n } such that (v i9 ^) = 1 or — 1 is called an 

orthonormal basis. 

Corollary 5.2. Assume that theform is non-degenerate , andthateverypositive 
element of k 0 is a square. Then there exists an orthonormal basis. 

We say that the hermitian form is positive defìnite if (x, x) > 0 for all 
x e E. We say that it is negative defìnite if ( x, x) < 0 for all x e E, x ± 0. 

CoroIIary 5.3. Assume that the form is non-degenerate. Then E admits an 
orthogonal decomposition E = E + 1 E~ such that the form is positive defìnite 
on E + and negative defìnite on E'. The dimension of E + (or E~) is the same 
in all such decompositions. 

The proofs of Theorem 5.1 and its corollaries are identical with those of the 
analogous results for symmetric forms, and will be left to the reader. 

We have the polarization identity, for any /c-linear map A : E -► E, namely 

(A(x + y), (x -f y» - (A(x - y\ (x - y)> = 2[</lx, y) -f (Ay, x>]. 

If < Ax , x> = 0 for all x, we replace x by ix and get 

(Ax, y> -f (Ay, x> = 0, 
i(Ax, y) - i(Ay , x> = 0. 

From this we conclude: 

// </íx, x> = 0, for all x, then /1 = 0. 

This is the only statement which has no analogue in the case of symmetric 
forms. The presence of i in one of the above linear equations is essential to the 
conclusion. In practice, one uses the statement in the complex case, and one 
meets an analogous situation in the real case when A is symmetric. Then the 
statement for symmetric maps is obvious. 

Assume that the hermitian form is positive definite, and that every positive 
element of k 0 is a square. 

We have the Schwarz inequality, namely 

\<X, y) I 2 ^ <x,x}(y,y} 

whose proof comes again by expanding 

0 ^ <ax -f fy, olx -f fy) 

and setting a = <y, y) and f = — <x, y). 

We define the norm of | x | to be 

\x\ = 
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Then we get at once the triangle inequality 


I* + y\ = M + |y|. 


and for a 6 k, 


|ax| = |a| |x|. 

Just as in the symmetric case, given a basis, one can find an orthonormal 
basis by the inductive procedure ofsubtracting successive projections. We leave 
this to the reader. 


§6. THE SPECTRAL THEOREM (HERMITIAN CASE) 

Throughout this section, we let Ehea finite dimensional space over C, of dimension 
^ 1, and we endow E with a positive definite hermitian form. 

Let A : E -► E be a linear map ( i.e . C-linear map) of E into itself. For fixed 
yeE, the map xt-+(Ax, y) is a linear functional, and hence there exists a 
unique element y* e E such that 

(Ax, y> = <x, y*} 

for all x e E. We define the map A* : E -► E by A*y = y*. It is immediately 
clear that A* is linear, and we shall call A* the adjoint of A with respect to our 
hermitian form. 

The following formulas are trivially verified, for any linear maps A, B of E 
into itself: 


(A -f B)* = A* -f B*, A** = A, 

(olA)* = ôlA*, (AB)* = B*A*. 

A linear map A is called self-adjoint (or hermitian) if A* = A. 

Proposition 6.1. A is hermitian if and only if (Ax, x) is real for all x e E. 

Proof. Let A be hermitian. Then 

<>4x, x> = <x, >4x> = <>4x, x>, 

whence <>4x, x> is real. Conversely, assume (Ax, x> is real for all x. Then 
<i4x, x> = < Ax, x> = <x, Ax} = < A*x, x>, 
and consequently {(A — A*)x, x> = Oforallx. Hence/1 = A* by polarization. 
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Let A : E -> E be a linear map. An element Çe E is called an eigenvector 
of A if there exists X e C such that AÇ = XÇ. If £ # 0, then we say that X is an 
eigenvalue of A, belonging to 

Proposition 6.2. Let A be hermitian. Then all eigenvalues belonging to 
nonzero eigenvectors of A are real. If £, £' are eigenvectors 0 having 
eigenvalues À, À' respectively, and if À =A À', f f'. 

Proo/ Let A be an eigenvalue, belonging to the eigenvector £ # 0. Then 
</!£, O = <£, /lO, and these two numbers are equal respectively to A<<!;, O 
and Â<(^, O. Since £ # 0, it follows that X = X, i.e. that X is real. Secondly, 
assume that £, Ç and A, A' are as described above. Then 

<AÇ, O = A<£, {') = <É, AO = X\Z, O, 
from which it follows that <<!;, £'> = 0. 

Lemma 6.3. A.E^Ebea linear map, and dim E ^ 1. Then there 
exists at least one non-zero eigenvector of A. 

Proof. We consider C[y4], i.e. the ring generated by A over C. As a vector 
space over C, it is contained in the ring of endomorphisms of E , which is finite 
dimensional, the dimension being the same as for the ring of all n X n matrices 
if n = dim E. Hence there exists a non-zero polynomial P with coefficients in 
C such that P{A) = 0. We can factor P into a product of linear factors, 


P(X) = (X- X,) ..•(*- X m ) 


with Xj e C. Then (A — XJ) • •• (A — X m I) = 0. Hence not all factors A — Xjl 
can be isomorphisms, and there exists X e C such that A — XI is not an iso- 
morphism. Hence it has an element £ / 0 in its kernel, and wegeM<!; — XÇ = 0. 
This shows that Ç is a non-zero eigenvector, as desired. 

Theorem 6.4. (Spectral Theorem, Hermitian Case). Let E be a non- 
zero finite dimensional vector space over the complex numbers, with a positive 
definite hermitian form. Let A : E —> E be a hermitian linear map. Then E has 
an orthogonal basis consisting of eigenvectors of A. 

Proof Let ^ be a non-zero eigenvector, with eigenvalue X l9 and let E x be 
the subspace generated by Then A maps Ef into itself, because 

<AEl O = <£|, AO = (Ef X^y = X x <E\ 9 <^> = 0, 
whence AE\ is perpendicular to 

Since ^ /0we have <^, ^) > 0 and hence, since our hermitian form is 
non-degenerate (being positive definite), we have 


E = E^ © E\. 
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The restriction of our form to Ef is positive definite (if dim E > 1). From 
Proposition 6.1, we see at once that the restriction of A toF/is hermitian. Hence 
we can complete the proof by induction. 

Corollary 6.5. Hypotheses being as in the theorem, there exists an ortho- 
normal basis consisting of eigenvectors of A. 

Proof Divide each vector in an orthogonal basis by its norm. 

CoroIIary 6.6. Let E be a non-zero finite dimensional vector space over the 
complex numbers, with a positive definite hermitian form f. Let g be another 
hermitian form on E. Then there exists a basis of E which is orthogonal for 
bothf and g. 

Proof. We write/(x, y) = <x, y>. Since / is non-singular, being positive 
definite, there exists a unique hermitian linear map A such that g(x 9 y ) = </lx, y> 
for all x, ye E. We apply the theorem to A, and find a basis as in the theorem, 
say {v ì9 ..., v n }. Let be the eigenvalue such that Av { = Then 

g(Vi, Vj ) = (AVi, Vj) = Vj >, 

and therefore our basis is also orthogonal for g, as was to be shown. 

We recall that a linear map U : E —> E is unitary if and only if U* = U ~ 1 . 
This condition is equivalent to the property that (Ux, Uy) = (x, y) for all elements 
x, y e E. In other words, U is an automorphism of the form /. 

Theorem 6.7. (Spectral Theorem, Unitary Case). Let E be a non-zero 
finite dimensional vector space over the complex numbers, with a positive definite 
hermitian form. Let U : £ —» E bea unitary linear map. Then E has an orthogonal 
basis consisting of eigenvectors of U. 

Proof. Let ^ # 0 be an eigenvector of U. It is immediately verified that 
the subspace of E orthogonal to ^ is mapped into itself by U, using the relation 
U* = U~ l , because if rj is perpendicular to then 

<Uri, {j> = <rj, U*^} = <rj, IT ^ = <rj, X~ l ^y = 0. 

Thus we can finish the proof by induction as before. 

Remark. If X is an eigenvalue of the unitary map U, then X has necessarily 
absolute value 1 (because U preserves length), whence X can be written in the 
form e ie with 9 real, and we may view U as a rotation. 

Let A : E ^ E be an invertible linear map. Just as one writes a non-zero 
complex number z = re ld with r > 0, there exists a decomposition of A as a 
product called its polar decomposition. Let P : E —> E be linear. We say that P 
is semipositive if P is hermitian and we have (Px, x) ^ 0 for all x e E. If we 
have (Px, x) > 0 for all x 0 in E then we say that P is positive defìnite. For 
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example, if we let P = A*A then we see that P is positive definite, because 
(A*Ax, x) = ( Ax , Ax) > 0 if x ^ 0. 

Proposition 6.8. Let P be semipositive. Then P has a unique semipositive 
square root B : E —> E y i.e. a semipositive linear map such that B 2 = P. 

Proof. For simplicity, we assume that P is positive definite. By the spectral 
theorem, there exists a basis of E consisting of eigenvectors. The eigenvalues 
must be > 0 (immediate from the condition of positivity). The linear map defined 
by sending each eigenvector to its multiple by the square root of the corresponding 
eigenvalue satisfies the required conditions. As for uniqueness, since B commutes 
with P because B 2 = P, it follows that if {v Ì9 . . ., v n } is a basis consisting of 
eigenvectors for P, then each v t is also an eigenvector for B. (Cf. Chapter XIV, 
Exercises 12 and 13(d).) Since a positive number has a unique positive square 
root, it follows that B is uniquely determined as the unique linear map whose 
effect on v { is multiplication by the square root of the corresponding eigenvalue 
for P. 

Theorem 6.9. Let A :E —> E be an invertible linear map. Then A can be 
xvritten in a unique way as a product A = UP, where U is unitary and P is 
positive definite. 

Proof. Let P = (A*A) 1/2 , and let U = AP~ l . Using the defiitions, it is 
immediately verified that U is unitary, so we get the existence of the decom- 
position. As for uniqueness, suppose A = U X P X . Let 

U 2 = PP x l = u~ x u x . 

Then U 2 is unitary, so U 2 U 2 = I. From the fact that P* = P and P* = P u we 
conclude that P 2 = P 2 . Since P, P { are Hermitian positive definite, it follows 
as in Proposition 6.8 that P = P Xì thus proving the theorem. 

Remark. The arguments used to prove Theorem 6.9 apply in the case of 
Hilbert space in analysis. Cf. my Real Analysis. However, for the uniqueness, 
since there may not be “eigenvalues”, one has to use another technique from 
analysis, described in that book. 

As a matter of terminology, the expression A = UP in Theorem 6.9 is called 
the polar decomposition of A. Of course, it does matter in what order we write 
the decomposition. There is also a unique decomposition A = P\U X with P x 
positive definite and U x unitary (apply Theorem 6.9 to A~ l , and then take 
inverses). 


§7. THE SPECTRAL THEOREM (SYMMETRIC CASE) 

Let E be a finite dimensional vector space over the real numbers, and let g be 
a symmetric positive definite form on E. If A : E —> E is a linear map, then we know 
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that its transpose, relative to g , is defined by the condition 

<Ax, y > = <x, 'Ay > 

for all x, ye E. We say that A is symmetric if A = l A. As before, an element 
Çe E is called an eigenvector of A if there exists X e R such that AÇ = ÁÇ, and X 
is called an eigenvalue if £ # 0. 

Theorem 7.1. (Spectral Theorem, Symmetric Case). Let E =£ 0. Let 

A : E —> E be a symmetric linear map. Then E has an orthogonal basis 
consisting of eigenvectors of A. 

Proof. If we select an orthogonal basis for the positive definite form, 
then the matrix of A with respect to this basis is a real symmetric matrix, and 
we are reduced to considering the case when E = R". Let M be the matrix repre- 
senting A. We may view M as operating on C", and then M represents a hermi- 
tian linear map. Let z # 0 be a complex eigenvector for M, and write 

z = x + iy, 

with x, y e R". By Proposition 6.2, we know that an eigenvalue A for M, be- 
longing to z, is real, and we have Mz = Az. Hence Mx = Ax and My = A y. 
But we must have x / 0 or y / 0. Thus we have found a nonzero eigenvector 
for M, namely, A, in E. We can now proceed as before. The orthogonal comple- 
ment of this eigenvector in E has dimension (n - 1), and is mapped into itself by 
A. We can therefore finish the proof by induction. 

Remarks. The spectral theorems are valid over a real closed field; our 
proofs don’t need any change. Furthermore, the proofs are reasonably close 
to those which would be given in analysis for Hilbert spaces, and compact 
operators. The existence of eigenvalues and eigenvectors must however be 
proved differently, for instance using the Gelfand-Mazur theorem which we have 
actually proved in Chapter XII, or using a variational principle (i.e. finding a 
maximum or minimum for the quadratic function depending on the operator). 

CoroIIary 7.2. Hypotheses being as in the theorem, there exists an ortho- 
normal basis consisting of eigenvectors of A. 

Proof. Divide each vector in an orthogonal basis by its norm. 

CoroIIary 7.3. Let E be a non-zero fìnite dimensional vector space over the 
reals, with a positive defìnite symmetric form f. Let g be another symmetric 
form on E. Then there exists a basis of E which is orthogonal for bothf and g. 

Proof. We write /(x, y) = <x, y>. Since / is non-singular, being positive 
definite, there exists a unique symmetric linear map A such that 


g(x, y) = (Ax, y > 
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for all x, ye E. We apply the theorem to A , and find a basis as in the theorem. 
It is clearly an orthogonal basis for g (cf. the same proof in the hermitian case). 

The analogues of Proposition 6.8 and the polar decomposition also hold in 
the present case, with the same proofs. See Exercise 9. 


§8. ALTERNATING FORMS 

Let £ be a vector space over the field /c, on which we now make no restriction. 
We let/be an alternating form on £, i.e. a bilinear map /: £ x £ -► /e such that 
/(x, x) = x 2 = 0 for all x e E. Then 

x • y = — y * x 

for all x, y e £, as one sees by substituting (x -f y) for x in x 2 = 0. 

We define a hyperbolic plane (for the alternating form) to be a 2-dimensional 
space which is non-degenerate. We get automatically an element vv such that 
vv 2 = 0, vv ± 0. If P is a hyperbolic plane, and vv e P, vv ^ 0, then there exists 
an element y ± 0 in P such that vv • y ^ 0. After dividing y by some constant, 
we may assume that vv • y = 1. Then y • vv = -1. Hence the matrix of the form 
with respect to the basis {vv, y} is 


The pair vv, y is cailed a hyperbolic pairas before. Given a 2-dimensional vector 
space over k with a bilinear form, and a pair of elements {vv, y} satisfying the 
relations 


vv 2 = y 2 = 0, y • vv = — 1 , vv • y — 1 , 

then we see that the form ìs alternating, and that (vv, y) is a hyperbolic plane for 
the form. 

Given an alternating form/ on £, we say that E (or/) ishyperbolic if E is 
an orthogonal sum of hyperbolic planes. We say that E (or f) is null if x • y = 0 
for all x, y e E. 

Theorem 8.1. Let f be an alternating form on the fìnite dimensional vector 
space E over k. Then E is an orthogonal sum of its hernel and a hyperbolic 
subspace. IfE is non-degenerate, then E is a hyperbolic space, and its dimension 
is even. 

Proof. A complementary subspace to the kernel is non-degenerate, and 
hence we may assume that E is non-degenerate. Let vv e £, vv # 0. There 
exists y e E such that vv • y # 0 and y # 0. Then (vv, y) is non-degenerate, hence 
is a hyperbolic plane P. We have £ = P © P 1 and P 1 is non-degenerate. We 
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complete the proof by induction. 

CoroIIary 8.2. All alternating non-degenerate forms of a given dimension 
over afìeld k are isometric. 

We see from Theorem 8.1 that there exists a basis of E such that relative to 
this basis, the matrix of the alternating form is 



0 1 

-1 0 


\ 


0 1 

-1 0 


For convenience of writing, we reorder the basis elements of our orthogonal 
sum of hyperbolic planes in such a way that the matrix of the form is 

0 I r 

l r 0 
0 0 

where I r is the unit r x r matrix. The matrix 

0 I r 

I r 0 

is called the standard alternating matrix. 

Corollary 8.3. Let E be a finite dimensional vector space over k, with a 
non-degenerate symmetric form denoted by { , ). Let Í1 be a non-de- 
generate alternating form on E. Then there exists a direct sum decomposition 
E = E x © E 2 and a symmetric automorphism A of E (with respect to { , )) 
having the following property. lfx,yeE are written 

x = (x l5 x 2 ) with x^eEj and x 2 e£ 2 > 

y = (yuyi) with y A ^E x and 
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then 


0(x, y) = (Ax u y 2 > - (Ax 29 y x y 


Proof. Take a basis of E such that the matrix of Q with respect to this basis 
is the standard alternating matrix. Let / be the symmetric non-degenerate 
form on E given by the dot product with respect to this basis. Then we obtain 
a direct sum decomposition of E into subspaces E u E 2 (corresponding to the 
first n, resp. the last n coordinates), such that 

0(x, y) =f(x l9 y 2 ) -f{x 2 ,yi). 

Since < , ) is assumed non-degenerate, we can find an automorphism A having 
the desired effect, and A is symmetric because/is symmetric. 


§9. THE PFAFFIAN 

An alternating matrix is a matrix G such that l G = —G and the diagonal 
elements are equal to 0. As we saw in Chapter XIII, §6, it is the matrix of an 
alternating form. We let G be an n x n matrix, and assume n is even. (For odd 
n, cf. exercises.) 

We start over a field of characteristic 0. By Corollary 8.2, there exists a non- 
singular matrix C such that 'CGC is the matrix 


/ 0 

h 

°\ 

l-'r 

0 

0 

\ 0 

0 

0/ 


and hence 

det(C) 2 det(G) = 1 or 0 

according as the kernel of the alternating form is trivial or non-trivial. Thus in 
any case, we see that det(G) is a square in the field. 

Now we move over to the integers Z. Let r 7 (1 ^ i < j ^ n) be n(n — l)/2 
algebraically independent elements over Q, let t u = 0 for i = 1,..., n, and let 
tij = —tji for i > j. Then the matrix T = (r l7 ) is alternating, and hence det(T) 
is a square in the field Q(í) obtained from Q by adjoining all the variables í l7 . 
However, det(T) is a polynomial in Z[í], and since we have unique factorization 
in Z[í], it follows that det(T) is the square ofa polynomial in Z[r]. We can write 

det(T) = P(r) 2 . 

The polynomial P is uniquely determined up to a factor of ± 1. If we substitute 
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values for the í fj so that the matrix T specializes to 



then we see that there exists a unique polynomial P with integer coefficients 
taking the value 1 for this specialized set of values of (í). We call P the generic 
Pfaffian of size n, and write it Pf. 

Let R be a commutative ring. We have a homomorphism 

Z[í] - Rit] 

induced by the unique homomorphism of Z into R. The image of the generic 
Pfaffian of size n inR[t] is a polynomial with coefficients in R , which we still 
denote by Pf. If G is an alternating matrix with coefficients in R, then we write 
Pf(G) for the value of Pf(í) when we substitute for í 0 in Pf. Since the deter- 
minant commutes with homomorphisms, we have: 


Theorem 9.1. Let Rbe a commutative ring. Let (g^) = G be an alternating 
matrix mth g t j e R. Then 

det(G) = (Pf(G)) 2 . 

Furthermore , if C is an n x n matrix in R, then 

Pf(CG r C) = det(C) Pf(G). 

Proof. The first statement has been proved above. The second statement 
will follow if we can prove it over Z. Let (í, j = 1,..., n) be algebraically 
independent over Q, and such that u^, í fj - are algebraically independent over Q. 
Let U be the matrix (w l7 ). Then 

Pf (UVU) = ± det(C) Pf(T), 

as follows immediately from taking the square of both sides. Substitute values 
for U and T such that U becomes the unit matrix and T becomes the standard 
alternating matrix. We conclude that we must have a + sign on the right-hand 
side. Our assertion now follows as usual for any substitution of U to a matrix in 
R, and any substitution of T to an alternating matrix in R, as was to be shown. 


§10. WITT’S THEOREM 


We go back to symmetric forms and we let k be a field of characteristic ± 2. 
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Let E be a vector space over k, with a symmetric form. We say that E is a 
hyperbolic plane if the form is non-degenerate, if E has dimension 2, and if there 
exists an element vv # 0 in E such that vv 2 = 0. We say that E is a hyperbolic 
space if it is an orthogonal sum of hyperbolic planes. We also say that the form 
on E is hyperbolic. 

Suppose that E is a hyperbolic plane, with an element vv # 0 such that 
vv 2 = 0. Let u e E be such that E = (vv, u). Then u • vv # 0; otherwise vv would 
be a non-zero element in the kernel. Let bekbe such that vv -bu = bw -u = 1. 

Then select aek such that 

(aw -f bu) 2 = 2abw • u + b 2 u 2 = 0. 

(This can be done since we deal with a linear equation in a.) Put v = aw -f bu. 
Then we have found a basis for E, namely E = (vv, v) such that 

w 2 = v 2 = 0 and vv • v = 1. 

Relative to this basis, the matrix of our form is therefore 

Cí) 

We observe that, conversely, a space E having a basis {vv, v} satisfying 
vv 2 = v 2 = 0 and vv • v = 1 is non-degenerate, and thus is a hyperbolic plane. A 
basis {vv, v} satisfying these relations will be called a hyperbolic pair. 

An orthogonal sum of non-degenerate spaces is non-degenerate and hence 
a hyperbolic space is non-degenerate. We note that a hyperbolic space always 
has even dimension. 


Lemma 10.1. Let E be a finite dimensional vector space over k, with a non - 
degenerate symmetric form g. Let F be a subspace, F 0 the hernel of F, and 
suppose we have an orthogonal decomposition 

F = F 0 1 U. 

Let {w l5 ..., w s } be a basis of F 0 . Then there exist elements v x ,..., v s in E 
perpendicular to U, such that each pair {w,-, v t } is a hyperbolic pair generating 
a hyperbolic plane P h and such that we have an orthogonal decomposition 


(7 _L Pj _L • • • _L P s . 


Proof. Let 


U x = (w 2 ,..., wj® U. 

Then U j is contained in F 0 © U properly, and consequently ( F 0 ® U) 1 is 
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contained inUf properly. Hence there exists an element e í/| but 

U)\ 

We have vv^ -u^ # 0, and hence (w l5 is a hyperbolic plane Pj. We have 
seen previously that we can find v x e P { such that {w u v x } is a hyperbolic pair. 
Furthermore, we obtain an orthogonal sum decomposition 

F i = (w 2 ,..., w s ) 1 Pi 1 U. 

Then it is clear that (w 2 ,..., w s ) is the kernel of F u and we can complete the 
proof by induction. 

Theorem 10.2 Let E be a finite dimensional vector space over k, and let g 
be a non-degenerate symmetric form on E. Let F , F' be subspaces of E, and 
let a: F —> F' be an isometry. Then <x can be extended to an isometry ofE onto 
itself. 

Proof We shall first reduce the proof to the case when F is non-degenerate. 
We can write F = F 0 1 U as in the lemma of the preceding section, and 
then oF = F f = oF 0 1 oU. Furthermore, oF 0 = F 0 is the kernel of F'. Now 
we can enlarge both F and F' as in the lemma to orthogonal sums 

U 1 Pi 1 • • • 1 P s and oU 1 P\ _L • • • _L P s 

corresponding to a choice of basis in F 0 and its corresponding image in F 0 . 
Thus we can extend o to an isometry of these extended spaces, which are non- 
degenerate. This gives us the desired reduction. 

We assume that F, F' are non-degenerate, and proceed stepwise. 

Suppose first that F' = P, i.e. that o is an isometry of F onto itself. We can 
extend o to E simply by leaving every element of F 1 fixed. 

Next, assume that dim F = dim F' = 1 and that F # F'. Say F = ( v ) and 
F' = (i/). Then v 2 = v' 2 . Furthermore, (v, v' ) has dimension 2. 

If ( v , v') is non-degenerate, it has an isometry extending <x, which maps v on 
v' and v' on v. We can apply the preceding step to conclude the proof. 

If ( v , v') is degenerate, its kernel has dimension 1. Let w be a basis for this 
kernel. There exist a,bsk such that v' = av + bw. Then v' 2 = a 2 v 2 and hence 
a = ± 1. Replacing v' by —v' if necessary, we may assume a = 1. Replacing w 
by bw, we may assume v' = v + w. Let z = v + v'. We apply Lemma 10.1 to 
the space 


(w, z) = (w) 1 (z). 

We can find an element y e E such that 

y • z = 0, y 2 = 0, and w • y = 1. 
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The space (z, vv, y) = (z) 1 (vv, y) is non-degenerate, being an orthogonal sum 
of (z ) and the hyperbolic plane (vv, y). It has an isometry such that 

z z, vv — vv, y <-+ — y. 

But v = }(z - vv) is mapped on v' = j(z 4- vv) by this isometry. We have 
settled the present case. 

We finish the proof by induction. By the existence of an orthogonal basis 
(Theorem 3.1), every subspace F of dimension > 1 has an orthogonal de- 
composition into a sum of subspaces of smaller dimension. Let F = 1 F 2 

with dim F { and dim F 2 ^ L Then 

oF = oF ! ± oF 2 . 

Let o ^ = <j|Fj be the restriction of o to F v By induction, we can extend o x to 
an isometry 


ctj : E -> E. 

Then ôfFj) = (o^F^) 1 . Since oF 2 is perpendicular to oF^ = o x F x , it follows 
that oF 2 is contained in â^Fl). Let o 2 = o\F 2 . Then the isometry 

O 2 • F 2 * O 2 F 2 = @F 2 


extends by induction to an isometry 

ô 2 \F\ - 

The pair (o u ô 2 ) gives us an isometry of F x _L F} = E onto itself, as desired. 

CoroIIary 10.3. Let E y E' be finite dimensional vector spaces with non- 
degenerate symmetric forms, and assume that they are isometric. Let F, F' be 
subspaces, and let a : F —> F' be an isometry. Then a can be extended to an 
isometry of E onto E'. 

Proof. Clear. 

Let E be a space with a symmetric form g , and let F be a null subspace. 
Then by Lemma 10.1, we can embed F in a hyperbolic subspace H whose 
dimension is 2 dim F. 

As applications of Theorem 10.2, we get several corollaries. 

CoroIIary 10.4. Let E be a finite dimensional vector space with a non- 
degenerate symmetric form. Let W be a maximal null subspace, and let W' be 
some null subspace. Then dim W' ^ dim W, and W' is contained in some 
maximal null subspace, whose dimension is the same as dim W. 
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Prooj. That W' is contained in a maximal null subspace follows by Zorn’s 
lemma. Supposedim W' ^ dim W. We have an isometry ofW onto a subspace 
of W' which we can extend to an isometry of E onto itself. Then a~ \Wj is a 
null subspace containing W, hence is equal to W f whence dim W = dim W'. 
Our assertions follow by symmetry. 

Let E be a vector space with a non-degenerate symmetric form. Let W be a 
null subspace. By Lemma 10.1 we can embed W in a hyperbolic subspace H of 
E such that W is the maximal null subspace of H , and H is non-degenerate. Any 
such H will be called a hyperbolic enlargement of W. 

CoroIIary 10.5. Let E be a jìnite dimensional vector space with a non- 
degenerate symmetric form. Let W and W' be maximal null subspaces. Let H y 
H' be hyperbolic enlargements ofW, W' respectively. ThenH, H' are isometric 
and so are H 1 and H ,J -. 

Proof. We have obviously an isometry of H on H\ which can be extended 
to an isometry of E onto itself. This isometry maps H 1 on H' 1 , as desired. 

CoroIIary 10.6. Let g x , g 2 , hbe symmetricforms onjinite dimensional vector 
spaces over the jìeld of k. If g { © h is isometric to g 2 © h, and if g ìf g 2 are 
non-degenerate, then g x is isometric to g 2 . 

Proof. Let g x be a form on E x and g 2 a form on E 2 . Let h be a form on F. 
Then we have an isometry between F © E x and F ® E 2 . Extend the identity 
id : F —> F to an isometry a of F © E x to F © E 2 by Corollary 10.3. Since E x 
and E 2 are the respective orthogonal complements of F in their two spaces, we 
must have cr(E x ) = E 2 , which proves what we wanted. 

If g is a symmetric form on E , we shall say that g is definite if g(x, x) ± 0 
for any x e E, x 0 (i.e. x 2 ± 0 if x 0). 

Corollary 10.7. Let gbe a symmetric form on E. Then g has a decomposition 
as an orthogonal sum 


G Go © Ghyp © Gde( 

where g 0 is a nulì form , g hyp is hyperbolic , and g de( is definite. The form 
0hyp®0def is non-degenerate. The forms g 0 , g hypf and g dç( are uniquely 
determined up to isometries. 

Proof. The decomposition g = g 0 © g x where g 0 is a null form and g x 
is non-degenerate is unique up to an isometry, since g 0 corresponds to the 
kernel of g. 

We may therefore assume that g is non-degenerate. If 


9 = Gh © Qd 
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where g h is hyperbolic and g d is definite, then g h corresponds to the hyperbolic 
enlargement of a maximal null subspace, and by Corollary 10.5 it follows that 
g h is uniquely determined. Hence g d is uniquely determined as the orthogonal 
complement of g h . (By uniquely determined, we mean of course up to an 
isometry.) 

We shall abbreviate g hyp by g h and g de( by g d . 


§11. THE WITT GROUP 

Let g 9 (p by symmetric forms on finite dimensional vector spaces over k. We 
shall say that they are equivalent if g d is isometric to <p d . The reader will verify 
at once that this is an equivalence relation. Furthermore the (orthogonal) sum 
of two null forms is a null form, and the sum of two hy perbolic forms is hy perbolic. 
However, the sum of two definite forms need not be definite. We write our 
equivalence g ~ <p. Equivalence is preserved under orthogonal sums, and hence 
equivalence classes of symmetric forms constitute a monoid. 

Theorem 11.1. The monoid of equivalence classes of symmetric forms (over 

the field k) is a group. 

Proof. We have to show that every element has an additive inverse. Let g 
be a symmetric form, which we may assume definite. We let — g be the form 
such that (—g)(x, y) = —g(x, y). We contend that g © — g is equivalent to 0. 
Let E be the space on which g is defined. Then g © — g is defined on E © E. 
Let W be the subspace consisting of all pairs (x, x) with x e E. Then W is a null 
spaceforgf © — g. Sincedim(£ © E) = 2 dim W, it followsthat W isamaximal 
null space, and that g © — g is hyperbolic, as was to be shown. 

The group of Theorem 11.1 will be called the Witt group of k, and will be 
denoted by W(k). It is of importance in the study of representations of elements 
of k by the quadratic form/arising from g [i.e. f(x) = g(x, x)], for instance 
when one wants to classify the definite forms /. 

We shall now define another group, which is of importance in more functorial 
studies of symmetric forms, for instance in studying the quadratic forms arising 
from manifolds in topology. 

We observe that isometry classes of non-degenerate symmetric forms (over 
k) constitute a monoid M(k), the law of composition being the orthogonal sum. 
Furthermore, the cancellation law holds (Corollary 10.6). We let 


cl: M(k) - WG(k) 
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be the canonical map of M(k) into the Grothendieck group of this monoid, 
which we shall call the Witt-Grothendieck group over k . As we know, the 
cancellation law implies that cl is injective. 

If g is a symmetric non-degenerate form over /c, we define its dimension 
dim g to be the dimension of the space E on which it is defined. Then it is clear 
that 


dim (0 © g') = dim g -f dim g'. 

Hence dim factors through a homomorphism 

dim : WG(k) - Z. 

This homomorphism splits since we have a non-degenerate symmetric form of 
dimension 1 . 

Let WG 0 (k) be the kernel of our homomorphism dim. If g is a symmetric 
non-degenerate form we can define its determinant det(gr) to be the determinant 
of a matrix G representing g relative to a basis, modulo squares. This is well 
defined as an element of k*/k* 2 . We define det of the 0-form to be 1. Then det is 
a homomorphism 


det: M(k) k*/k* 2 , 

and can therefore be factored through a homomorphism, again denoted by 
det, of the Witt-Grothendieck group, det: WG(k) -► k*/k* 2 . 

Other properties of the Witt-Grothendieck group will be given in the 
exercises. 


EXERCISES 


1. (a) Let E be a finite dimensional space over the complex numbers, and let 

h:E x £-C 


be a hermitian form. Write 


Kx,y) = g(x,y) + if(x , y) 

where g , / are real valued. Show that g , / are R-bilinear, g is symmetric, / is 
alternating. 

(b) Let E be finite dimensional over C. Let g : E x E -> C be R-bilinear. Assume 
that for all x e £, the map y g(x , y) is C-linear, and that the R-bilinear form 


/(x, y) = g(x , y) - g(y , x) 
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ìs real-valued on E x E. Shovv that there exists a hermitian form h on E and a 
symmetric C-bilinear form ì/j on E such that 2ig = h -I- ý. Show that h and ý are 
uniquely determined. 

2. Prove the real case of the unitary spectral theorem: If E is a non-zero finite dimensional 
space over R, with a positive definite symmetric form, and U :E E ìs a unitary linear 
map, then E has an orthogonal decomposition into subspaces of dimension 1 or 2, 
invariant under U. If dim E = 2, then the matrix of U with respect to any ortho- 
normal basis ìs of the form 

/cos 6 -sin0\ /—1 0\/cos 6 -sm0\ 

\sin 6 cos 6/ \ 0 l/\sin0 cos 0/ 

depending on whether det(U) = 1 or — 1. Thus U is a rotation, or a rotation followed 
by a reflection. 

3. Let E be a finite-dimensional, non-zero vector space over the reals, with a positive 
definite scalar product. Let T: E -+ E be a unitary automorphism of E. Show that E 
is an orthogonal sum of subspaces 

£ = -±E m 

such that each E, is T-invariant, and has dimension 1 or 2. If E has dimension 2, show 
that one can find a basis such that the matrix associated with T with respect to this 
basis is 

cos 6 -sin0\ /-cos 6 sin 6 
sin 6 cos 6/ \ sin 6 cos 6 

according as det T = 1 or det T = — 1. 

4. Let E be a finite dimensional non-zero vector space over C, with a positive definite 
hermitian product. Let A, B : E —» E be a hermitian endomorphism. Assume that 
AB = BA. Prove that there exists a basis of E consisting of common eigenvectors 
for A and B. 

5. Let E be a finite-dimensional space over the complex, with a positive definite hermitian 
form. Let S be a set of (C-linear) endomorphisms of E having no invariant subspace 
except 0 and E. (This means that if F is a subspace of E and BF cz F for all B e S, then 
F = 0 or F = E.) Let A be a hermitian map of E into ìtself such that AB = BA for all 
BeS. Show that A = XI for some real number X. [Hint: Show that there exists 
exactly one eigenvalue of A. If there were two eigenvalues, say X x ^ X 2 , one could find 
two polynomials / and g with real coefficients such that f(A) ^ 0, g(A) ^ 0 but 
f(A)g(A) = 0. Let F be the kernel of g(A) and get a contradiction.] 

6. Let E be as in Exercise 5. Let T be a C-linear map of E into itself. Let 

A = j(T + T*). 

Show that A is hermitian. Show that T can be written in the form A 4- iB where A, B 
are hermitian, and are uniquely determined. 

7. Let S be a commutative set of C-linear endomorphisms of E having no invariant sub- 
space unequal to 0 or E. Assume in addition that if BeS, then B* e S. Show that each 
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element of S is of type a/ for some complex number a. [ Hint : Let B 0 e S. Let 

A = j(B q -i- B*). 

Show that A = XI for some real A.] 

8. An endomorphism B of E is said to be normal if B commutes with B*. State and prove a 
spectral theorem for normal endomorphisms. 


Symmetric endomorphisms 


For Exercises 9, 10 and 11 we let E be a non-zero finite dimensional vector space over 
R, with a symmetric positive definite scalar product g, which gives rise to a norm \\ onE. 

Let A : E —> E be a symmetric endomorphism of E with respect to g. Define A ^ 0 
to mean (Ax, x) ^ 0 for all x E E. 

9. (a) Show that A ^ 0 if and only if all eigenvalues of A belonging to non-zero 
eigenvectors are ^ 0. Both in the hermitian case and the symmetric case, one 
says that A is semipositive if A ^ 0, and positive definite if (Ax, x) > 0 for all 
jc 0. 

(b) Show that an automorphism A of E can be written in a unique way as a product 
A = UP where U is real unitary (that is, { UU = /), and P is symmetric positive 
definite. For two hermitian or symmetric endomorphisms A, B, define A ^ B to 
mean A - B ^ 0, and similarly for A > B. Suppose A > 0. Show that there are 
two real numbers a > 0 and (3 > 0 such that al ^ A ^ /3/. 

10. If A is an endomorphism of E, define its norm |A| to be the greatest lower bound of 
all numbers C such that |Ajc| ^ C\x\ for all jc E E. 

(a) Show that this norm satisfies the triangle inequality. 

(b) Show that the series 

exp(A) = / + A + Y\ + • ‘ • 


converges, and if A commutes with B , then exp(A + B) = exp(A) exp(5). 
If A is sufficiently close to /, show that the series 

log(A) = (d^_(d^ + ... 


converges, and if A commutes with B, then 

log(AZ?) = log A + log B. 


(c) Using the spectral theorem, show how to define log P for arbitrary positive 
definite endomorphisms P. 

11. Again, let E be non-zero finite dimensional over R, and with a positive definite 
symmetric form. Let A : E —> E be a linear map. Prove: 

(a) If A is symmetric (resp. alternating), then exp(A) is symmetric positive definite 
(resp. real unitary). 

(b) If A is a linear automorphism of E sufficiently close to /, and is symmetric 
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positive definite (resp. real unitary), then log A is symmetric (resp. 
alternating). 

(c) More generally, if A is positive definite, then log A is symmetric. 

12. Let R be a commutative ring, let E, F be E-modules, and let/: E -> F be a mapping. 
Assume that multiplication by 2 in F is an invertible map. Show that/is homogeneous 
quadratic if and only if/satisfies the parallelogram law: 

/(* + y) + /(x -y) = 2/(x) + 2f(y) 


for all x, y e E. 

13. (Tate) Let £, F be complete normed vector spaces over the real numbers. Let 
/: E -* F be a map having the following property. There exists a number C > 0 such 
that for all x, y e E we have 


i/(* + y) -/w -m i ^ c. 

Show that there exists a unique additive map g : E^>F such that |g — /| is bounded 
(i.e. | g(x) — /(x) | is bounded as a function of x). Generalize to the bilinear case. [ Hint: 
Let 


g(x) = lim 

n-* oo 


/( 2 n x) 
2 n 


14. (Tate) Let S be a set and /:S —> S a map of S into itself. Let h:S — » R be a real 
valued function. Assume that there exists a real number í/ > 1 such that hof- df 
is bounded. Show that there exists a unique function h f such that h f - h is bounded, 
and h f °f = dh f . [Hint: Let h f (x) = lim h(f n (x))/d n .] 

15. Define maps of degree > 2, from one module into another. [ Hint: For degree 3, 
consider the expression 


f(x + y + z) -f(x + y) -f(x + z) -f(y + z) +f(x) +f(y) +/(z).] 

Generalize the statement proved for quadratic maps to these higher-degree maps, i.e. 
the uniqueness of the various multilinear maps entering into their definitions. 


Alternating forms 

16. Let E be a vector space over a field k and let g be a bilinear form on E. Assume that 
whenever x,ye E are such that g(x, y) = 0, then g(y, x) = 0. Show that g is symmetric 
or alternating. 

17. Let E be a module over Z. Assume that E is free, of dimension n > 1, and let/be a 
bilinear alternating form on E. Show that there exists a basis {ej (i = 1 ,...,n) and 
an integer r such that 2 r ^ n, 

Ci -?2 = <*!, ^3-^4 = û 2 , ...,e 2r _! -e 2r = a r 

where a í ,...,a r e Z, a, ^ 0, and <3, divides a i+ { for i = 1,. .., r - 1 and finally 
^ ej = 0 for all other pairs of indices i ^ j. Show that the ideals Za, are uniquely 
determined. [Hint : Consider the injective homomorphism (p f :E^E v oïE into the 



XV, Ex 


EXERCISES 599 


dual space over Z, viewing (p f (E) as a free submodule of £ v .]. Generalize to principal 
rings when you know the basis theorem for modules over these rings. 

Remark. A basis as in Exercise 18 is called a symplectic basis. For one use of 
such a basis, see the theory of theta functions, as in my Introduction to Algebraic and 
Abelian Functions (Second Edition, Springer Verlag), Chapter VI, §3. 

18. Let E be a finite-dimensional vector space over the reals, and let < , > be a symmetric 
positive definite form. Let Q be a non-degenerate alternating form on E. Show that 
there exists a direct sum decomposition 

E = E x © E 2 

having the following property. If x, y e E are written 

x = (xj, x 2 ) with x^eÊj and x 2 e£ 2 » 

y = (yiiyi) with y^eE^ and y 2 eE 2ì 

then Q(x, y) = (x 1? y 2 ) - (x 2 , y x ). [ Hint : Use Corollary 8.3, show that A is positive 
definite, and take its square root to transform the direct sum decomposition obtained 
in that corollary.] 

19. Show that the pfaffian of an alternating n x n matrix is 0 when n is odd. 

20. Prove all the properties for the pfaffian stated in Artin’s Geometric Algebra (Inter- 
science , 1957), p. 142. 

The Witt group 

21. Show explicitly how W(k) is a homomorphic image of WG(k). 

22. Show that WG(k) can be expressed as a homomorphic image of Z[/c*//c* 2 ] [Hint: 
Use the existence of orthogonal bases.] 

23. Witt’s theorem is still true for alternating forms. Prove it or look it up in Artin (ref. 
in Exercise 20). 

SL n ( R) 

There is a whole area of linear algebraic groups, giving rise to an extensive algebraic 
theory as well as the possibility of doing Fourier analysis on such groups. The group 
SL n ( R) (or SL„( C)) can serve as a prototype, and a number of basic facts can be easily 
verifìed. Some of them are listed below as exercises. Readers wanting to see solutions can 
look them up in [JoL 01], Spherical Inversion on SL n ( R), Chapter I. 

24. Iwasawa decomposition. We start with GL n ( R). Let: 

G = GL n ( R); 

K — subgroup of real unitary n x n matrices; 

U = group of real unipotent upper triangular matrices, that is having components 1 
on the diagonal, arbitrary above the diagonal, and 0 below the diagonal; 
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A = group of diagonal matrices with positive diagonal components. 

Prove that the product map U x A x K UAK c= G is actually a bijection. This 
amounts to Gram-Schmidt orthogonalization. Prove the similar statement in the 
complex case, that is, for G(C) = GL n (C), K(C) = complex unitary group, U(C) = 
complex unipotent upper triangular group, and A the same group of positive diag- 
onal matrices as in the real case. 

25. Let now G = SL n ( R), and let K, A be the corresponding subgroups having deter- 
minant 1. Show that the product U x A x K —► UAK again gives a bijection with G. 

26. Let a be the R-vector space of real diagonal matrices with trace 0. Let a v be the 

dual space. Let a, (i = 1,..., n - 1) be the functional defined on an element H = 
diag(/ 2 i,. by a,(//) — h ( - h i+ j. (a) Show that {aj,..., a„_i} is a basis of a v 

over R. (b) Let H i i+ \ be the diagonal matrix with hi = 1, h i+ \ = -1, and hj = 0 
for j * /,/+ 1. Show that • • •, H n -\, n } is a basis of a. (c) Abbreviate 

H i i+ \ = Hí (i = 1,... ,n - 1). Let a/ e a v be the functional such that a '(Hj) = ôy 
(=1 if i=j and 0 othenvise). Thus {aj,..., a'_,} is the dual basis of 
{H \,..., //„_i}. Show that 

ol[(H) = h\ 4* • • • + hi. 

27. The trace form. Let Mat„(R) be the vector space of real n x n matrices. Defìne the 
twisted trace form on this space by 

B t (X, Y) = tr(Z r T) = <X, Y\. 

As usual, 'Y is the transpose of a matrix Y. Show that B t is a symmetric positive 
definite bilinear form on Mat w (R). What is the analogous positive definite hermitian 
form on Mat„(C)? 

28. Positivity. On a (real diagonal matrices with trace 0) the form of Exercise 27 can be 
defined by tr (XY), since elements X,Y e a are symmetric. Let stf = {ai,..., a„_i} 
denote the basis of Exercise 26. Define an element H e a to be semipositive (writen 
H ^ 0) if a,(//) ^ 0 for all i = 1,..., n — 1. For each a e a v , let // a e a represent a 
with respect to B t , that is <//„,//) = a(//) for all H e a. Show that H ^ 0 if and 
only if 

n— 1 

H = J2 SiH a > with Si ^ 0. 

i=i 

Similarly, define H to be positive and formulate the similar condition with s, > 0. 

29. Show that the elements not' (i = 11) can be expressed as linear combina- 
tions of ai,..., a n _i with positive coefficients in Z. 

30. Let W be the group of permutations of the diagonal elements in the vector space a of 
diagonal matrices. Show that a^o is a fundamental domain for the action of W on a 
(i.e., given H e a, there exists a unique H + ^ 0 such that H + = wH for some 
w e W. 



CHAPTER XVI 

The Tensor Product 


Having considered bilinear maps, we now come to multilinear maps and basic 
theorems concerning their structure. There is a universal module representing 
multilinear maps, called the tensor product. We derive its basic properties, and 
postpone to Chapter XIX the special case of alternating products. The tensor 
product derives its name from the use made in differential geometry, when this 
product is applied to the tangent space or cotangent space of a manifold. The 
tensor product can be viewed also as providing a mechanism for “extending the 
base”; that is, passing from a module over a ring to a module over some algebra 
over the ring. This “extension” can also involve reduction modulo an ideal, 
because what matters is that we are given a ring homomorphism /: A —> £, and 
we pass from modules over A to modules over B. The homomorphism /can be 
of both types, an inclusion or a canonical map with B = A/J for some ideal J, 
or a composition of the two. 

I have tried to provide the basic material which is immediately used in a 
variety of applications to many fields (topology, algebra, differential geometry, 
algebraic geometry, etc.). 


§1. TENSOR PRODUCT 

Let R be a commutative ring. If E u F are modules, we denote by 

L n (E í , F) 

the module of n-multilinear maps 

f\E x x ••• x E„-F. 
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We recall that a multilinear map is a map which is linear (i.e., R-linear) in each 
variable. We use the words linear and homomorphism interchangeably. Unless 
othermse specified , modules , homomorphisms , linear , multilinear refer to the ring R. 

One may view the multilinear maps of a fixed set of modules E u as the 

objects of a category. Indeed, if 

f :E X x ••• x E n -> F and g: E x x ••• x E n -> G 

are multilinear, we define a morphism / -► to be a homomorphism h: F -► G 
which makes the following diagram commutative: 




x 


• • • x 



F 

h 

G 


A universal object in this category is called a tensor product of E í9 ... 9 E n 
(over F). 

ILc shall now prove that a tensor product exists , and in fact construct one in a 
natural way. By abstract nonsense, we know of course that a tensor product is 
uniquely determined, up to a unique isomorphism. 

Let M be the free module generated by the set of all n-tuples (x^,..., x„), 
(XìeEì), i.e. generated by the set E t x ••• x E n . Let N be the submodule 
generated by all the elements of the following type: 


(Xj, ... , x f + xj,..., x n ) - (x ì9 ... , x i9 ... , x w ) - (Xj, .. 

(xi, • • •, ax i9 ..., x„) - a(x l5 ..., x„) 


X/, • • • , Xn) 


for all X/ e E i9 x'ì e E ì9 a e R. We have the canonical injection 


x • - x E„-> M 

of our set into the free module generated by it. We compose this map with the 
canonical map M -► M/N on the factor module, to get a map 


(p :E x x • • • x E n —► M/N. 

We contend that <p is multilinear and is a tensor product. 

It is obvious that q> is multilinear—our definition was adjusted to this 
purpose. Let 


f:E x x ••• x E n -► G 

be a multilinear map. By the definition of free module generated by 

E x x • • • x E n 
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we have an induced linear map M -► G which makes the following diagram 
commutative: 


M 

E x x • • • x E n 


Since/is multilinear, the induced map M -► G takes on the value 0 on N. Hence 
by the universal property of factor modules, it can be factored through M/N, 
and we have a homomorphism /* : M/N -► G which makes the following dia- 
gram commutative: 



E, 


x 


■ • • x 



M/N 



G 


Since the image of cp generates M/N , it follows that the induced map /* is 
uniquely determined. This proves what we wanted. 

The module M/N will be denoted by 

n 

E x ®-"®E n or also (X) E t . 

i = 1 

We have constructed a specific tensor product in the isomorphism class of tensor 
products, and we shall call it the tensor product of E u ..., E n . If x f - e E i9 we write 

(p(x u ..., x n ) = Xi (g) • •• ® x n = Xj ® R • • • ® R x n . 

We have for all i, 

x x ® • • • ® axi ® • • • ® x n = a(x x ® • • • ® x n \ 

Xi ® • • • ® (x f -f xj) (g) • • • (8) x„ 

= (Xi ® • • • ® x n ) + (X! ® • • • ® x'i ®) • • • ® X n ) 


for x f , x\ e Ei and a e R. 

If we have two factors, say E (g) F, then every element of E ®) F can be 
written as a sum of terms x ®) y with x £ E and y £ F, because such terms generate 
E ® F over /c, and «(x ® y) = ax (g) y for a £ /?. 
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Remarlt. If an element of the tensor product is 0, then that element can 
already be expressed in terms of a finite number of the relations defining the 
tensor product. Thus if E is a direct limit of submodules £, then 


lim F ® Ei = F ® lim £ t = F ® E. 

In particular, every module is a direct limit of finitely generated submodules, 
and one uses frequently the technique of testing whether an element of F ® E is 
0 by testing whether the image of this element in F ® £, is 0 when £ f ranges over 
the finitely generated submodules of E. 

Warning. The tensor product can involve a great deal ofcollapsing between 
the modules. For instance, take the tensor product over Z of Z/mZ and Z/nZ 
where m, n are integers > 1 and are relatively prime. Then the tensor product 

Z/nZ ® Z/mZ = 0. 

Indeed, we have n(x (x) y) = (nx) (x) y = 0 and m(x ® y) = x ® my = 0. Hence 
x (x) y = 0 for all x e Z/nZ and y e Z/mZ. Elements of type x (x) y generate the 
tensor product, which is therefore 0. We shall see later conditions under which 
there is no collapsing. 

In many subsequent results, we shall assert the existence of certain linear 
maps from a tensor product. This existence is proved by using the universal 
mapping property of bilinear maps factoring through the tensor product. The 
uniqueness follows by prescribing the value of the linear maps on elements of 
type x (x) y (say for two factors) since such elements generate the tensor product. 

We shall prove the associativity of the tensor product. 

Proposition 1.1. Let E í9 E 2 , E 3 be modules. Then there exists a unique 

isomorphism 


(£j ® E 2 ) ® E 3 -► E { (x) (E 2 (x) E 3 ) 

such that 


(x ® y) ® z h-> x ® (y (x) z) 
for x e E u ye E 2 and ze E 3 . 

Proof. Since elements of type (x ® y) ® z generate the tensor product, the 
uniqueness of the desired linear map is obvious. To prove its existence, let 
xeE x . Themap 


X x : E 2 x E 3 -► (E x ® E 2 ) ® E 3 
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such that X x (y\ z) = (x ® y) ® z is clearly bilinear, and hence factors through a 
linear map of the tensor product 

l x : E 2 ® £ 3 -► (£j (g) £ 2 ) ® £ 3 . 

The map 

£i x (£ 2 (g) £ 3 ) -► (£, (8) £ 2 ) ® £3 

such that 

(x, a) 4(a) 

for xeE^ and ae £ 2 ( 8 ) £3 is then obviously bilinear, and factors through a 
linear map 

£1 ® (£ 2 ® £3) -► (£1 ® £2) ® £ 3 > 

which has the desired property (clear from its construction). 

Proposition 1.2. Let £, £ he modules. Then there is a unique isomorphism 

E ( 8 ) F -► F ® £ 

such í/iní x (8) y 1 —► T ® x for x e £ and y e £. 

Proo/ The map £ x £ -► £ (8) £ such that (x, y) f—► y ® x is bilinear, and 
factors through the tensor product £ ® £, sending x (8) T on y (8) x. Since this 
last map has an inverse (by symmetry) we obtain the desired isomorphism. 

The tensor product has various functorial properties. First, suppose that 

/i:£ï-£| (i = l,...,n) 

is a collection of linear maps. We get an induced map on the product, 

If we compose { ] /’■ with the canonical map into the tensor product, then we get 
an induced linear map which we may denote by T(f u ...,/„) which makes the 
following diagram commutative: 

E\ x ••• x E'„ -» E\®---®E' n 

n/, n/,. ./.> 

£1 X ... X E„ - > Ei (8) •••(8)£„ 
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It is immediately verified that T is functorial, namely that if we have a com- 
posite of linear maps f o g. (i = 1 ,..., n) then 

T(fi °9u---,fn°Q n ) = T(fu •••,/») ° T(g u 

and 

T(id,..., id) = id. 

We observe that T(f l9 ... 9 f n ) is the unique linear map whose effect on an 
element x\ ® • • • ® x„ of E\ ® • • • ® E n is 

x ; i (x) •. • ® x; /ì(xì) ® • • • ® / w (x;). 

We may view T as a map 

e' ì9 ®e\ 

i=l \i=l i = l / 

and the reader will have no difficulty in verifying that this map is multilinear. 
We shall write out what this means explicitly for two factors, so that our map can 
be written 

{f,g)^T(f,g). 

Given homomorphisms /: F' -► F and g u g 2 '- E' -► E, then 

T(f, gi + 02 )= T(/, 0 1 )+ T(f,g 2 ), 

T(f, m) = aT(f, gi ). 

In particular, select a fixed module F, and consider the functor t = x F (from 
modules to modules) such that 

x(E) = F ® E. 

Then t gives rise to a linear map 

t:L(F, E)^L(t(E'\ t(£)) 
for each pair of modules E\ L, by the formula 

t(/)= T(id,/). 

Remark. By abuse of notation, it is sometimes convenient to write 
/i ® • • • <8> /„ instead of T(f u ... ,/„). 
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This should not be confused with the tensor product of elements taken in the 
tensor product of the modules 


L(£'i,£i)<8> ..®L(£;,£ W ). 

The context will always make our meaning clear. 


§2. BASIC PROPERTIES 

The most basic relation relating linear maps, bilinear maps, and the tensor 
product is the following: For three modules £, F, G, 


L(F, L(F, G)) « L 2 (L, F;G)x L(E® F, G). 


The isomorphisms involved are described in a natural way. 

(i) L\E, F\ G) -* L(E, L(F, G)). 

If /: E x F -> G is bilinear, and xe E, then the map 

f x 'F^G 

such that f x (y) = /(x, y) is linear. Furthermore, the map xh f x is linear, and 
is associated with / to get (i). 

(ii) L(F,L(F, G)) -► L 2 (F, F; G). 

Let <p g L(F, L(F, G)). We let : F x F -► G be the bilinear map such that 

/*(x, y) = <p(x)(y). 


Then cp^f^ defines (ii). 

It is clear that the homomorphisms of (i) and (ii) are inverse to each other 
and therefore give isomorphisms of the first two objects in the enclosed box. 

(iii) L 2 (F, F; G) - L(F ® F, G). 

This is the map/ 1 —► / + which associates to each bilinear map / the induced 
linear map on the tensor product. The association /»—►/* is injective (because 
/* is uniquely determined by /), and it is surjective, because any linear map 
of the tensor product composed with the canonical map F x F -► F ® F gives 
rise to a bilinear map on F x F. 
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Proposition 2.1. Let E — 0) be a direct sum. Then we have an isomor - 

i= 1 

phism 

F®E~@(F®E i ). 

i= 1 

Proof The isomorphism is given by abstract nonsense. We keep F fixed, 
and consider the functor t : X i—► F ® X. As we saw above, t is linear. We have 
projections n x{ \ E -► E of E on . Then 

71 i o 71 i = Tíi , 7 li o 71 j = 0 Ìf Ì # j, 

Z = id. 

1 = 1 

We apply the functor t, and see that 1 ( 71 ,) satisfies the same relations, hence gives 
a direct sum decomposition of i (E) = F ® E. Note that t(tt,) = id ® tt, . 

CoroUary 2.2. Let I be an indexing set , and E = E^. Then we have an 

i 6 / 

isomorphism 

(0 e) ® F » © (£,• (g) F). 

Proof. Let S be a finite subset of /. We have a sequence of maps 
(© e\ x F - 0 (£ ( (g) F) 0 (£ f ® F) 

\ieS / ieS ie/ 

the first of which is bilinear, and the second is linear, induced by the inclusion of 
5in/. The first is the obvious map. IfS c: 5', then a trivial commutative diagram 
shows that the restriction of the map 

(© £.ì x £ ^ © (£ f ® F) 

\i eS' / i e / 

induces our preceding map on the sum for i e S. But we have an injection 
(©£,) x F -> (©i £,.) x F. 

Hence by compatibility, we can define a bilinear map 

(© £ f ) x F - © (£, ® £), 

\i£/ / /6/ 
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and consequently a linear map 

(® e\ ® F -> ® (E t ® F). 

\iel J i e/ 

In a similar way, one defines a map in the opposite direction, and it is clear 
that these maps are inverse to each other, hence give an isomorphism. 

Suppose now that E is free, of dimension 1 over R . Let {t;} be a basis, and 
consider F ® E. Every element of F ® E can be written as a sum of terms y ® av 
with y e F and aeR. However, y ® av = ay ® v. In a sum of such terms, we can 
then use linearity on the left, 

y.'j ® V, y t e F. 

Hence every element is in fact of type y ® v with some yeF. 

We have a bilinear map 

F x 

such that (y, i—► ay, inducing a linear map 

F ® E i-> F. 

We also have a linear map F -► F ® E given by y i—► y (x) y. It is clear that these 
maps are inverse to each other, and hence that we have an isomorphism 

F ® E & F. 

Thus every element of F ® E can be written uniquely in the form y ® v 9 yeF. 


î (y. ®«) = ( Ê 

i= 1 \i= 1 


Proposition 2.3. Let E befree over R , with basis e/ . T/ien et;ery element 
of F ® E has a unique expression of the form 

S ® »i, j’iEf 

íe / 


vvií/i almost all = 0. 

Proof. This follows at once from the discussion of the 1-dimensional case, 
and the corollary of Proposition 2.1. 

Corollary 2.4. Leí L, F be /ree F, vv/í/î bases {Vi} ieI and {wj}j € j re- 
spectively. Then E (x) F is free 9 with basis {y* ® w 7 }. We have 


dim(F ® F) = (dim F)(dim F). 
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Proof. Immediate from the proposition. 

We see that when E is free over R , then there is no collapsing in the tensor 
product. Every element of F ® E can be viewed as a “formal” linear combina- 
tion of elements in a basis of E with coefficients in F. 

In particular, we see that R ® E (or E ® R) is isomorphic to £, under the 
correspondence x i—► x (x) 1. 

Proposition 2.5. Let £, F be free of finite dimension over R. Then we have an 

isomorphism 


End r (E) ® End R (F) - End*(£ ® F) 
which is the unique linear map such that 

f ®gy-+T(f,g) 
for f e End R (E) and g £ End*(F). 

[We note that the tensor product on the left is here taken in the tensor 
product of the two modules End R (E) and End*(F).] 

Proof Let {v t } be a basis of E and let {wj} be a basis of F. Then {v t ® vv^} 
is a basis of E (x) F. For each pair of indices (i'J') there exists a unique endo- 
morphism / = f i r of E and g = g jtj . of F such that 

f(vd = v v and f(v v ) = 0 if v / i 

g ( Wj ) = Wf and g(wf) = 0 if p # j. 


Furthermore, the families {f tV } and {gjjf are bases of End^F) and End*(F) 
respectively. Then 


T(f, g)(v v ® wj = 


v v ® w r 

0 


if (v, p) = (ij) 
if (v, p) # (ij). 


Thus the family {T(f ir , g jtj ’)} is a basis of End*(F ® F). Since the family 
{f tV ® g jtj f isabasisofEnd^F) ® End^(F), the assertion of our proposition is 
now clear. 


In Proposition 2.5, we see that the ambiguity of the tensor sign in / ® g is in 
fact unambiguous in the important special case of free, finite dimensional 
modules. We shall see later an important application of Proposition 2.5 when 
we discuss the tensor algebra of a module. 


Proposition 2.6. Let 


o -► E’ 4 E 4 E" -► 0 
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be an exact sequence , and F any module. Then the sequence 
F ®E' - F<g)£-F(g)F / -0 


is exact. 

Proof. Given x" eE" and yeF, there exists xeE such that x" = ^(x), an d 
hence y ® x" is the image of y ® x under the linear map 

F®E^F® E". 

Since elements of type y ® x" generate F ® E”, we conclude that the preceding 
linear map is surjective. One also verifies trivially that the image of 

F® E' -> F ®E 


is contained in the kernel of 


F ® E -> F ® E". 

Conversely, let / be the image of F ® E' -* F ® E, and let 

f:(F®E)/I^F®E" 
be the canonical map. We shall define a linear map 

g :F ® E" -> (F ® E)/I 

suchthat go f = id, Thisobviously willimply that/isinjective,and hencewill 
prove the desired converse. 

Let y e F and x" e E". Let xeE be such that i jj(x) = x". We define a map 
F x E" - (F (x) E)/I by letting 

(y, x")h -+y®x (mod /), 

and contend that this map is well defined, i.e. independent of the choice of x 
such that \//(x) = x". If ^(x^) = i//(x 2 ) = x", then ^(x t — x 2 ) = 0, and by 
hypothesis, x x — x 2 = ç(x') for some x' e E '. Then 

y ® Xi - y® x 2 = y ® (x^ - x 2 ) = y ® (p(x'). 

This shows that y ® x^ = y ® x 2 (mod /), and proves that our map is well 
defined. It is obviously bilinear, and hence factors through a linear map g , on 
the tensor product. It is clear that the restriction of g ° / on elements of type 
y ® x" is the identity. Since these elements generate F ® E", we conclude 
that / is injective, as was to be shown. 
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It is not always true that the sequence 

0->F®E' -+F®E->F®E" - 0 

is exact. It is exact if the first sequence in Proposition 2.6 splits, i.e. if E is 
essentially the direct sum of E ' and E". This is a trivial consequence of Pro- 
position 2.1, and the reader should carry out the details to get accustomed to the 
formalism of the tensor product. 

Proposition 2.7. Let a be an ideal of R. Let Ebe a module. Then the map 
( R/a ) x E -► E/aE induced by 

(a, x) i—► ax (mod a£), ae R, xe E 

is bilinear and induces an isomorphism 

(R/a ) (x) E A E/aE. 

Proof Our map (a, x) i—► ax (mod aE) clearly induces a bilinear map of 
R/a x E onto E/aE , and hence a linear map of R/a (x) E onto £/a£. We can 
construct an inverse, for we have a well-defined linear map 

E -► K/a (g) E 

such that x h-> T (x) x (where T is the residue class of 1 in R/á). It is clear that aE 
is contained in the kernel of this last linear map, and thus that we obtain a 
homomorphism 

E/aE -► R/a (x) E , 

which is immediately verified to be inverse to the homomorphism described in 
the statement of the proposition. 

The association E ► E/aE « R/a (g) £ is often called a reduction map. In 
§4, we shall interpret this reduction map as an extension of the base. 


§3. FLAT MODULES 

The question under which conditions the left-hand arrow in Proposition 2.6 
is an injection gives rise to the theory of those modules for which it is, and we 
follow Serre in calling them flat. Thus formally, the following conditions are 
equivalent, and define a flat module F, which should be called tensor exact. 

F 1. For every exact sequence 


E - E - E 
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the sequence 

F ® E' -► F ® E -► F ® E" 

is exact. 

F 2. For every short exact sequence 

0 -► E' -► E -+ E" -► 0 

the sequence 

0^>F®E'-*F®E^>F®E"^0 

is exact. 

F 3. For every injection 0 —> E' —> E the sequence 

0 -> F ® E' F ® E 


is exact. 

It is immediate that F 1 implies F 2 implies F 3. Finally, we see that F 3 implies 
F 1 by writing down the kernel and image of the map E’ -► E and applying F 3. 
We leave the details to the reader. 

The following proposition gives tests for flatness, and also examples. 

Proposition 3.1. 

(i) The ground ring is flat as module over itself. 

(ii) Let F = 0) F { be a direct sum. Then F is flat if and only if each F t is flat. 

(iii) A projective module is flat. 

The properties expressed in this proposition are basically categorical, cf. the 
comments on abstract nonsense at the end of the section. In another vein, we 
have the following tests having to do with localization. 

Proposition 3.2. 

(i) Let Sbe a multiplicative subset of R. Then S~ is flat over R. 

(ii) A module M is flat over R if and only if the localization M p is flat over R v 
for each prime ideal p of R. 

(iii) Let Rbea principal ring. A module F is flat if and only if F is torsion free. 

The proofs are simple, and will be left to the reader. More difficult tests for 
flatness will be proved below, however. 

Examples of non-flatness. If R is an entire ring, and a module M over R 
has torsion, then M is not flat. (Prove this, which is immediate.) 
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There is another type of example which illustrates another bad phenomenon. 
Let R be some ring in a finite extension K of Q, and such that R is a finite 
module over Z but not integrally closed. Let R' be its integral closure. Let p be 
a maximal ideal of R and suppose that pR' is contained in two distinct maximal 
ideals ^ and $P 2 . Then it can be shown that R' is not flat over /?, otherwise R' 
would be free over the local ring R v , and the rank would have to be 1, thus 
precluding the possibility of the two primes ^ and ^S 2 - It i s good practice for 
the reader actually to construct a numerical example of this situation. The same 
type of example can be constructed with a ring R = k[x,y], where k is an 
algebraically closed field, even of characteristic 0, and x, y are related by an 
irreducible polynomial equation f(x,y ) = 0 over k. We take R not integrally 
closed, such that its integral closure exhibits the same splitting of a prime p of 
R into two primes. In each one of these similar cases, one says that there is a 
singularity at p. 

As a third example, let R be the power series ring in more than one variable 
over a field k. Let m be the maximal ideal. Then m is not flat, because otherwise, 
by Theorem 3.8 below, m would be free, and if R = k[[x x ,. .., *„]], then x x , 

. . ., x n would be a basis for m over R, which is obviously not the case, since 
x x , x 2 are linearly dependent over R when n ^ 2. The same argument, of course, 
applies to any local ring R such that m/m 2 has dimension ^ 2 over R/m. 

Next we come to further criteria when a module is flat. For the proofs, we 
shall snake it all over the place. Cf. the remark at the end of the section. 

Lemma 3.3. Let F be flat, and suppose that 

0-N-M-F-0 

is an exact sequence. Then for any E, we have an exact sequence 
0 — N®F — M®F — F(x)F — 0. 

Proof. Represent E as a quotient of a flat L by an exact sequence 


0-X-L-F-0. 
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Then we have the following exact and commutative diagram: 

0 


N ® K -►M ®K - >F ® K ->0 


0- >N ® L -► M ® L -► F ® L 


N ® E -► M ® E 


0 0 

The top right 0 comes by hypothesis that F is flat. The 0 on the left comes from 
the fact that L is flat. The snake lemma yields the exact sequence 

0^N®E^M®E 


which proves the lemma. 

Proposition 3.4. Let 

0 - F' - F - F" - 0 

be an exact sequence , and assume that F" is flat. Then F is flat if and only if F' 
is flat . More generally , let 

0 _ F° _ F 1 --► F" - 0 

be an exact sequence such that F 1 ,..., F n are flat. Then F° is flat. 
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Proof. Let 0 E' -> E be an injection. We have an exact and commuta- 
tive diagram: 


0 

0-* F' ® E' -> F ® E' -> F" ® E' -> 0 

0-> F' ® E -> F ® E -> F" ® E 


The 0 on top is by hypothesis that F" is flat, and the two zeros on the left are 
justified by Lemma 3.3. If F' is flat, then the first vertical map is an injection, and 
the snake lemma shows that F is flat. If F is flat, then the middle column is an 
injection. Then the two zeros on the left and the commutativity of the left square 
show that the map F' ® E' -► F' ® E is an injection, so F' is flat. This proves the 
first statement. 

The proof of the second statement is done by induction, introducing kernels 
and cokernels at each step as in dimension shifting, and apply the first statement 
at each step. This proves the proposition 

To give flexibility in testing for flatness, the next two lemmas are useful, in 
relating the notion of flatness to a specific module. Namely, we say that F is 
ZT-flator flat for E, if for every monomorphism 

0 - E' - E 


the tensored sequence 


is also exact. 


0 -► F ® E' -► F ® E 


Lemma 3.5. Assume that F is E-flat. Then F is also flat for every submodule 
and every quotient module of E. 

Proof. The submodule part is immediate because if E\ c - E' 2 ^ E are 
submodules, and F ® E\ -> F ® E is a monomorphism so is F ® E\ —> F ® E' 2 
since the composite map with F ® E' 2 -> F ® E is a monomorphism. The only 
question lies with a factor module. Suppose we have an exact sequence 

Let M' be a submodule of M and E' its inverse image in E. Then we have a 
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commutative diagram of exact sequences: 

0- >N - >E' - >M' -►O 

II 

0- >N - >E - >M -►O. 

We tensor with F to get the exact and commutative diagram 

0 K 


F ®N -► F ® E' -> F ®M' -> 0 


0- >F ®N - > F ® E - F ® M 


0 

where K is the questionable kernel which we want to prove is 0. But the snake 
lemma yields the exact sequence 

0-K-0 


which concludes the proof. 

Lemma 3.6. Let {£,} be a family of modules , and suppose that F is flat for each 
Ei. Then F is flat for their direct sum. 

Proof Let E = (0 E t be their direct sum. We have to prove that given any 
submodule E' of E, the sequence 

O->F®£'->F®£ = 0F®E| 


is exact. Note that if an element of F ® E' becomes 0 when mapped into the 
direct sum, then it becomes 0 already in a finite subsum, so without loss of 
generality we may assume that the set of indices is finite. Then by induction, 
we can assume that the set of indices consists of two elements, so we have two 
modules E x and E 2 , and E = E\ © E 2 . Let N be a submodule of E. Let N\ 
= N í) E { and let N 2 be the image of N under the projection on E 2 . Then 
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we have the following commutative and exact diagram: 

0 0 


N j- >N - >N 2 -►0 

0- >E { - >E - >E 2 

Tensoring with F we get the exact and commutative diagram: 

0 0 

F ® Ni -> F ® N - >F ® N 2 -> 0 


0- E x - > F ®E - >F® E 2 

The lower left exactness is due to the fact that E = E x ® E 2 . Then the snake 
lemma shows that the kernel of the middle vertical map is 0. This proves the 
lemma. 

The next proposition shows that to test for flatness, it suffices to do so only 
for a special class of exact sequences arising from ideals. 


Proposition 3.7. F is flat if and only if for every ideal aofR the natural map 

û ® F -> aF 

is an isomorphism. In fact , F is flat if and only for every ideal aofR tensoring 
the sequence 


0 —► a —► R —► R/a —► 0 
mth F yields an exact sequence. 

Proof If F is flat, then tensoring with F and using Proposition 2.7 shows 
that the natural map is an isomorphism, because aM is the kernel of M -> M/aM. 
Conversely, assume that this map is an isomorphism for all ideals a. This means 
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that F is R-fì at. By Lemma 3.6 it follows that F is flat for an arbitrary direct sum 
of R with itself, and since any module M is a quotient of such a direct sum, 
Lemma 3.5 implies that F is M-flat, thus concluding the proof. 

Remark on abstract nonsense. The proofs of Proposition 3.1(i), (ii), (iii), 
and Propositions 3.3 through 3.4 are basically rooted in abstract nonsense, 
and depend only on arrow theoretic arguments. Specifically, as in Chapter XX, 
§§, suppose that we have a bifunctor T on two distinct abelian categories û and 
(B such that for each A, the functor B »-» T(A, B) is right exact and for each B 
the functor A «-* T(A, B) is right exact. Instead of “flat > ’ we call an object A 
of û íj-exact if B < * T(A, B) is an exact functor; and we call an object L of (B 
T-exact if A T(A, L) is exact. Then the references to the base ring and free 
modules can be replaced by abstract nonsense conditions as follows. 

In the use of L in Lemma 3.3, we need to assume that for every object E of B 
there is a 'T-exact L and an epimorphism 

L - E -► 0. 

For the analog of Proposition 3.7, we need to assume that there is some 
object R in (B for which F is /?-exact, that is given an exact sequence 

0 ^a^R 

then 0 -► T(F, a) -► T(T, R) is exact; and we also need to assume that R is a 
generator in the sense that every object B is the quotient of a direct sum of R with 
itself, then over some family of indices, and T respects direct sums. 

The snake lemma is valid in arbitrary abelian categories, either because its 
proof is “functorial,” or by using a representation functor to reduce it to the 
category of abelian groups. Take your pick. 

In particular, we really don’t need to have a commutative ring as base ring, 
this was done only for simplicity of language. 

We now pass to somewhat different considerations. 

Theorem 3.8. Let Rbe a commutative local ring , and let M be a finite flat 
module over R. Then M is free. Infact , if x l9 ..., x n e M are elements of M 
whose residue classes are a basis of M/mM over R/ m, then x l5 ..., x n form 
a basis of M over R. 

Proof. Let R {n) -> M be the map which sends the unit vectors of R {n) on 
x í9 ... 9 x n respectively, and let N be its kernel. We get an exact sequence 


0 -N-R (n) -M, 
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whence a commutative diagram 

m (x) N -► m (g) R (n) -► m (x) M 

f 0 h 

0- *N - >R in) -►M 

in which the rows are exact. Since M is assumed flat, the map h is an injection. 
By the snake lemma one gets an exact sequence 

0 -► coker / -► coker g — coker /i, 

and the arrow on the right is merely 

R (n) /mR (n) - M/mM, 

which is an isomorphism by the assumption on x lr ..,x r It follows that 
coker/ = 0, whence m N = N, whence N = 0 by Nakayama if R is Noetherian, 
so N is finitely generated. If R is not assumed Noetherian, then one has to add 
a slight argument as follows in case M is finitely presented. 

Lemma 3.9. Assume that M is finitely presented , and let 

0—N—£-M-0 

be exact , mth E finite free. Then N is finitely generated. 

Proof. Let 

Lj — L 2 — M — 0 

be a finite presentation of M, that is an exact sequence with L u L 2 finite free. 
Using the freeness, there exists a commutative diagram 



such that L 2 -► L is surjective. Then the snake lemma gives at once the exact 
sequence 

0 — coker(L! — N) — 0, 

so coker(L! — N) = 0, whence N is an image of L x and is therefore finitely 
generated, thereby proving the lemma, and also completing the proof of Theorem 
3.8 when M is finitely presented. 
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We still have not proved Theorem 3.8 in the fully general case. For this we 
use Matsumura’s proof (see his Commutative Algebra , Chapter 2), based on the 
following lemma. 

Lemma 3.10. Assume that M is flat over R. Let a t e A, x t e Mfor i = 1, 
. . . , n, and suppose that we have the relation 

n 

Za iXi = 0 . 

i — 1 

Then there exists an integer s and elements b tj e A and M (j = 1,..., 5 ) 
such that 


Yj a i b ij = 0 for all j and x, = £ b^yj for all i. 

» j 

Proof We consider the exact sequence 

R (n) - R 

where the map R (n) R is given by 

n 

(h l5 ..., b n ) X ciibi , 

i - 1 

and K is its kernel. Since M is flat it follows that 

K ® M -> M (n) ^ M 
is exact, where / M is given by 

n 

/ M (zi,...,z„) = Xa.Z;. 

i = 1 

Therefore there exist elements fij e K and ^ g M such that 

(x„...,x„) = f J p J y j . 

í= 1 

Write fj = (b l7 -,..., with e /?. This proves the lemma. 

We may now apply the lemma to prove the theorem in exactly the same way 
we proved that a finite projective module over a local ring is free in Chapter X, 
Theorem 4.4, by induction. This concludes the proof. 

Remark. In the applications I know of, the base ring is Noetherian, and so 
one gets away with the very simple proofgiven at first. I did not want to obstruct 
the simplicity of this proof, and that is the reason I gave the additional tech- 
nicalities in increasing order of generality. 
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Applications of homology. We end this section by pointing out a connection 
between the tensor product and the homological considerations of Chapter XX, 
§8 for those readers who want to pursue this trend of thoughts. The tensor product 
is a bifunctor to which we can apply the considerations of Chapter XX, §8. Let 
M, N be modules. Let 


-► -£ 0 -A*- 0 

be a free or projective resolution of M, i.e. an exact sequence where E ( is free or 
projective for all i ^ 0. We write this sequence as 

E m - M - 0. 

Then by definition, 

Tor^M, N) = /-th homology of the complex E (x) N, that is of 

• • • — E t ® N — Ei_ — • • • — — 0. 

This homology is determined up to a unique isomorphism. I leave to the reader 
to pick whatever convention is agreeable to fix one resolution to determine a 
fixed representation of Tor,(M, N), to which all others are isomorphic by a 
unique isomorphism. 

Since we have a bifunctorial isomorphism M (g) N « N ® M, we also get a 
bifunctorial isomorphism 

Tor,(M, N) « Tor f (JV, M) 

for all i . See Propositions 8.2 and 8.2' of Chapter XX. 

Following general principles, we say that M has Tor-dimension ^ d if 
Tor z (M, N) = 0 for all i > d and all jV. From Chapter XX, §8 we get the follow- 
ing result, which merely replaces T-exact by flat. 

Theorem 3.11. The follomng three conditions are equivalent concerning a 
module M. 

(i) M is flat. 

(ii) Torj(M, N) = 0 for all N. 

(iii) Tor,(M, N) = 0 for all i ^ 1 and all N, in other words , M has Tor- 
dimension 0. 

Remark. Readers willing to use this characterization can replace some of 
the preceding proofs from 3.3 to 3.6 by a Tor-dimension argument, which is 
more formal, or at least formal in a different way, and may seem more rapid. 
The snake lemma was used ad hoc in each case to prove the desired result. The 
general homology theory simply replaces this use by the corresponding formal 
homological step, once the general theory of the derived functor has been carried 
out. 
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§4. EXTENSION OF THE BASE 


Let R be a commutative ring and let E be a R-module. We specify R since 
we are going to work with several rings in a moment. Let R -> R' be a homo- 
morphism of commutative rings, so that R' is an /?-algebra, and may be viewed as 
an /?-module also. We have a 3-multilinear map 

R' x R' x £ -► R' ®E 

defined by the rule 

(a, h, x) h-► ab (x) x. 

This induces therefore a R-linear map 

R' ®(R' ® E) -► R' ®E 

and hence a R-bilinear map R' x ( R' (x) £)-►£' (x) £. It is immediately verified 
that our last map makes R' (x) E into a R'- module, which we shall call the 
extension of E over R\ and denote by E R - . We also say that E R > is obtained by 
extension of the base ring from R to R'. 

Example 1. Let a be an ideal of R and let R -► R/a be the canonical homo- 
morphism. Then the extension of E to R/a is also called the reduction of E 
modulo a. This happens often over the integers, when we reduce modulo a prime 
p (i.e. modulo the prime ideal (p )). 

Example 2. Let R be a fìeld and R' an extension field. Then E is a vector 
space over R , and E R > is a vector space over R'. In terms of a basis, we see that 
our extension gives what was alluded to in the preceding chapter. This example 
will be expanded in the exercises. 

We draw the same diagrams as in field theory: 



to visualize an extension of the base. From Proposition 2.3, we conclude: 


Proposition 4.1. Let E be a free module over K, mth basis {Vi} i€l . Let 
v'i = 1 (x) v t . Then E R > is a free module over R\ mth basis {v'i} ieI . 


We had already used a special case of this proposition when we observed that 
the dimension of a free module is defined, i.e. that two bases have the same 
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cardinality. Indeed, in that case, we reduced modulo a maximal ideal of R to 
reduce the question to a vector space over a field. 

When we start changing rings, it is desirable to indicate R in the notation 
for the tensor product. Thus we write 

E r , = R'®E = R' ® r E . 

Then we have transitivity of the extension of the base , namely, if R ^ R' -> R" is a 
succession of homomorphisms of commutative rings, then we have an iso- 
morphism 

R ® R E & R ® R ’ (R ® R E) 

and this isomorphism is one of /T-modules. The proof is trivial and will be left 
to the reader. 

If E has a multiplicative structure, we can extend the base also for this 
multiplication. Let R -► A be a ring-homomorphism such that every element in 
the image of R in A commutes with every element in A (i.e. an /?-algebra). Let 
R -► R' be a homomorphism of commutative rings. We have a 4-multilinear 
map 

R' x A x R' x A -> R' ® A 

defined by 

(a, x, fc, y)y-+ab ® xy. 

We get an induced R -linear map 

R' ® A® R' ® A -► R' ® A 
and hence an induced R-bilinear map 

(R' ® A) x ( R' ® A) -> R' ® A. 

It is trivially verified that the law of composition on R' ® A we have just 
defined is associative. There is a unit element in R' ® A , namely, 1 ® 1. We 
have a ring-homomorphism of R' into R' ® A, given by a i-» a ® 1. In this way 
one sees at once that R' ® A = A R > is an /^-algebra. We note that the map 

xy-+l ® x 

is a ring-homomorphism of A into R' ® A, and that we get a commutative 
diagram of ring homomorphisms, 

R' ® A = A r > 
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For the record, we give some routine tests for flatness in the context of base 
extension. 

Proposition 4.2. Let R -> Abe an R-algebra , and assume A commutative. 

(i) Base change. If F is a flat R-module , then A F is a flat A-module. 

(ii) Transitivity. IfA is aflat commutative R-algebra andM is a fiat A-module , 
then M is flat as R-module. 

The proofs are immediate, and will be left to the reader. 


§5. SOME FUNCTORIAL ISOMORPHISMS 


We recall an abstract definition. Let 81, 93 be two categories. The functors 
of 81 into 93 (say covariant, and in one variable) can be viewed as the 
objects of a category, whose morphisms are defined as follows. If L, M are two 
such functors, a morphism H: L -> M is a rule which to each object X of 91 
associates a morphism H x : L(X) -► M(X) in 93, such that for any morphism 
/: X -► Y in 81, the following diagram is commutative: 


L(X) —» M(X) 


Uf)\ 

L(Y) 


Hy 


M(f) 

>M(Y) 


We can therefore speak of isomorphisms of functors. We shall see examples of 
these in the theory of tensor products below. In our applications, our categories 
are additive, that is, the set of morphisms is an additive group, and the composi- 
tion law is Z-bilinear. In that case, a functor L is called additive if 

L(f + g) = L(f) + L(g). 

We let R be a commutative ring, and we shall consider additive functors from 
the category of K-modules into itself. For instance we may view the dual 
module as a functor, 


L(L, R) = Hom^L, R). 
Similarly, we have a functor in two variables, 

(£, F) L(L, F) = Hom K (L, F), 


contravariant in the first, covariant in the second, and bi-additive. 
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We shall give several examples of functorial isomorphisms connected with 
the tensor product, and for this it is most convenient to state a general theorem, 
giving us a criterion when a morphism of functors is in fact an isomorphism. 

Proposition 5.1. Let L, M be two functors (both covariant or both contra- 
variant) from the category of R-modules into itself Assume that both functors 
are additive. Let H: L -> M bea morphism of functors. IfH E : L(E) -» M(E) 
is an isomorphismfor every ì-dimensional free module E over R , then H E is an 
isomorphism for every finite-dimensional free module over R. 

Proof. We begin with a lemma. 

Lemma 5.2. Let E and E t (i = 1,..., m) be modules over a ring. Let 
(pi: Ei~+ E and ^ : E -► L, be homomorphisms having the following properties: 

\j/i o (pi = id, \pi o(pj = 0 if i / j 
m 

Z ° ýi = id > 

i — 1 


Then the map 




is an isomorphism of E onto the direct product [ j £,, and the map 

1 = 1 

(x 1 ,...,X m )h-MíP 1 X 1 + ••• + (fì m X m 

is an isomorphism of the product onto E. Conversely , if E is equal to the direct 
sum of submodules E t (i = 1,..., m), if we let ifi be the inclusion of E t in L, 
and (Pi the projection of E on L f , then these maps satisfy the above-mentioned 
properties. 

Proof. The proof is routine, and is essentially the same as that of Proposition 
3.1 of Chapter III. We shall leave it as an exercise to the reader. 

We observe that the families {(Pi) and {i/'J satisfying the properties of the 
lemma behave functorially: If T is an additive contravariant functor, say, then 
the families { T (i/^)} and {T (<p f )} also satisfy the properties ofthe lemma. Similarly 
if T is a covariant functor. 

To apply the lemma, we take the modules £, to be the 1-dimensional 
components occurring in a decomposition of E in terms ofa basis. Let us assume 
for instance that L, M are both covariant. We have for each module E a com- 
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mutative diagram 

L(E) -> Af(£) 

UEd-^MiEd 

and a similar diagram replacing <p, by reversing the two vertical arrows. 
Hence we get a direct sum decomposition of L(E) in terms of L(^,) and L((pi\ 
and similarly for M(E\ in terms of M(^) and M((pi). By hypothesis, H E . is an 
isomorphism. It then follows trivially that H E is an isomorphism. For instance, 
to prove injectivity, we write an element v e L(E) in the form 

V = X L(<Pi)vi, 

with Vi e L(Ei). If H e v = 0, then 

0 = Z H e L((Pì) Vì = X M(<Pi)H Et v i9 

and since the maps M(<p,) (i = 1,..., m) give a direct sum decomposition of 
M(£), we conclude that H e .Vì = 0 for all i , whence v { = 0, and t; = 0. The 
surjectivity is equally trivial. 

When dealing with a functor of several variables, additive in each variable, 
one can keep all but one of the variables fixed, and then apply the proposition. 
We shall do this in the following corollaries. 

Corollary 5.3. Let E' , L, F', F be free and finite dimensional over R. Then we 
have a functorial isomorphism 

L(E\ E) ® L(F\ F) -+ L(E' ®F\E® F) 

such that 

f T(f 9 g). 

Proof Keep E, F\ F fixed, and view L(E\ E) ® L(F\ F) as a functor in the 
variable E. Similarly, view 

L(E' ®F\E® F) 

as a functor in E\ The map/ ® g i—► T(/, g) is functorial, and thus by the lemma, 
it suffices to prove that it yields an isomorphism when E has dimension 1. 
Assume now that this is the case; fix E of dimension 1, and view the two 
expressions in the corollary as functors of the variable E. Applying the lemma 
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again, it suffices to prove that our arrow is an isomorphism when E has di- 
mension 1. Similarly, we may assume that F, F' have dimension 1. In that 
case the verification that the arrow is an isomorphism is a triviality, as desired. 

Corollary 5.4. Let £, F be free and finite dimensional. Then we have a 
natural isomorphism 

End^L) ® End R (F) -► End*(L ® F). 

Proof. Special case of Corollary 5.3. 

Note that Corollary 5.4 had already been proved before, and that we 
mention it here only to see how it fits with the present point of view. 

Corollary 5.5. Let E , F be free finite dimensional over R. There is a func - 
torial isomorphism 


E v ® F —» L(E, F) 

given for X e E v and y e F by the map 

X®y^ Á^y 

where A^ y is such that for all x e E, we have A^ y (x) = À(x)y. 

The inverse isomorphism of Corollary 5.5 can be described as follows. 
Let {v\,...,v n } be a basis of F, and let ,..., t; n v } be the dual basis. If 
A e L(F,F), then the element 

±vf ®A(vì) eE v ®F 

i=i 

maps to A. In particular, if E — F, then the element mapping to the identity id^ 
is called the Casimir element 

£ v ï ® v h 

i= 1 

independent of the choice of basis. Cf. Exercise 14. 

To prove Corollary 5.5, justify that there is a well-defined homomorphism 
of E v ®F to L(E,F ), by the formula written down. Verify that this homo- 
morphism is both injective and surjective. We leave the details as exercises. 
Differential geometers are very fond of the isomorphism 

L(E, E)^> E v ®E, 

and often use E v ® E when they think geometrically of L(F, E), thereby em- 
phasizing an unnecessary dualization, and an irrelevant formalism, when it is 
easier to deal directly with L(L, E). In differential geometry, one applies 
various functors L to the tangent space at a point on a manifold, and elements 
of the spaces thus obtained are called tensors (of type L). 
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Corollary 5.6. Let E , F be free and finite dimensional over R. There is a 
functorial isomorphism 


£ v ® F v (E® F) v . 
given for x v e E y and y v e F v by the map 

* v ® / À, 

where À w ì’mc/z í/iû/, /cr all x e E and y e F, 

À(x ® y) = (*, * v )(y, y v ). 


Proo/ As before. 

Finally, we leave the following results as an exercise. 

Proposition 5.7. Let E be free and finite dimensional over R. The trace 
function on L(E,E) is equal to the composite of the two maps 

L(E , E) £ v ® E R, 

where the first map is the inverse of the isomorphism described in Corollary 5.5, 
and the second map is induced by the bilinear map 

(X v , X) h-> (x, X V ). 

Of course, it is precisely in a situation involving the trace that the iso- 
morphism of Corollary 5.5 becomes important, and that the finite dimen- 
sionality of E is used. In many applications, this finite dimensionality plays 
no role, and it is better to deal with L(E , E) directly. 


§6. TENSOR PRODUCT OF ALGEBRAS 

In this section, we again let R be a commutative ring. By an R- algebra we 
mean a ring homomorphism R —> A into a ring A such that the image of R is 
contained in the center of A. 

Let A, B be R- algebras. We shall make A ® B into an /?-algebra. Given 
(> a , b) e A x B, we have an /?-bilinear map 

M a ,b : A x B —> A ® B such that M a h (a\ b') = aa' ® bb'. 

Hence M a b induces an /?-linear map m a b : A ® B —> A ® B such that 
m a b (a\ b’) = aa' ® bb'. But m a b depends bilinearly on a and b , so we obtain 
finally a unique /?-bilinear map 

A® B x A® B A® B 
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such that ( a ® b)(a' ® b’) = aa' ® bb’. This map is obviously associative, and 
we have a natural ring homomorphism 

R —> A ® B given by ch>1®c = c®1. 

Thus A ® B is an /?-algebra, called the ordinary tensor product 

Application: commutative rings 

We shall now see the implication of the above for commutative rings. 

Proposition 6.1. Finite coproducts exist in the category of commutative 
rings, and in the category of commutative algebras over a commutative ring. 
IfR^A and /? —> B are two homomorphisms of commutative rings, then their 
coproduct over R is the homomorphism R —> A ® B given by 

a i—► a ® 1 = 1 ® a. 

Proof We shall limit our proof to the case of the coproduct of two ring 
homomorphisms R —> A and R —> B. One can use induction. 

Let A , B be commutative rings, and assume given ring-homomorphisms into 
a commutative ring C, 

cp : A -* C and \jj : B ^ C. 

Then we can define a Z-bilinear map 

A x B —► C 

by (x, y) h-* (p(x)\jj(y). From this we get a unique additive homomorphism 

A ® £-► C 

such that x ® y m► cp(x)\/j(y). We have seen above that we can define a ring 
structure on A ® B, such that 

(a ® b)(c ® d) = ac ® bd. 

It is then clear that our map A ® B -> C is a ring-homomorphism. We also have 
two ring-homomorphisms 

-4 /1 ® B and B A ® B 

given by 

x i—► x ® 1 and y i—► 1 ® y. 

The universal property of the tensor product shows that (A ® B, /, g) is a 
coproduct of our rings A and B. 

If A, B , C are /Calgebras, and if (/?, \jj make the following diagram com- 
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mutative, 



then A ® B is also an R-algebra (it is in fact an algebra over R, or A, or B , de- 
pending on what one wants to use), and the map A (g) B -> C obtained above 
gives a homomorphism of R-algebras. 

A commutative ring can always be viewed as a Z-algebra (i.e. as an algebra 
over the integers). Thus one sees the coproduct of commutative rings as a 
special case of the coproduct of R-algebras. 

Graded Algebras. Let G be a commutative monoid, written additively. By 
a G-graded ring, we shall mean a ring A, which as an additive group can be 
expressed as a direct sum. 


A = ® ^r> 

r e G 

and such that the ring multiplication maps A r x A s into A r+S , for all r, seG. 
In particular, we see that A 0 is a subring. 

The elements of A r are called the homogeneous elements of degree r. 

We shall construct several examples of graded rings, according to the 
following pattern. Suppose given for each reG an abelian group A r (written 
additively), and for each pair r, s e G a map A r x A s ^> A r+S . Assume that A 0 
is a commutative ring, and that composition under these maps is associative and 
A 0 -bilinear. Then the direct sum A = @ A r is a ring: We can define multiplica- 

reG 

tion in the obvious way, namely 


Z*r4 

\reC / \seG / leG \r + s = f / 


The above product is called the ordinary product. However, there is another 
way. Suppose the grading is in Z or Z/2Z. We define the super product of 
x e A r and y e A s to be (— l) r5 xy, where xy is the given product. It is easily veri- 
fied that this product is associative, and extends to what is called the super 
product A ® A —> A associated with the bilinear maps. If R is a commutative 
ring such that A is a graded /?-algebra, i.e. RA r C A r for all r (in addition to the 
condition that A is a graded ring), then with the super product, A is also an 
/?-algebra, which will be denoted by A su , and will be called the super algebra 
associated with A. 
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Example. In the next section, we shall meet the tensor algebra T(E ), which 
will be graded as the direct sum of T r (E), and so we get the associated super 
tensor algebra T SU (E) according to the above recipe. 

Similarly, let A, B be graded algebras (graded by the natural numbers as 
above). We define their super tensor product 

A ® su B 

to be the ordinary tensor product as graded module, but with the super product 

(a <8> b)(a' ® b') = (-1)^^ a ') aa ' ® bb' 

if b, a' are homogeneous elements of B and A respectively. It is routinely verified 
that A @ su B is then a ring which is also a graded algebra. Except for the sign, 
the product is the same as the ordinary one, but it is necessary to verify associati vity 
explicitly. Suppose a' e A h b e Bj, a" e A s , and b' e B r . Then the reader will 
find at once that the sign which comes out by computing 

(a ® su b)(a' ® su b')(a" ® su b") 

in two ways turns out to be the same, namely (— l) í - /+ - / ‘ 9+ ‘ çr . Since bilinearity is 
trivially satisfied, it follows that A ® su B is indeed an algebra. 

The super product in many ways is more natural than what we called the 
ordinary product. For instance, it is the natural product of cohomology in topol- 
ogy. Cf. Greenberg-Harper, Algebraic Topology, Chapter 29. For a similar con- 
struction with Z/2Z-grading, see Chapter XIX, §4. 


§7. THE TENSOR ALGEBRA OF A MODULE 

Let R be a commutative ring as before, and let E be a module (i.e. an 
R- module). For each integer r ^ 0, we let 

r 

T r {E) = (X) E and T°(£) = R. 

i = 1 

Thus T r (E) = E (g) • • • (g) E (tensor product taken r times). Then T r is a functor, 
whose effect on linear maps is given as follows. If/: E -► F is a linear map, then 

T r (f) = T(f ,...,/) 


in the sense of §1. 

From the associativity of the tensor product, we obtain a bilinear map 
T r (E) x T S (E) - T r+S (E), 
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which is associative. Consequently, by means of this bilinear map, we can define 
a ring structure on the direct sum 


T(E) = 0 T'(E), 

r = 0 

and in fact an algebra structure (mapping R on T°(E) = R). We shall call T(E) 
the tensor algebra of E , over R . It is in general not commutative. If x, y e T(£), 
we shall again write x (x) y for the ring operation in T(E). 

Let /: E -► F be a linear map. Then / induces a linear map 

T r (f): T r (E) - T r (F) 

for each r ^ 0, and in this way induces a map which we shall denote by T(f) on 
T(E). (There can be no ambiguity with the map of §1, which should now be 
written T l (f\ and is in fact equal to / since T l (E) = E.) It is clear that T(f) is 
the unique linear map such that for x Í9 ..., x r e E we have 


T(/)(xi ® • • • ® x r ) = /(Xj) ® ■ • • ® /(x r ). 

Indeed, the elements of T'(E) = E are algebra-generators of T(E) over R. We 
see that T(f) is an algebra-homomorphism. Thus T may he viexved as a functor 
from the category of modules to the category of graded algebras , T(f) being a 
homomorphism of degree 0. 

When E is free and finite dimensional over R , we can determine the structure 
of T(E) completely, using Proposition 2.3. Let P be an algebra over k. We shall 
say that P is a non-commutative polynomial algebra if there exist elements 
t ì9 ..., t n 6 P such that the elements 

Aí( 0 (0 = íi, • 

with 1 g i v ^ n form a basis of P over £. We may call these elements non- 
commutative monomials in (í). As usual, by convention, when r = 0, the 
corresponding monomial is the unit element of P. We see that t ì9 ..., t n generate 
P as an algebra over fc, and that P is in fact a graded algebra, where P r consists of 
linear combinations of monomials t ix • • • t ir with coefficients in R. It is natural to 
say that t í9 ... 9 t n are independent non-commutative variables over R. 

Proposition 7.1. Let E befree of dimension n over R. Then T(E) is isomorphic 
to the non-commutative polynomial algebra on n variables over R. In other 
words , if {v ì9 ... ,v n } is a basis of E over P, then the elements 

M {i) (v) = Vi , ® • • • ® Vi v9 1 íkivíkn 

form a basis of T r (E\ and every element of T(E) has a unique expression as a 
finite sum 

Z a (0 A/<i)(tf), «(,) 6 E 

(*> 
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with almost all a (i) equal to 0. 

Proof. This follows at once from Proposition 2.3. 

The tensor product of linear maps will now be interpreted in the context of 
the tensor algebra. 

For convenience , we shaii denote the module of endomorphisms End R (E) by 
L{E ) for the rest of this section. 

We form the direct sum 


(LT)(E) = 0 L(T r (E)), 

r = 0 

whichweshallalsowrite LT(E) forsimplicity. (Ofcourse, LT(E )isnotequalto 
End r (T(E)) 9 so we must view LT as a single symbol.) We shall see that LT is a 
functor from modules to graded algebras, by defining a suitable multiplication 
on LT(E). Let / e L(T r (E)\ g e L(T S (E)\ h e L(T m (E)). We define the product 
fg e L(T r + s (E)) to be T(f,g ), in the notation of §1, in other words to be the 
unique linear map whose effect on an element x (x) y with x e T r (E) and 
y e T S (E) is 


x (g y (_► f(x) ® g(y). 

In view of the associativity of the tensor product, we obtain at once the as- 
sociativity (fg)h = f(gh) 9 and we also see that our product is bilinear. Hence 
LT(E) is a /c-algebra. 

We have an algebra-homomorphism 

T(L(E)) - LT(E) 

given in each dimension r by the linear map 

/l ® ® fr T(f u ...,/)= /l ' ’ ' /• 

We specify here that the tensor product on the left is taken in 

L(E) (g) • • • (g) L(E). 

We also note that the homomorphism is in general neither surjecti ve nor injective. 
When E is free finite dimensional over R , the homomorphism turns out to be 
both, and thus we have a clear picture of LT(E) as a non-commutative poly- 
nomial algebra, generated by L(E). Namely, from Proposition 2.5, we obtain: 

Proposition 7.2. Let E be free,fìnite dimensional over R. Then we have an 
algebra-isomorphism 

oo 

T(L(E)) = T(End R (£)) - LT(E) = 0End R (T r (E)) 

r= 0 
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given by 


Proof. By Proposition 2.5, we have a linear isomorphism in each dimen- 
sion, and it is clear that the map preserves multiplication. 

In particular, we see that LT(E) ìs a noncommutative polynomial algebra. 


§8. SYMMETRIC PRODUCTS 

Let S„ denote the symmetric group on n letters, say operating on the integers 
(1,..., n). An r-multilinear map 


/: E (r) -► F 

is said to be symmetric if f(x u ..., x r ) = /(x^,..., x^) for all o e S r . 
In T r (E ), we let b r be the submodule generated by all elements of type 

Xj ® ® x r - x ail) ® ® x ff(r) 

for all x f e E and o e S r . We define the factor module 

S r (E) = T r (E)/b r , 


and let 


S(E) = 0 S r (E) 

r = 0 

be the direct sum. It is immediately obvious that the direct sum 

b= ©b r 

r = 0 

is an ideal in T(E\ and hence that S(E) is a graded R- algebra, which is called the 
symmetric algebraof E. 

Furthermore, the canonical map 

E ir) - S r (E) 

obtained by composing the maps 

E (r) T r (E)/b r = S r (E) 

is univérsal for r-multilinear symmetric maps. 
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We observe that S is a functor, from the category of modules to the category 
of graded R-algebras. The image of (x l5 ..., x r ) under the canonical map 

£ (r) - S r (E) 


will be denoted simply by x^ • x r . 

Proposition 8.1. Let E be free of dimension n over R. Let {v u ..., v n } be a 
basis of E over k. Viewed as elements ofS\E) in S(E\ these basis elements are 
algebraically independent over R , and S(E) is therefore isomorphic to the 
polynomial algebra in n variables over R. 

Proof. Let t ì9 ...,t n be algebraically independent variables over R , and 
form the polynomial algebra R[t ì9 ..., îj. Let P r be the R-module of homo- 
geneous polynomials of degree r. We define a map of E (r) -► P r as follows. If 
Wj,..., w r are elements of E which can be written 

n 

Wi = £ a iv v v , i= 1, r, 

V = 1 

then our map is given by 

( w i>..., w r )i—► (fljjíj + • • • + a ìn t n ) • • • (n^Í! + • • • + a rn t n ). 

It is obvious that this map is multilinear and symmetric. Hence it factors 
through a linear map of S r (E) into P r : 


P r 

From the commutativity of our diagram, it is clear that the element v ix • • • v is in 
S r (E) maps on t ix • • • t is in P r for each r-tuple of integers (i) — (i u ..., i r ). Since 
the monomials M (i) (t) of degree r are linearly independent over k , it follows that 
the monomials M {i) (v) in S r (E) are also linearly independent over R , and that 
our map S r (E) -► P r is an isomorphism. One verifies at once that the multiplica- 
tion in S(E) corresponds to the multiplication of polynomials in R[t] 9 and thus 
that the map of S(E) into the polynomial algebra described as above for each 
component S r (E) induces an algebra-isomorphism of S(E) onto K[r], as desired. 

Proposition 8.2. Let E = E' © E" be a direct sum of finite free modules. 

Then there is a natural isomorphism 

S n (E' © E") * 0 S p E' ® S q E". 

p + q-n 

In fact , this is but the n-part of a graded isomorphism 
S(E' © E") x SE' ® SE". 
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Proof. The isomorphism comes from the following maps. The inclusions 
of E' and E" into their direct sum give rise to the functorial maps 

SE' ® SE" -> SE , 

and the claim is that this is a graded isomorphism. Note that SE' and SE" are 
commutative rings, and so their tensor product is just the tensor product of 
commutative rings discussed in §6. The reader can either give a functorial map 
backward to prove the desired isomorphism, or more concretely, SE' is the 
polynomial ring on a finite family of variables, SE" is the polynomial ring in 
another family of variables, and their tensor product is just the polynomial ring 
in the two families of variables. The matter is easy no matter what, and the 
formal proof is left to the reader. 


EXERCISES 

1. Let k be a field and k( a) a fimte extension. Let /(X) = Irr(a, k, X), and suppose that/is 
separable. Let k' be any extension of k. Shovv that /c(a) 0 k' is a direct sum of fields. 
If k’ is algebraically closed, show that these fields correspond to the embeddings of 
/c(a) ìn k'. 

2. Let k be a field, f(X) an ìrreducible polynomial over /c, and a a root of /. Show that 
/c(a) 0 k' is isomorphic, as a /c'-algebra, to k'[X]/(f(X)). 

3. Let E be a finite extension of a field k. Show that E is separable over k if and only if 
E 0* L has no nilpotent elements for all extensions L of /c, and also when L = /c a . 

4. Let <p : A —» B be a commutative ring homomorphism. Let E be an A-module and F 
a fi-module. Let F A be the 4-module obtained from F via the operation of A on F 
through <p, that is for y E F A and a E A this operation is given by 

(a, y) h-> (p(a)y. 

Show that there ìs a natural isomorphism 

Hom 5 (fí 0 A L, F) % Hom^E, F A ). 

5. The norm. Let B be a commutative algebra over the commutative ring R and assume 
that B is free of rank r. Let A be any commutative R- algebra. Then A 0 B is both 
an A-algebra and a L-algebra. We view A 0 B as an A-algebra, which is also free 
of rank r. If [e x ,. . . , e r j is a basis of B over R, then 

\ A 0 e í9 .... \ A 0 e r 

is a basis of A 0 B over A. We may then define the norm 

N = N a< ^ ba :A(S)B—> A 


as the unique map which coincides with the determinant of the regular representation. 
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In other words, if be B and b B denotes multiplication by b , then 


N BtR (b) = det (b B ); 


and similarly after extension of the base. Prove: 

(a) Let q>: A -* C be a homomorphism of R- algebras. Then the following diagram 
is commutative: 


A ®B — » C (g) 5 


1 


r 

-► C 

<p 


(b) Let x, ye/10fî. Then N(x ® fl y) = N(x) ® N(y). [Hint: 
mutativity relations e^j = and the associativity.] 


Use the com- 


A little flatness 

6. Let M, N be flat. Show that M 0 N is flat. 

7. Let F be a flat R- module, and let a e R be an element which is not a zero-divisor. Show 
that if ax = 0 for some xeF then x = 0. 

8. Prove Proposition 3.2. 


Faithfully flat 

9. We continue to assume that rings are commutative. Let M be an /í-module. We say 
that M is faithfully flat if M is flat, and if the functor 


T m : E M ® A E. 


is faithful, that is E ^ 0 implies M ® A E ^ 0. Prove that the following conditions are 
equivalent. 

(i) M is faithfully flat. 

(ii) M is flat, and if u: F -► E is a homomorphism of /t-modules, u ^ 0, then 
T m (u) :M® a F^>M® a E is also ^O. 

(iii) A/ is flat, and for all maximal ideals m of A , we have mM ^ M. 

(iv) A sequence of v4-modules N' -► A -► A" is exact if and only if the sequence 
tensored with M is exact. 

10. (a) Let A -► B be a ring-homomorphism. If M is faithfully flat over A, then B ® A M 

is faithfully flat over B. 

(b) Let M be faithfully flat over B. Then M viewed as /4-module via the homomorphism 
A -► B is faithfully flat over A if B is faithfully flat over A. 

11. Let P , M, £ be modules over the commutative ring A. If P is finitely generated (resp. 
finitely presented) and £ is flat, show that the natural homomorphism 

Hom A (P, M) ® A E -► Hom^P, M 0^ £) 

is a monomorphism (resp. an isomorphism). 
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[ Hint : Let F!->F 0 ->F->Obea fìnite presentation, say. Consider the diagram 
0-► Hom A (P, M) 0^ F-> Hom^Fo, M) ® A E -> Hom^Fi, M) ® A E 


0-> Hom^F, A/ 0^ E) -> Hom^Fo, M 0^ £)-► Hom^Fj, M^^F)]. 

Tensor products and direct limits 

12. Show that the tensor product commutes with direct limits. In other words, if {£,} is a 
directed family of modules, and M is any module, then there is a natural isomorphism 

lim(F t 0^ M) « (lim F t ) ® Á M. 

13. (D. Lazard) Let E be a module over a commutative ring A. Tensor products are all 
taken over that ring. Show that the following conditions are equivalent: 

(i) There exists a direct family {F,} of free modules of finite type such that 

F ss lim Fj. 

(ii) E is flat. 

(iii) For every fìnitely presented module P the natural homomorphism 

Hom^F, A) ® A E -♦ Hom^(F, E) 

is surjective. 

(iv) For every finitely presented module F and homomorphism /: F -► E there 
exists a free module F, finitely generated, and homomorphisms 

g :P -+ F and h :F -► F 

such that / = h ° g. 

Remark. The point of Lazard’s theorem lies in the fìrst two conditions: E is flat 
if and only if E is a direct limit of free modules of finite type. 

[Hint: Since the tensor product commutes with direct limits, that (i) implies (ii) 
comes from the preceding exercise and the definition of flat. 

To show that (ii) implies (iii), use Exercise 11. 

To show that (iii) implies (iv) is easy from the hypothesis. 

To show that (iv) implies (i), use the fact that a module is a direct limit of fìnitely 
presented modules (an exercise in Chapter III), and (iv) to get the free modules 
instead. For complete details, see for instance Bourbaki, Algèbre , Chapter X, §1, 
Theorem 1, p. 14.] 


The Casimir element 

14. Let k be a commutative field and let F be a vector space over F, of finite dimension 
n. Let B be a nondegenerate symmetric bilinear form on F, inducing an iso- 
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morphism E —> E y of E with its dual space. Let {i?i,..., v n } be a basis of E. The B- 
dual basis {v [,..., v' n } consists of the elements of E such that B(vì, vj) = ôy. 

(a) Show that the element Y v i ®v[\tíE®E is independent of the choice of 
basis. We call this element the Casimir element (see below). 

(b) In the symmetric algebra S(E), let Q B = Y v i v í- Show that Q B is indepen- 
dent of the choice of basis. We call Q B the Casimir polynomial. It depends on 
B , of course. 

(c) More generally, let D be an (associative) algebra over k , let E —► D be an 

injective linear map of E into D. Show that the element Y @( v ì)@( v ì) = 
(o Bì g, is independent of the choice of basis. We call it the Casimir element in 
D, determined by Q) and B. 

Remark. The terminology of the Casimir element is determined by the classical 
case, when G is a Lie group, E = g = Lie(G) is the Lie algebra of G (tangent space at the 
origin with the Lie algebra product determined by the Lie derivative), and 3)(v) is the 
diíferential operator associated with v (Lie derivative in the direction of u). The Casimir 
element is then a partial diíferential operator in the algebra of all differential operators 
on G. Cf. basic books on manifolds and Lie theory, for instance [JoL 01], Chapter II, §1 
and Chapter VII, §2. 

15. Let E = sl n (k) = subspace of Mat„(/:) consisting of matrices with trace 0. Let B be 
the bilinear form defìned by B(X , Y) = tr(XY). Let G = SL n (k). Prove: 

(a) B is c(G)-invariant, where c (g) is conjugation by an element g e G. 

(b) B is invariant under the transpose (X, Y) »-* (^X/Y). 

(c) Let k = R. Then B is positive definite on the symmetric matrices and nega- 
tive defìnite on the skew-symmetric matrices. 

(d) Suppose G is given with an action on the algebra D of Exercise 14, and that 
the linear map E —► D is G-linear. Show that the Casimir element is G- 
invariant (for the conjugation action on S(E), and the given action on D). 



CHAPTER XVII 

Semisimplicity 


In many applications, a module decomposes as a direct sum of simple sub- 
modules, and then one can develop a fairly precise structure theory, both under 
general assumptions, and particular applications. This chapter is devoted to 
those results which can be proved in general. In the next chapter, we consider 
those additional results which can be proved in a classical and important special 
case. 

I have more or less followed Bourbalci in the proof of Jacobson’s density 
theorem. 


§1. MATRICES AND LINEAR MAPS OVER 
NON-COMMUTATIVE RINGS 

In Chapter XIII, we considered exclusively matrices over commutative 
rings. For our present purposes, it is necessary to consider a more general 
situation. 

Let K be a ring. We define a matrix (ç> 0 ) with coefficients in K just as we 
did for commutative rings. The product of matrices is defined by the same 
formula. Then we again have associativity and distributivity, whenever the 
size of the matrices involved in the operations makes the operations defined. 
In particular, the square n x n matrices over K form a ring, again denoted by 
Mat„(K). We have a ring-homomorphism 

K - Mat„(X) 


on the diagonal. 


641 
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By a division ring we shall mean a ring with 1 # 0, and such that every 
non-zero element has a multiplicative inverse. 

If K is a division ring, then every non-zero K-module has a basis, and the 
cardinalities oftwo bases are equal. The proofis the same as in the commutative 
case; we never needed commutativity in the arguments. This cardinality is 
again called the dimension of the module over K , and a module over a division 
ring is called a vector space. 

We can associate a matrix with linear maps, depending on the choice of a 
finite basis, just as in the commutative case. However, we shall consider a 
somewhat different situation which we want to apply to semisimple modules. 

Let R be a ring, and let 

E = £i ® • • • ® E n , F = Fj © • • • ® F m 

be K-modules, expressed as direct sums of /?-submodules. We wish to describe 
the most general K-homomorphism of E into F. 

Suppose first F = Fj has one component. Let 

(p : E x ©■■•©£„-► F 

be a homomorphism. Let (pj : Ej -► F be the restriction of (p to the factor E- r 
Every element x e E has a unique expression x = x 1 + -* + x„, with Xj e E r 
We may therefore associate with x the column vector X = l (x l9 . .., x n ), whose 
components are in E u ..., E n respectively. We can associate with (p the row 
vector ((p í9 ..., <p„), (pj e Hom R (E j9 F ), and the effect of (p on the element x of 
E is described by matrix multiplication, of the row vector times the column 
vector. 

More generally, consider a homomorphism 


(p '-Ei ® • * • © E n -► F t ® ■ • • ® F m . 

Let n f : Fj © • • • © F m -► F { be the projection on the z-th factor. Then we can 
apply our previous remarks to 7 t £ o <p, for each i. In this way, we see that there 
exist unique elements (p^ e Hom^Ej, F*), such that (p has a matrix representa- 
tion 


M(«p) = 




whose effect on an element x is given by matrix multiplication, namely 
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Conversely, given a matrix (< p u ) with (p^ e Hom R (E j9 F t ), we can define an 
element of Hom^F, F) by means of this matrix. We have an additive group- 
isomorphism between Hom^F, F ) and this group of matrices. 

In particular , let Ebea fixed R-module, and let K = End R (E). Then we have 
a ring-isomorphism 


End*(E<">) - Mat n (K) 
which to each cp e End R (E {n) ) associates the matrix 



determined as before , and operating on the left on column vectors of E {n \ with 
components in E. 

Remark. Let E be a 1-dimensional vector space over a division ring D, 
and let {v} be a basis. For each ae D, there exists a unique D-linear map 
f a :E -+ E such that f a (v) = av. Then we have the rule 

fa fb = fba • 

Thus when we associate a matrix with a linear map, depending on a basis, the 
multiplication gets twisted. Nevertheless, the statement we just made preceding 
this remark is correct!! The point is that we took the (p i} in End K (£), and not 
in D, in the special case that R = D. Thus K is not isomorphic to D (in the 
non-commutative case), but anti-isomorphic. This is the only point ofdifference 
of the formal elementary theory of linear maps in the commutative or non- 
commutative case. 

We recall that an R-module E is said to be simple if it is # 0 and if it has no 
submodule other than 0 or E. 

Proposition 1.1. Schur’s Lemma. Let E, F be simple R-modules. Every 

non-zero homomorphism of E into F is an isomorphism. The ring End R (E) is 

a division ring. 

Proof. Let/: E -> F be a non-zero homomorphism. Its image and kernel 
are submodules, hence Ker/ = 0 and Im / = F. Hence /is an isomorphism. 
If E = F, then / has an inverse, as desired. 

The next proposition describes completely the ring of endomorphisms of a 
direct sum of simple modules. 


Proposition 1.2. Let E = E (ni) © • • • © E (nr) be a direct sum of simple 
modules, the being non-isomorphic, and each E^ being repeated n^ times in 
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the sum. Then , up to a permutation , E l9 ..., E r are uniquely determined up 
to isomorphisms , and the multiplicities n ì ,...,n r are uniquely determined. 
The ring End R (£) is isomorphic to a ring of matrices , of type 



where M t is an n t - x matrix over End K (E t ). (The isomorphism is the one 
with respect to our direct sum decomposition.) 

Proof. The last statement follows from our previous considerations, taking 
into account Proposition 1.1. 

Suppose now that we have two R- modules, with direct sum decompositions 
into simple submodules, and an isomorphism 

E\ ni) e • • • © E ( r nr) - F ( r ] © • • • © F ( r ] , 

such that the £ t are non-isomorphic, and the Fj are non-isomorphic. From 
Proposition 1.1, we conclude that each £ t is isomorphic to some F j9 and con- 
versely. It follows that r = s, and that after a permutation, E t % F t -. Further- 
more, the isomorphism must induce an isomorphism 

£(«.) f(mi) 

for each i. Since £ t « F t , we may assume without loss of generality that in 
fact F t = F t . Thus we are reduced to proving: If a module is isomorphic to 
E (n) and to F (m) , with some simple module F, then n = m. But End K (F (n) ) is 
isomorphic to the n x n matrix ring over the division ring End K (F) = K. 
Furthermore this isomorphism is verified at once to be an isomorphism as 
K-vector space. The dimension of the space of n x n matrices over K is n 2 . 
This proves that the multiplicity n is uniquely determined, and proves our 
proposition. 

When F admits a (finite) direct sum decomposition of simple submodules, 
the number of times that a simple module of a given isomorphism class occurs 
in a decomposition will be called the multiplicity of the simple module (or of 
the isomorphism class of the simple module). 

Furthermore, if 

F = E (ni) © • • • © F ( ” r) 

is expressed as a sum of simple submodules, we shall call n x + • • • + n r the 
length of F. In many applications, we shall also write 

r 

E = n^Fj © • • • © n r E r = (J) n t F t -. 

i — 1 
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§2. CONDITIONS DEFINING SEMISIMPLICITY 

Let R be a ring. Unless othermse specified in this section all modules and 
homomorphisms mll be R-modules and R-homomorphisms. 

The following conditions on a module E are equivalent: 

SS 1. E is the sum of a family of simple submodules. 

SS 2. E is the direct sum of a family of simple submodules. 

SS3. Every submodule F of E is a direct summand, i.e. there exists a 
submodule F' such that E = F © F'. 

We shall now prove that these three conditions are equivalent. 


Lemma 2.1. Let E — ^be a sum (not necessarily direct ) of simple sub- 

ze/ 

modules. Then there exists a subset Jc / such that E is the direct sum 

®Ej. 

j^J 

Proof. Let J be a maximal subset of / such that the sum £ Ej is direct. 

je J 

We contend that this sum is in fact equal to E. It will suffice to prove that each 
E t is contained in this sum. But the intersection of our sum with E t is a sub- 
module of E i9 hence equal to 0 or E^. If it is equal to 0, then J is not maximal, 
since we can adjoin i to it. Hence E { is contained in the sum, and our lemma is 
proved. 

The lemma shows that SS 1 implies SS 2. To see that SS 2 implies SS 3, take 
a submodule F, and let J be a maximal subset of / such that the sum F + © £,• 

je J 

is direct. The same reasoning as before shows that this sum is equal to E. 

Finally assume SS3. To show SS 1, we shall first prove that every non-zero 
submodule of E contains a simple submodule. Let v e £, v ± 0. Then by 
definition, Rv is a principal submodule, and the kernel of the homomorphism 

R^Rv 

is a left ideal L # R. Hence L is contained in a maximal left ideal M # R 
(by Zorn’s lemma). Then M/L Is a maximal submodule of R/L (unequal to 
R/L\ and hence Mv is a maximal submodule of Rv , unequal to Rv , correspond- 
ing to M/L under the isomorphism 


R/L - Rv. 
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We can write E = Mv © M' with some submodule M'. Then 

Rv = Mv © (M' n Rv), 

because every element x e Rv can be written uniquely as a sum x = av + x' 
with a e M and x' e Af', and jc' = x - av lies in Rv. Since Mv is maximal in 
Rv , it follows that Aí' fl /?z; is simple, as desired. 

Let E 0 be the submodule of E which is the sum of all simple submodules of 
E. If E 0 # E , then E = E 0 © F with F ^ 0, and there exists a simple sub- 
module of F, contradicting the definition of E 0 . This proves that SS 3 implies 
SSl. 

A module E satisfying our three conditions is said to be semisimple. 

Proposition 2.2. Every submodule and every factor module of a semisimple 

module is semisimple. 

Proof. Let F be a submodule. Let F 0 be the sum of all simple submodules 
of F. Write E = F 0 © F' 0 . Every element x of F has a unique expression 
x = x 0 4- x' 0 with x 0 g F 0 and x' 0 e F 0 . But x' 0 = x — x 0 e F. Hence F is 
the direct sum 


F = F 0 © (F n F' 0 ). 

We must therefore have F 0 = F, which is semisimple. As for the factor module, 
write E = F © F'. Then F' is a sum of its simple submodules, and the canonical 
map E -► E/F induces an isomorphism of F' onto E/F. Hence E/F is semisimple. 


§3. THE DENSITY THEOREM 

Let E be a semisimple F-module. Let R' = R\E) be the ring End^F). Then 
E is also a /?'-module, the operation of R' on E being given by 

(<P, x) i ► cp(x) 

for cp e R' and x e E. Each a e R induces a /?'-homomorphism f a : E —> E by 
the map f a (x) = ax. This is what is meant by the condition 

cp(ax) = a (p(x). 

We let R " = R'\E) = End^'ÍF). We call R' the commutant of R and R” the 
bicommutant. Thus we get a ring-homomorphism 

R End r \E) = R"(E) = R" 
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by a f a . We now ask how big is the image of this ring-homomorphism. 
The density theorem states that it is quite big. 

Lemma 3.1. Let E be semisimple over R. Let R' = End R (E) 9 f e End^^E) 
as above. Let x e R. There exists an element a e R such that ax = f(x). 

Proof. Since E is Semisimple, we can write an R- direct sum 

E = Rx © F 

with some submodule F. Let tt: E —* Rx be the projection. Then it e R\ and 
hence 


f(x) = f(nx) = nf(x). 

This shows that /(x) e Rx , as desired. 

The density theorem generalizes the lemma by dealing with a finite number 
of elements of E instead of just one. For the proof, we use a diagonal trick. 

Theorem 3.2. (Jacobson). Let E be semisimple over R, and let 
R' = Endfl(£). Let f e End^F). Let x ìf ..., x n e E. Then there exists an 
element a e R such that 

ax ( =f(x t ) for i=1,...,/i. 

IfE isfinitely generated over R', then the natural map R —» End^E) is surjective. 

Proof. For clarity of notation, we shall first carry out the proof in case E 
is simple. Let f {n) : E {n) -► E {n) be the product map, so that 

f (n) (yu---,yn) = (f(yi),-.-,f(y»))- 

Let R' n = End R (E {n) ). Then R' n is none other than the ring of matrices with 
coefficients in R'. Since/commutes with elements of R' in its action on F, one 
sees immediately that/ (n) is in End^F^). By the lemma, there exists an element 
a e R such that 


(axi,..., ocx„) = (/(xO,..., f(x n )) 9 
which is what we wanted to prove. 

When E is not simple, suppose that E is equal to a finite direct sum of simple 
submodules E t (non-isomorphic), with multiplicities n z : 

E = E {ní) © • • • ® E { f r) (Eì^Ej if i / j), 

then the matrices representing the ring of endomorphisms split according to 
blocks corresponding to the non-isomorphic simple components in our direct 
sum decomposition. Hence here again the argument goes through as before. 
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The main point is that/ (n) lies in End^(£ (n) ), and that we can apply the lemma. 

We add the observation that if E is finitely generated over R', then an element 
/e Endfl'(Zs) is determined by its value on a finite number of elements of E , so 
the asserted surjectivity R —> End R '(E) follows at once. In the applications 
below, E will be a finite dimensional vector space over a field k , and R will be 
a Â:-algebra, so the finiteness condition is automatically satisfied. 

The argument when E is an infinite direct sum would be similar, but the 
notation is disagreeable. However, in the applications we shall never need the 
theorem in any case other than the case when E itself is a finite direct sum of 
simple modules, and this is the reason why we first gave the proof in that case, 
and let the reader write out the formal details in the other cases, if desired. 

CoroIIary 3.3. (Burnside’s Theorem). Let E be a finite-dimensional 
vector space over an algebraically closed field k , and let Rbe a subalgebra of 
End*(£). // E is a simple R-module, then R = End^^E). 

Proofi We contend that End R (E) = k. At any rate, End R (E) is a division 
ring R\ containing k as a subring and every element of k commutes with every 
element of R' . Let a e R'. Then k(a) is a field. Furthermore, R' is contained in 
End^is) as a A:-subspace, and is therefore finite dimensional over k. Hence k{ a) 
is finite over k , and therefore equal to k since k is algebraically closed. This 
proves that End R (E) = k. Let now {v Ì9 ..., v n } be a basis of E over k. Let 
A e End*(Zs). According to the density theorem, there exists a e R such that 

avi = Av { for i = I,..., n. 

Since the effect of A is determined by its effect on a basis, we conclude that 
R = End k (E). 

Corollary 3.3 is used in the following situation as in Exercise 8. Let E 
be a finite-dimensional vector space over field k. Let G be a submonoid of 
GL(E) (multiplicative). A G-invariant subspace F of E is a subspace such that 
o-F C F for all a e G. We say that E is G-simple if it has no G-invariant 
subspace other than 0 and E itself, and E =k 0. Let R = k[G] be the subalgebra 
of End*(Zs) generated by G over k. Since we assumed that G is a monoid, it 
follows that R consists of linear combinations 

with a t e k and o x e G. Then we see that a subspace F of E is G-invariant if and 
only if it is R-invariant. Thus E is G-simple if and only if it is simple over R in 
the sense which we have been considering. We can then restate Burnside’s 
theorem as he stated it: 

Coro0ary3.4. Let E be a finite dimensional vector space over an alge- 
braically closed field k, and let G be a ( multiplicative ) submonoid of GL(E ). 
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// E is G-simple , then /c[G] = End k (E). 

When k is not algebraically closed, then we still get some result. Quite 
generally, let R be a ring and E a simple R-module. We have seen that Endj^E) 
is a division ring, which we denote by D, and E is a vector space over D. 

Let R be a ring, and E any R-module. We shall say that E is a faithful 
module if the following condition is satisfied. Given oce R such that ax = 0 
for all xe£, we have a = 0. In the applications, E is a vector space over a field 
/c, and we have a ring-homomorphism of R into End*(£). In this way, E is an 
/?-module, and it is faithful if and only if this homomorphism is injective. 

CoroUary 3.5. (Wedderburn’s Theorem). Let R be a ring, and E a simple, 
faithful module over R. Let D = End R (E), and assume that E is finite dimen - 
sional over D. Then R = End D (£). 

Proof. Let {v ì9 ..., v n } be a basis of E over D. Given A e End D (£), by 
Theorem 3.2 there exists a e R such that 

ctVi = Av x for i=l,..., n. 

Hence the map R -► End D (£) is surjective. Our assumption that E is faithful 
over R implies that it is injective, and our corollary is proved. 

Example. Let R be a fìnite-dimensional algebra over a field /c, and assume 
that R has a unit element, so is a ring. If R does not have any two-sided ideals 
other than 0 and R itself, then any nonzero module E over R is faithful, because 
the kernel of the homomorphism 

R - End fc (£) 

is a two-sided ideal ^ R. If E is simple, then E is finite dimensional over k. 
Then D is a finite-dimensional division algebra over k. Wedderburn’s theorem 
gives a representation of R as the ring of D-endomorphisms of E. 

Under the assumption that R is finite dimensional, one can find a simple 
module simply by taking a minimal left ideal =k 0. Such an ideal exists merely 
by taking a left ideal of minimal non-zero dimension over k. An even shorter 
proof of Wedderburn’s theorem will be given below (Rieffel’s theorem) in this 
case. 

CoroIIary 3.6. Let Rbea ring, finite dimensional algebra over a field k which 
is algebraically closed. Let V be a finite dimensional vector space over k, with 
a simple faithful representation p: R—> End fc (V). Then p is an isomorphism, 
in other words, R ~ Mat n (k). 

Proof. We apply Corollary 3.5, noting that D is finite dimensional over 
k. Given a e D, we note that k(a) is a commutative subfield of D, whence 
k(á) = k by assumption that k is algebraically closed, and the corollary follows. 
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Note. The corollary applies to simple rings, which will be defined below. 

Suppose next that Vi,..., V m are finite dimensional vector spaces over a field 
k 9 and that R is a £-algebra with representations 

R End*(V9, i = 1,. . . , m, 

so V- is an /?-module. If we let 

E = V, © • • • © V m9 

then E is finite over R'(E), so we get the following consequence of Jacobson’s 
density theorem. 

Theorem 3.7. Existence of projection operators. Let k be a field, R a 
k-algebra, and Vj, ..., V m finite dimensional k-spaces which are also simple 
R-modules , cmd such that V/ w not R-isomorphic îo Vj for i 4 1 j. Then there 
exist elements e^ E /? that e t acts as the identity on V t and e t Vj = 0 

ifj * i- 

Proof. We observe that the projection f from the direct sum E to the i-th 

factor is in End^ íE’), because if ç e R' then (p(Vj) C Vj for all j. We may therefore 

apply the density theorem to conclude the proof. 

CoroIIary 3.8. (Bourbaki). Let k be a field of characteristic 0. Let R be 
a k-algebra, and let E, F be semisimple R-modules, finite dimensional over k. 
For each a e R, let a E , a F be the corresponding k-endomorphisms on E and 
F respectively. Suppose that the traces are equal; that is, 

tr(a E ) = tr(a F ) for all a e R. 

Then E is isomorphic to F as R-module. 

Proof. Each of E and F is isomorphic to a finite direct sum of simple R- 
modules, with certain multiplicities. Let V be a simple /?-module, and suppose 

E = © direct summands not isomorphic to V 

F = \ft m) © direct summands not isomorphic to V. 

It will suffice to prove that m = n. Let e v be the element of R found in Theorem 
3.7 such that e v acts as the identity on V, and is 0 on the other direct summands 
of E and F . Then 

\x(e E ) = ndim k (V) and tr(e E ) = mdim k (V). 

Since the traces are equal by assumption, it follows that m = n, thus concluding 
the proof. Note that the characteristic 0 is used here, because the values of the 
trace are in k. 

Example. In the language of representations, suppose G is a monoid, and 
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we have two semisimple representations into finite dimensional Â:-spaces 

p : G —> End^L) and p': G —> End*(F) 

(so p and p' map G into the multiplicative monoid of End*). Assume that 
tr p(cr) = tr p'(cr) for all cr e G. Then p and p' are isomorphic. Indeed, we let 
R = k[G ], so that p and p' extend to representations of R. By linearity, one has 
that tr p(a) = tr p'(a) for all a e F, so one can apply Corollary 3.8. 


§4. SEMISIMPLE RINGS 

A ring R is called semisimple if 1 /0, and if R is semisimple as a left module 
over itself. 

Proposition 4.1. IfR is semisimple , then every R-module is semisimple. 

Proof. An R- module is a factor module of a free module, and a free module 
is a direct sum of R with itself a certain number of times. We can apply Proposi- 
tion 2.2 to conclude the proof. 

Examples. 1) Let k be a field and let R = Mat n (k) be the algebra of 
n x n matrices over k. Then R is semisimple, and actually simple, as we shall 
define and prove in §5, Theorem 5.5. 

2) Let G be a finite group and suppose that the characteristic of k does not 
divide #(G). Thenthegroup ring k[G] is semisimple, as we shallprove in Chapter 
XVIII, Theorem 1.2. 

3) The Clifford algebras C n over the real numbers are semisimple. See Exer- 
cise 19 of Chapter XIX. 

A left ideal of R is an F-module, and is thus called simple if it is simple as a 
module. Two ideals L, L' are called isomorphic if they are isomorphic as 
modules. 

We shall now decompose R as a sum of its simple left ideals, and thereby 
get a structure theorem for R. 

Let {L,} ie/ be a family of simple left ideals, no two of which are isomorphic, 
and such that each simple left ideal is isomorphic to one of them. We say that 
this family is a family of representatives for the isomorphism classes of simple 
left ideals. 

Lemma 4.2. Let Lbe a simple left ideal , and let E be a simple R-module. 

If L is not isomorphic to E , then LE = 0. 

Proof. We have RLE = LE , and LE is a submodule of L, hence equal to 
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0 or E. Suppose LE = E. Let y e E be such that 

Ly # 0. 

Since Ly is a submodule of E , it follows that Ly = E. The map a i—► ocy of L 
into E is a homomorphism of L into £, which is surjective, and hence nonzero. 
Since L is simple, this homomorphism is an isomorphism. 

Let 


«,= 

L * L, 

be the sum of all simple left ideals isomorphic to L f . From the lemma, we con- 
clude that = 0 if i # This will be used constantly in what follows. We 
note that R x is a left ideal, and that R is the sum 

R = I 

i e / 

because /? is a sum of simple left ideals. Hence for any j e /, 

/?j c= Rj/? = /^j/^j c: 

the first inclusion because R contains a unit element, and the last because Rj 
is a left ideal. We conclude that Rj is also a right ideal, i.e. Rj is a two-sided 
ideal for all j e /. 

We can express the unit element 1 of /? as a sum 

i = Z 

iel 

with e^e R x . This sum is actually finite, almost all e, = 0. Say e { # 0 for 
indices i = 1 ,..., 5, so that we write 

I = e, + ... + e s . 


For any x e /?, write 


* = Z x i> XieRi. 

ie / 

For j = 1,..., 5 we have e^x = e } Xj and also 

Xj= 1 • = ejX, + • • • + e s Xj = e^Xj. 

Furthermore, x = e t x + ■ • • + e s x. This proves that there is no index i 
other than i = 1,..., s and also that the i-th component x, of x is uniquely 
determined as e v x = e,x,. Hence the sum R = R { + • • • + R s is direct, and 
furthermore, e x is a unit element for R x , which is therefore a ring. Since 
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RiRj = 0 for i # /, we fìnd that in fact 

r = n *i 

i = 1 

is a direct product of the rings R t . 

A ring R is said to be simple if it is semisimple, and if it has only one 
isomorphism class of simple left ideals. We see that we have proved a structure 
theorem for semisimple rings: 

Theorem 4.3. Let R be semisimple. Then there is only a finite number of 
non-isomorphic simple left ideals , say L l9 ...,L s . If 

Ri= 

L-Li 

is the sum of all simple left ideals isomorphic to L i9 then R t is a txvo-sided ideal , 
which is also a ring (the operations being those induced by R ), and R is ring 
isomorphic to the direct product 


R=Û Rf 

i = 1 

Each Ri is a simple ring. If e t is its unit element , then 1 = e x + • • • + e s9 and 
R t = Re^. We have e^j = 0 if i ^ j. 


We shall now discuss modules. 

Theorem 4.4. Let R be semisimple, and let E be an R-module / 0. Then 

E = 0 RiE = 0 e,E t 

i — 1 i= 1 

and RiE is the submodule of E consisting of the sum of all simple submodules 
isomorphic to L, . 

Proof. Let L, be the sum of all simple submodules of E isomorphic to L, . 
If V is a simple submodule of L, then RV = V, and hence L, V = V for some i. 
By a previous lemma, we have L f % V. Hence E is the direct sum oïE í9 ... 9 E s . 
It is then clear that R^E = L f . 

Corollary 4.5. Let R be semisimple. Every simple module is isomorphic to 
one of the simple left ideals L f . 

Corollary 4.6. A simple ring has exactly one simple module , up to iso- 
morphism. 
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Both these corollaries are immediate consequences of Theorems 4.3 and 4.4. 

Proposition 4.7. Let k be a field and E a finite dimensional vector space 
over k. Let Sbea subset of End k (E). Let R be the k-algebra generated by the 
elements of S. Then R is semisimple if and only if E is a semisimple R (or S) 
module. 

Proof. If R is semisimple, then E is semisimple by Proposition 4.1. Con- 
versely, assume E semisimple as S-module. Then E is semisimple as R-module, 
and so is a direct sum 


£ = ©£,• 

i= 1 

where each E t is simple. Then for each i there exists an element v t e £, such 
that Ei = Rv t . The map 


xt-+(xv u ..., xv n ) 

is a K-homomorphism of R into E , and is an injection since R is contained in 
End*(£). Since a submodule of a semisimple module is semisimple by Proposi- 
tion 2.2, the desired result follows. 


§5. SIMPLE RINGS 

Lemma 5.1. Let Rbe a ring , and \j/ e End r (R) a homomorphism of R into 
itselfi viexved as R-module. Then there exists a e R such that \j/(x) = xa/or 
all x e R. 

Proof. We have ij/(x) = \j/(x • 1) = x\//(ì). Let a = ^(1). 

Theorem 5.2. Let Rbea simple ring. Then R is a finite direct sum of simple 
left ideals. There are no txvo-sided ideals except 0 and R. IfL , M are simple 
left ideals , then there exists a e R such that Lcl = M. We have LR = R. 

Proof. Since R is by definition also semisimple, it is a direct sum of simple 

m 

left ideals, say ©£/• We can write 1 as a finite sum 1 = 2 fy, with fy e Lj. 

j£j j~1 

m m 

R = © Rlh - © Lj. 


Then 
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This proves our first assertion. As to the second, it is a consequence of the 
third. Let therefore L be a simple left ideal. Then LR is a left ideal, because 
RLR = LR , hence (R being semisimple) is a direct sum of simple left ideals, 
say 


LR = @Lj, L = L v 

j=\ 

Let M be a simple left ideal. We have a direct sum decomposition R = L®L. 
Let 7i ’ R —> L be the projection. It is an R-endomorphism. Let o : L -> M be 
an isomorphism (it exists by Theorem 4.3). Then o ° n \ R -► R is an R-e ndo- 
morphism. By the lemma, there exists ae R such that 

o o n(x) = xoc for all x e R. 

Apply this to elements x e L. We find 

o(x) = xol for all x e L. 

The map x i—► xa is a R-homomorphism of L into M, is non-zero, hence is an 
isomorphism. From this it follows at once that LR = R , thereby proving our 
theorem. 


Corollary 5.3. Let Rbe a simple ring. Let E be a simple R-module, and L 
a simple left ideal of R. Then LE = E and E is faithful. 

Proof. We have LE = L(RE) = (LR)E = RE = E. Suppose olE = 0 
for some a e R. Then RolRE = RolE = 0. But RolR is a two-sided ideal. Hence 
RolR = 0, and a = 0. This proves that E is faithful. 


Theorem 5.4. (Rieffel). Let Rbe a ring mthout two-sided ideals except 0 

and R. Let L be a nonzero left ideal, R' = End R (L) and R" = End R (L). 

Then the natural map X: R -> R" is an isomorphism. 

Proof. The kernel of X is a two-sided ideal, so X is injective. Since LR 
is a two-sided ideal, we have LR = R and X(L)X(R) = X(R). For any x, y e L, 
and / e R", we have / (xy) = f (x)y, because right multiplication by y is an 
R-endomorphism of L. Hence X(L) is a left ideal of R", so 

R" = R"X(R) = R"X(L)X(R) = X(L)X(R) = X(R), 
as was to be shown. 

In Rieffel’s theorem, we do not need to assume that L is a simple module. 
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On the other hand, L is an ideal. So this theorem is not equivalent with previous 
ones of the same nature. In §7, we shall give a very general condition under 
which the canonical homomorphism 

R -► R " 

of a ring into the double endomorphism ring of a module is an isomorphism. 
This will cover all the previous cases. 

As pointed out in the example following Wedderburn’s theorem, Rieffel’s 
theorem applies to give another proof when R is a finite-dimensional algebra 
(with unit) over a field k. 

The next theorem gives a converse, showing that matrix rings over division 
algebras are simple. 


Theorem 5.5. Let D be a division ring , and E a finite-dimensional vector 

space over D. Let R = End D (E). Then R is simple and E is a simple R-module. 

Furthermore , D = End R (E). 

Proof. We first show that E is a simple R-module. Let v e £, v / 0. Then 
v can be completed to a basis of E over D, and hence, given vv e £, there exists 
ol 6 R such that av = vv. Hence E cannot have any invariant subspaces other 
than 0 or itself, and is simple over R. It is clear that E is faithful over R. Let 
{v u ... , v m ) be a basis of E over D. The map 

of R into E (m) is an R-homomorphism of R into £ (m) , and is injective. Given 
(vvj,..., w m ) g £ (w) , there exists a e R such that olvì = w, and hence R is R- 
isomorphic to E {m) . This shows that R (as a module over itself) is isomorphic 
to a direct sum of simple modules and is therefore semisimple. Furthermore, 
all these simple modules are isomorphic to each other, and hence R is simple 
by Theorem 4.3. 

There remains to prove that D = End R (E). We note that E is a semisimple 
module over D since it is a vector space, and every subspace admits a com- 
plementary subspace. We can therefore apply the density theorem (the roles 
of R and D are now permuted!). Let cpe End R (E). Let ve E, v # 0. By the 
density theorem, there exists an element a e D such that (p{v ) = av. Let w e E. 
There exists an element / e R such that f(v) = w. Then 

<P(w) = <P(f(v)) = f(<p(v)) = f(av) = af(v) = aw. 

Therefore cp(w) = aw for all w e E. This means that cp s D, and concludes our 
proof. 


Theorem 5.6. Let k be a field and E a finite-dimensional vector space of 
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dimension m over k. Let R = End fc (£). Then R is a k-space , and 

dim k R = m 2 . 

Furthermore , m is the number of simple ieft ideals appearing in a direct sum 

decomposition of R as such a sum. 

Proof. The /c-space of /c-endomorphisms of E is represented by the space 
of m x m matrices in /c, so the dimension of R as a k- space is m 2 . On the other 
hand, the proof of Theorem 5.5 showed that R is K-isomorphic as an R-module 
to the direct sum E (m) . We know the uniqueness of the decomposition of a 
module into a direct sum of simple modules (Proposition 1.2), and this proves 
our assertion. 

In the terminology introduced in §1, we see that the integer m in Theorem 
5.6 is the length of R. 

We can identify R = End k (E) with the ring of matrices Mat m (k), once a 
basis of E is selected. In that case, we can take the simple left ideals to be the 
ideals L, (i = 1,..., m) where a matrix in L, has coefïìcients equal to 0 except 
in the i-th column. An element of L x thus looks like 



We see that R is the direct sum of the m columns. 

We also observe that Theorem 5.5 implies the following: 

Ifa matrix M e Mat m (/c) commutes mth all elements of Mat m (fc), then M is a 
scalar matrix. 

Indeed, such a matrix M can then be viewed as an /^-endomorphism of L, 
and we know by Theorem 5.5 that such an endomorphism lies in k. Of course, 
one can also verify this directly by a brute force computation. 


§6. THE JACOBSON RADICAL, BASE CHANGE, 

AND TENSOR PRODUCTS 

Let R be a ring and let M be a maximal left ideal. Then R/M is an /?-module, 
and actually R/M is simple. Indeed, let J be a submodule of R/M with 
J ^ R/M. Let J be its inverse image in R under the canonical homomorphism. 
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Then J is a left ideal ± M because J ± R/M , so J = R and J = 0. Conversely, 
let E be a simple /?-module and let i? e E, u 0. Then Rv is a submodule ± 0 
of E , and hence = £. Let M be the kemel of the homomorphism x 
Then M is a left ideal, and M is maximal; otherwise there is a left ideal M' with 
R D M’ D M and M' /?, M. Then /?/M ~ E and /?/M' is a non-zero homo- 
morphic image of E , which cannot exist since E is simple (Schur’s lemma, 
Proposition 1.1). Thus we obtain a bijection between maximal left ideals and 
simple /?-modules (up to isomorphism). 

We define the Jacobson radical of R to be the left ideal N which is the 
intersection of all maximal left ideals of R. We may also denote N = Rad (R). 

Theorem 6.1. (a) For every simple R-module we have NE = 0. 

(b) The radical N is a two-sided ideal , containing all nilpotent two-sided ideals. 

(c) Let R he afinite dimensional algebra over field k. Its radical is {0}, ifand 
only ifR is semisimple. 

(d) If R is a finite dimensional algebra over a field k , then its radical N is 
nilpotent (i.e. N r = 0 for some positive integer r). 

These statements are easy to prove, and hints will be given appropriately. See 
Exercises 1 through 5. 

Observe that under finite dimensionality conditions, the radical’s being 0 
gives us a useful criterion for a ring to be semisimple, which we shall use in 
the next result. 

Theorem 6.2. Let Abe a semisimple algebra, finite dimensional over a field 
k. Let K be a finite separable extension of k. Then K ® k A is a semisimple 
over K. 

Proof. In light of the radical criterion for semisimplicity, it suffices to prove 
that K® k A has zero radical, and it suffices to do so for an even larger extension 
than K , so that we may assume K is Galois over k , say with Galois group G. 
Then G operates on K ® A by 

a(x ® a) = ox ® a for x e K and a e A. 

Let N be the radical of K ® A. Since N is nilpotent, it follows that aN is also 
nilpotent for all cr e G, whence aN = N because N is the maximal nilpotent 
ideal (Exercise 5). Let {«!,..., a m } be a basis of A over k. Suppose N contains 
the element 

£ = 2 x t ® a t ^ 0 with x t e K. 

For every y e K the element (y ® 1)£ = 2y*/ ® a t also lies in N. Then 
trace((y <S> 1)£) = 2 o-Ç = 2Tr(jAr,) 0 a,, = 2 1 0 a,Tr(yx,) 
also lies in /V, and lies in 1 ® A * A, thus proving the theorem. 
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Remark. For the case when A is a finite extension of £, compare with 
Exercises 1, 2, 3 of Chapter XVI. 

Let A be a semisimple algebra, finite dimensional over a field k. Then by 
Theorem 6.2 the extension of scalars A ® k k* is semisimple if k is perfect. In 
general, an algebra A over k is said to be absolutely semisimple if A ® k k* is 
semisimple. 

We now look at semisimple algebras over an algebraically closed field. 

Theorem 6.3. Let A, B be simple algebras, finite dimensional over a 
field k which is algebraically closed. Then A ® k B is also simple. We have 
A ~ End*(V) andB ~ End^(W) where V, W arefinite dimensional vector spaces 
over k, and there is a natural isomorphism 

A® k B « End^V 0* W) * End*(V) ® k End*(W). 

Proof. The formula is a special case of Theorem 2.5 of Chapter XVI, and 
the isomorphisms A ~ End^(V), B ~ End*(W) exist by Wedderburn’s theorem 
or its corollaries. 

Let A be an algebra over k and let F be an extension field of k. We denote 
by A f the extension of scalars 

Ap — A ® k F. 

Thus Ap is an algebra over F. As an exercise, prove that if k is the center of A, 
then F is the center of A F . (Here we identify F with 1 0 F.) 

Let A, B be algebras over k. We leave to the reader the proof that for every 
extension field F of k , we have a natural isomorphism 

(A <8> k B) f = A F 0 F B f . 

We apply the above considerations to the tensor product of semisimple 
algebras. 

Theorem 6.4. Let A, B be absolutely semisimple algebras finite dimensional 
over afield k. Then A ® k B is absolutely semisimple. 

Proof. Let F = k a . Then A F is semisimple by hypothesis, so it is a direct 
product of simple algebras, which are matrix algebras, and in particular we can 
apply Theorem 6.3 to see that A F ® F B F has no radical. Hence A ® k B has no 
radical (because if N is its radical, then N ® k F = N f is a nilpotent ideal of 
A f 0 f B F ), whence A ® k B is semisimple by Theorem 6.1(c). 

Remark. We have proved the above tensor product theorems rapidly in 
special cases, which are already important in various applications. For a more 
general treatment, I recommend Bourbaki’s Algebra, Chapter VIII, which gives 
an exhaustive treatment of tensor products of semisimple and simple algebras. 
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§7. BALANCED MODULES 

Let R be a ring and E a module. We let R'(E) = End R (E) and 

R"(E) = End*,(£). 

Let A: R -> R" be the natural homomorphism such that = xv for x e R 
and v e E. If A is an isomorphism, we shall say that E is balanced. We shall say 
that E is a generator (for £-modules) if every module is a homomorphic image 
of a (possibly infinite) direct sum of E with itself. For example, R is a generator. 

More interestingly, in Rieffel’s Theorem 5.4, the left ideal L is a gen- 
erator, because LR = R implies that there is a surjective homomorphism 
L x • • • x L —> R since we can write 1 as a finite combination 

1 = X\Q\ + • * • + x n a n with x t e L and a { e R. 

The map (x \,..., x n ) h» X\a\ + • • • + x n a n is a /?-homomorphism of left module 
onto R. 

If E is a generator, then there is a surjective homomorphism & n) —> R (we 
can take n finite since R is finitely generated, by one element 1). 

Theorem 7.1. (Morita). Let E be an R-module. Then E is a generator if 
and only if E is balanced and fìnitely generated projective over R’(E). 

Proof. We shall prove half of the theorem, leaving the other half to the 
reader, using similar ideas (see Exercise 12). So we assume that E is a generator, 
and we prove that it satisfies the other properties by arguments due to Faith. 

We first prove that for any module F, R © F is balanced. We identify R and 
F as the submodules R © 0 and 0 ® F of R © F, respectively. For w e F, 
let ỳ w :R ® F —> F be the map iIj w (x + v) = xw. Then any / e R"(R © F) 
commutes with tt \, 7r 2 , and each i/j w . From this we see at once that 
f(x + v) = f(l)(x + v) and hence that R © F is balanced. Let E be a gen- 
erator, and £ (n) —> R a surjective homomorphism. Since R is free, we can write 
£■(«) «=/?©£ for some module F, so that ES n) is balanced, Let g e R\E). 
Then g in) commutes with every element cp = (<p iy ) in /?'(£ (n) ) (with components 
(Pij e R’(E)), and hence there is some x e R such that g (n) = À^ n) . Hence 
g = À*, thereby proving that E is balanced, since À is obviously injective. 

To prove that E is finitely generated over R'(E), we have 

R'(E) {n) » Hom R (E in) , E) « Hom«(R, £) © Ho mj? (F, £) 

as additive groups. This relation also obviously holds as R'-modules if we 
define the operation of R' to be composition of mappings (on the left). Since 
Hom^R, E) is R'-isomorphic to E under the map h i—► h(\), it follows that E is 
an R'-homomorphic image of R ,(n) , whence finitely generated over R'. We also 
see that E is a direct summand of the free R'-module R' in) and is therefore 
projective over R'(E). This concludes the proof. 
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EXERCISES 

The radical 

1. (a) Let R be a ring. We define the radical of R to be the left ideal N which is the inter- 
section of all maximal left ideals of R. Show that NE = 0 for every simple /?-module 
E. Show that N is a two-sided ideal. (b) Show that the radical of R/N is 0. 

2. A ring is said to be Artinian if every descending sequence of left ideals J { D J 2 D • • • 
with Ji ± J i+X is finite. (a) Show that a finite dimensional algebra over a field is 
Artinian. (b) If R is Artinian, show that every non-zero left ideal contains a simple 
left ideal. (c) If R is Artinian, show that every non-empty set of ideals contains a 
minimal ideal. 

3. Let R be Artinian. Show that its radical is 0 if and only if R is semisimple. [ Hint : Get 
an injection of R into a direct sum © R/M^ where {M, } is a finite set of maximal left 
ideals.] 

4. Nakayama’s lemma. Let R be any ring and M a finitely generated module. Let N 
be the radical of R. If NM = M show that M = 0. [Hint : Observe that the proof 
of Nalcayama’s lemma still holds.] 

5. (a) Let J be a two-sided nilpotent ideal of R. Show that J is contained in the radical. 
(b) Conversely, assume that R is Artinian. Show that its radical is nilpotent, i.e., 
that there exists an integer r ^ 1 such that N r = 0. [Hint: Consider the descending 
sequence of powers N r , and apply Nakayama to a minimal finitely generated left 
ideal LCr such that N°°L ± 0. 

6. Let R be a semisimple commutative ring. Show that R is a direct product of fields. 

7. Let R be a finite dimensional commutative algebra over a field k. If R has no nilpotent 
element ^ 0, show that R is semisimple. 

8. (Rolchin) Let E be a finite-dimensional vector space over a field k. Let G be a sub- 
group of GL(E) such that every element A e G is of type / + N where N is nilpotent. 
Assume E ^ 0. Show that there exists an element v e E, v # 0 such that Av = v for all 
A e G. \_Hint : First reduce the question to the case when k is algebraically closed by 
showing that the problem amounts to solving linear equations. Secondly, reduce it to 
the case when E is a simple /c[G]-module. Combining Burnside’s theorem with the 
fact that tr(/4) = tr(/) for all A e G, show that if A 0 e G, A 0 = / 4- N, then tr(NA") = 0 
for all X e End k (E), and hence that N = 0, A 0 = /.] 

Semisimple operations 

9. Let E be a finite dimensional vector space over a field k. Let R be a semisimple sub- 
algebra of End k (E). Let a,b e R. Assume that 

Ker b E => Ker a E , 

where b E is multiplication by b on E and similarly for a E . Show that there exists an 
element seR such that sa = b. [Hint: Reduce to R simple. Then R = End D (£ 0 ) 
and E = £[" ) . Let v l9 ..., v r e E be a Z)-basis for aE. Define s by s(ûu f ) = bv t and 
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extend s by D-linearity. Then sa E = b Eì so sa = b.\ 

10. Let E be a finite-dimensional vector space over a field k. Let A e End*(£). We say 
that A is semisimple if E is a semisimple /4-space, or equivalently, let R be the k- algebra 
generated by A, then E is semisimple over R. Show that A is semisimple if and only 
if its minimal polynomial has no factors of multiplicity > 1 over k. 

11. Let E be a finite-dimensional vector space over a fìeld k, and let S be a commutative 
set of endomorphisms of E. Let R = /c[S]. Assume that R is semisimple. Show that 
every subset of S is semisimple. 

12. Prove that an £-module E is a generator if and only if it is balanced, and finitely 
generated projective ovqtR'(E). Show that Theorem 5.4 is a consequence of Theorem 
7.1. 

13. Let A be a principal ring with quotient fìeld K. Let A n be n-space over A, and let 

T = A n © A n © • • • © A n 

be the direct sum of A n with itself r times. Then T is free of rank nr over A. If we view 
elements of A n as column vectors, then T is the space of n x r matrices over A. Let 
M = Mat„C4) be the ring of n x n matrices over A , operating on the left of T. By a 
lattice L in T we mean an A-submodule of rank nr over A. Prove that any such lattice 
which is Af-stable is Af-isomorphic to T itself. Thus there is just one M-isomorphism 
class of lattices. [Hint : Let g e M be the matrix with 1 in the upper left corner and 
0 everywhere else, so g is a projection of A n on a 1-dimensional subspace. Then multi- 
plication on the left g: T —► A r maps T on the space of n x r matrices with arbitrary 
first row and 0 everywhere else. Furthermore, for any lattice L in T the image gL is a 
lattice in A r , that is a free A-submodule of rank r. By elementary divisors there exists 
anrxr matrix Q such that 

gL = A r Q (multiplication on the right). 

Then show that TQ = L and that multiplication by Q on the right is an M-isomorphism 
of T with L.] 

14. Let F be a field. Let n = n(F) be the vector space of strictly upper triangular n x n 
matrices over F. Show that n is actually an algebra, and all elements of n are nilpo- 
tent (some positive integral power is 0). 

15. Conjugation representatìon. Let A be the multiplicative group of diagonal matrices in 
F with non-zero diagonal components. For a e A, the conjugation action of a on 
Mat w (F) is denoted by c(a ), so c (a)M = aMa~ x for M e Mat n (F). (a) Show that n 
is stable under this action. (b) Show that n is semisimple under this action. More 
precisely, for 1 ^ / <j ^ n, let E tJ be the matrix with (//)-component 1, and all other 
components 0. Then these matrices E t j form a basis for n over F, and each Ey is an 
eigenvector for the conjugation action, namely for a = diag(ûi,... ,a n ), we have 

aEijoT' = (aj/aj)Ejj, 

so the corresponding character Xij is given by Xij( a ) = a i/ a j • ( c ) Show that Mat w (F) 
is semisimple, and in fact is equal to b © n ® 'n, where b is the space of diagonal 
matrices. 



CHAPTER XVIII 


Representations of Finite 
Groups 


The theory of group representations occurs in many contexts. First, it is 
developed for its own sake: determine all irreducible representations of a given 
group. See for instance Curtis-Reiner’s MethodsofRepresentation Theory (Wiley- 
Interscience, 1981). Itis alsoused in classifying finite simple groups. Butalready 
in this book we have seen applications of representations to Galois theory and 
the determination of the Galois group over the rationals. In addition, there is an 
analogous theory for topological groups. In this case, the closest analogy is with 
compact groups, and the reader will find a self-contained treatment of the compact 
case entirely similar to §5 of this chapter in my book SL 2 (R) (Springer Verlag), 
Chapter II, §2. Essentially, finite sums are replaced by integrals, otherwise the 
formalism is the same. The analysis comes only in two places. One of them is 
to show that every irreducible representation of a compact group is finite dimen- 
sional; the other is Schur’s lemma. The details of these extra considerations are 
carried out completely in the above-mentioned reference. I was careful to write 
up §5 with the analogy in mind. 

Similarly, readers will find analogous material on induced representations in 
SL 2 (R), Chapter III, §2 (which is also self-contained). 

Examples of the general theory come in various shapes. Theorem 8.4 may 
be viewed as an example, showing how a certain representation can be expressed 
as a direct sum of induced representations from 1-dimensional representations. 
Examples of representations of S 3 and S 4 are given in the exercises. The entire 
last section works out completely the simple characters for the group GL 2 (F) 
when F is a finite field, and shows how these characters essentially come from 
induced characters. 

For other examples also leading into Lie groups, see W. Fulton and J. Harris, 
Representation Theory , Springer Verlag 1991. 
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§1. REPRESENTATIONS AND SEMISIMPLICITY 

Let R be a commutative ring and G a group. We form the group algebra 
/?[G]. As explained in Chapter II, §3 it consists of all formal linear combinations 

Z a °° 


with coefficients a a e R, almost all of which are 0. The product is taken in the 
natural way, 

( Z £ *\ T Ì = Z a a b T ax. 

\aeG J \xeG J a,z 

Let E be an R- module. Every algebra-homomorphism 

R[G] En d R (E) 
induces a group-homomorphism 

G Aut R (E), 

and thus a representation of the ring /?[G] in E gives rise to a representation of 
the group. Given such representations, we also say that R[G], or G, operate on 
E. We note that the representation makes E into a module over the ring /?[G]. 

Conversely, given a representation of the group, say p : G —> Aut R (E), we 
can extend p to a representation of /?[G] as follows. Let a = X ci^a and x e E. 
We define 


p{a)x = £ a„p(a)x. 

It is immediately verified that p has been extended to a ring-homomorphism of 
R[G] into End/jíE’). We say that p is faithful on G if the map p : G —» Aut/^Zs) 
is injective. The extension of p to /?[G] may not be faithful, however. 

Given a representation of G on E, we often write simply crx instead of p(cr)x , 
whenever we deal with a fixed representation throughout a discussion. 

An /?-module E, together with a representation p, will be called a G-module, 
or G-space, or also a (G, /?)-module if we wish to specify the ring R. If E , F 
are G-modules, we recall that a G-homomorphism /: E —> F is an /?-linear map 
such that f(crx) = af(x) for all x e E and a e G. 

Given a G-homomorphism f:E->F, we note that the kernel of / is a G- 
submodule of E, and that the /?-factor module F/f(E) admits an operation of G 
in a unique way such that the canonical map F —> F//(£’) is a G-homomorphism. 

By a trivial representation p :G—> Aut R (E), we shall mean the representation 
such that p(G) = 1. A representation is trivial if and only if ax = x for all 
x e E. We also say in that case that G operates trivially. 
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We make R into a G-module by making G act trivially on R. 

We shall now discuss systematically the representations which arise from a 
given one, on Hom, the dual, and the tensor product. This pattern will be repeated 
later when we deal with induced representations. 

First, Hom/^Zs, F) is a G-module under the action defined for /e Hom R (E, F) 
by 

(Mf)(x) = (Tf((T~ l x). 

The conditions for an operation are trivially verified. Note the a~ l inside the 
expression. We shall usually omit parentheses, and write simply [í r]f(x ) for the 
left-hand side. We note that/is a G-homomorphism if and only if [a]f = /for 
all cr e G. 

We are particularly concerned when F = R (so with trivial action), in which 
case Hom/^Zs, R) = Zs v is the dual module. In the terminology of representations, 
if p: G —> Aut/^Zs) is a representation of G on E , then the action we have just 
described gives a representation denoted by 

p*:G^ Aut«(£ v ), 

and called the dual representation (also called contragredient (ugh!) in the 
literature). 

Suppose now that the modules E, F are free and finite dimensional over R. 
Let p be representation of G on E. Let M be the matrix of p(&) with respect to 
a basis, and let M v be the matrix of p v (<r) with respect to the dual basis. Then 
it is immediately verified that 

(1) A/ v = l M~ x . 

Next we consider the tensor product instead of Hom. Let E, E' be (G, R)- 
modules. We can form their tensor product E ® E\ always taken over R. Then 
there is a unique action of G on E <8> E' such that for cr e G we have 

c t(x ® x') = ox ® (jx'. 

Suppose that E, F are finite free over R. Then the /?-isomorphism 

(2) E v ® F « Hom*(£, F) 

of Chapter XVI, Corollary 5.5, is immediately verified to be a G-isomorphism. 

Whether E is free or not, we define the G-invariant submodule of E to be 
inv G (£) = /?-submodule of elements x e E such that ax = x for all cr e G. If 
E , F are free then we have an /?-isomorphism 


(3) 


inv G (/s v ® F) » Hom G (E, F). 
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If p: G —> Aut r (E) and p': G —» Autfl(F') are representations of G on E 
and E' respectively, then we define their sum p © p' to be the representation 
on the direct sum E © E\ with er e G acting componentwise. Observe that G-iso- 
morphism classes of representations have an additive monoid structure under 
this direct sum, and also have an associative multiplicative structure under the 
tensor product. With the notation of representations, we denote this product by 
p ® p'. This product is distributive with respect to the addition (direct sum). 

If G is a finite group, and E is a G-module, then we can define the trace 
Tr G : E —» E which is an /?-homomorphism, namely 

Tr G (x) = X ax. 

a 6 G 

We observe that Tr G (x) lies in inv G (£), i.e. is fixed under the operation of 
all elements of G. This is because 

t Tr c (x) = £ TffX > 

aeG 

and multiplying by t on the left permutes the elements of G. 

In particular, if f: E —» F is an /?-homomorphism of G-modules, then 
Tr G (/): E —> F is a G-homomorphism. 


Proposition 1.1. Let G be a finite group and let E\ E , F, F' be G-modules. 
Let 

£' ^ £ 1+ f f' 

be R-homomorphisms , and assume that <p, ỳ are G-homomorphisms. Then 

Tr c (^ o/o (p) = ij/o Tr G (f) ° (p. 

Proof. We have 

Tr c (i/' °fo q>) = <r(<A 0 / 0 <p) = X (ail/) ° (af) ° (ct^) 

<reG creG 

= ° ^ I>/ j 0 = </* ° Tr c (/) o 


Theorem 1.2. (Maschke). Let G be afinite group oforder n 9 and let k be a 
field xvhose characteristic does not divide n. Then the group ring k[G] is 
semisimple. 

Proof. Let E be a G-module, and F a G-submodule. Since k is a field, 
there exists a k- subspace F' such that E is the /c-direct sum of F and F'. We let 
the fc-linear map tc : E -► F be the projection on F. Then n(x) = x for all x e F. 
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Let 


<P=~ Tr 0 (7i). 
n 

We have then two G-homomorphisms 

0 -> F E 

<p 

such that j is the inclusion, and (p°j = id. It follows that E is the G-direct sum 
of F and Ker cp, thereby proving that /c[G] is semisimple. 

Except in §7 we denote by G a finite group, and we denote E , F finite 
dimensional /:-spaces, where k is a fìeld of characteristic not dividing 
#(G). We usually denote #(G) by n. 


§2. CHARACTERS 

Let p : /c[G] -► End k (E) be a representation. By the character X p of the 
representation, we shall mean the /c-valued function 

X P : ^[G] -► k 

such that / p (a) = tr p(a) for all a e /c[G]. The trace here is the trace of an endo- 
morphism, as defined in Chapter XIII, §3. If we select a basis for E over /c, it is 
the trace of the matrix representing p(a), i.e., the sum of the diagonal elements. 
We have seen previously that the trace does not depend on the choice of the basis. 
We sometimes write Xe instead of / p . 

We also call E the representation space of p. 

By the trivial character we shall mean the character of the representation of 
G on the /c-space equal to k itself, such that ax = x for all xek. It is the function 
taking the value 1 on all elements of G. We denote it by Xo or a l so by 1 G if we 
need to specify the dependence on G. 

We observe that characters are functions on G, and that the values of a 
character on elements of /c[G] are determined by its values on G (the extension 
from G to /c[G] being by /c-linearity). 

We say that two representations p, cp of G on spaces E , F are isomorphic if 
there is a G-isomorphism between E and F. We then see that if p, cp are iso- 
morphic representations, then their characters are equal. (Put in another way, 
if E, F are G-spaces and are G-isomorphic, then Xe = Xf •) I n everything that 
follows, we are interested only in isomorphism classes of representations. 
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If E , F are G-spaces, then their direct sum E © F is also a G-space, the opera- 
tion of G being componentwise. UxÇ&yeEÇ&F with xeE and ye F, then 
o(x © y) = ox © oy. 

Similarly, the tensor product £® fc F = £(g)Fisa G-space, the operation 
of G being given by o(x ® y) = ox (x) cry. 


Proposition 2.1. If E, F are G-spaces , í/ien 

Ze + Xf = Ze©F ZeZf = Ze<8)F* 

// denotes the character of the dual representation on E v , then 

* v (<r) = ATÍ^ -1 ) 

= X<(t) if k = C 

Proof The first relation holds because the matrix of an element o in the 
representation E © F decomposes into blocks corresponding to the representa- 
tion in E and the representation in F. As to the second, if {i ;J is a basis of E and 
{wj} is a basis of F over k , then we know that {v ( ® Wj} is a basis of E <g) F. Let 
(a ív ) be the matrix of o with respect to our basis of £, and (b jlx ) its matrix with 
respect to our basis of F. Then 

a(vt <g) wj) = ovj ® awj = £ a iv v v ® X 

V 

= X a h b W V v ® W H' 

V, M 

By definition, we find 

Xe®f(°') = E Z a « b Jj = XÁ°)XÁ°)> 

i j 

thereby proving the statement about tensor products. The statement for the char- 
acter of the dual representation follows from the formula for the matrix 'M -1 
given in §1. The value given as the complex conjugate in case k = C will be 
proved later in Corollary 3.2. 

So far, we have defined the notion of character associated with a representa- 
tion. It is now natural to form linear combinations of such characters with more 
general coefficients than positive integers. Thus by a character of G we shall 
mean a function on G which can be written as a linear combination of characters 
of representations with arbitrary integer coefficients. The characters associated 
with representations will be called effective characters. Everything we have 
defined of course depends on the field k , and we shall add over k to our expressions 
if we need to specify the field k. 
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We observe that the characters form a ring in view of Proposition 2.1. For 
most of our work we do not need the multiplicative structure, only the additive 
one. 

By a simple or irreducible character of G one means the character of a 
simple representation (i.e., the character associated with a simple Â:[G]-module). 

Taking into account The.orem 1.2, and the results of the preceding chapter 
concerning the structure of simple and semisimple modules over a semisimple 
ring (Chapter XVII, §4) we obtain: 


Theorem 2.2. There are only a finite number of simple characters of G (over 
k). The characters of representations of G are the linear combinations of the 
simple characters mth integer coefficients ^ 0. 

We shall use the direct product decomposition of a semisimple ring. We 
have 


/c[gi = n Ri 

i = 1 

where each R t is simple, and we have a corresponding decomposition of the unit 
element of /c[G]: 


1 — e x H- • • • H- e s , 

where e x is the unit element of R h and e.ej = 0 if i # j. Also, R^Rj = 0 if i # j. 
We note that s = s(k) depends on k. 

If Lj denotes a typical simple module for R^ (say one of the simple left ideals), 
we let Xi be the character of the representation on L x . 


We observe that Xi(oì) = 0 for all a e Rj if i / j. This is a fundamental relation 
of orthogonality , which is obvious , but from xvhich all our other relations xvill 
folloxv. 


Theorem 2.3. Assume that k has characteristic 0. Then every ejfective char- 
acter has a unique expression as a linear combination 


s 

X=Z n iXi, n i e Z, n f ^ 0, 

i — 1 


xvherexi ,•.., Xs are the simple characters of G over k. Txvo representations are 
isomorphic if and only if their associated characters are equal. 
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Proof. Let E be the representation space of X- Then by Theorem 4.4 of 
Chapter XVII, 


£ « 0 MiLj. 
î= 1 

The sum is finite because we assume throughout that E is finite dimensional. 
Since acts as a unit element on L h we find 

Xiied = dim k L, . 

We have already seen that Xi( e j) = 0 if i ^ j- Hence 

l(ej) = «i dim fc L t . 

Since dim k depends only on the structure of the group algebra, we have 
recovered the multiplicities n u ..., n s . Namely, n, is the number of times that 
L ( occurs (up to an isomorphism) in the representation space of and is the 
value of x( e d divided by dim k L t (we are in characteristic 0). This proves our 
theorem. 

As a matter of definition, in Theorem 2.3 we call n, the multiplicity of Xi in x • 
In both corollaries, we continue to assume that k has characteristic 0. 


Corollary 2.4. As functions of G into k , the simple characters 

Zi 

are linearly independent over k. 

Proof Suppose that £ a^Xi = 0 with a t e k. We apply this expression to e } 
and get 

o = (Z a ìXi)(ej) = aj dim k Lj. 

Hence a } = 0 for all j. 

In characteristic 0 we define the dimension of an effective character to be 
the dimension of the associated representation space. 


CoroIIary 2.5. Thefunction dim is a homomorphism ofthe monoid ofejfective 
characters into Z. 
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Example. Let G be a cyclic group of order equal to a prime number p. 
We form the group algebra Q[G]. Let o be a generator of G. Let 

1 + (7 + cr 2 4- ••• + cr p_1 

= -, É? 2 ~ 1 

P 

Then xe x = e x for any xeG and consequently e\ = e v It then follows that 
e\ — e 2 and e x e 2 = 0. The field Qe^ is isomorphic to Q. Let a> = ce 2 . Then 
a> p = e 2 . Let Q 2 = Qe 2 • Since ío # e 2 , and satisfies the irreducible equation 

+ ••• + 1 = 0 

over Q 2 , it follows that Q 2 (co) is isomorphic to the field obtained by adjoining 
a primitive p-th root of unity to the rationals. Consequently, Q[G] admits the 
direct product decomposition 

Q[G] * Q x Q(0 

where ( is a primitive p-th root of unity. 

As another example, let G be any finite group, and let 

1 v 

e x = - ) o. 
n Jtc 

Then for any x e G we have xe x = e u and e\ = e v If we let e\ = 1 — e x then 
e'i = e\, and e\e x = e x e\ = 0. Thus for any field fc (whose characteristic does 
not divide the order of G according to conventions in force), we see that 

fc[G] = ke x x k[GK 

is a direct product decomposition. In particular, the representation of G on the 
group algebra k[G] itself contains a 1-dimensional representation on the 
component ke u whose character is the trivial character. 


§3. 1 -DIMENSIONAL REPRESENTATIONS 

By abuse of language, even in characteristic p > 0, we say that a character is 
1-dimensional if it is a homomorphism G -► fc*. 

Assume that E is a 1-dimensional vector space over fc. Let 

p : G -► Aut fc (E) 

be a representation. Let {*;} be a basis of E over fc. Then for each o e G, we have 

w 
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for some element /((j) e /c, and /(cr) # 0 since o induces an automorphism of E. 
Then for t e G, 


xcv = x(o)xv = i(o)x(x)v = x(<xt)r 

We see that x : G -> /c* is a homomorphism, and that our 1-dimensional char- 
acter is the same type of thing that occurred in Artin’s theorem in Galois theory. 

Conversely, let x • G fc* be a homomorphism. Let E be a 1-dimensional 
/c-space, with basis {t>}, and define o(av) = ax(o)v for all a e k. Then we see at 
once that this operation of G on E gives a representation of G, whose associated 
character is x • 

Since G is finite, we note that 


Z(^)" = l(o n ) = z(l) = 1. 

Hence the values of 1-dimensional characters are n-th roots of unity. The 
1-dimensional characters form a group under multiplication, and when G is a 
finite abelian group, we have determined its group of 1-dimensional characters 
in Chapter I, §9. 


Theorem 3.1. Let G be a finite abelian group , and assume that k is alge- 
braically closed. Then every simple representation of G is ì-dimensional. The 
simple characters of G are the homomorphisms of G into k*. 

Proof The group ring k[G] is semisimple, commutative, and is a direct 
product of simple rings. Each simple ring is a ring of matrices over k (by Corollary 
3.6 Chapter XVII), and can be commutative if and only if it is equal to k. 

For every 1-dimensional character x of G we have 

X(°Y 1 = X(o~ l l 

If k is the field of complex numbers, then 

x(o) = x(o)~ l = x(o~ l y 

Corollary 3.2. Let k be algebraically closed. Let G be a finite group. For 
any character x and oeG , the value x(o) is equal to a sum of roots of unity mth 
integer coejficients (i.e. coejficients in Z or Z/pZ depending on the char- 
acteristic of k). 

Proof Let H be the subgroup generated by o. Then H is a cyclic subgroup. 
A representation of G having character x can be viewed as a representation for 
H by restriction, having the same character. Thus our assertion follows from 
Theorem 3.1. 
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§4. THE SPACE OF CLASS FUNCTIONS 

By a class function of G (over k, or with values in k ), we shall mean a function 
f:G—>k such that/(errer -1 ) = /(r) for all er, r e G. It is clear that characters 
are class functions, because for square matrices M, M' we have 

tr (MM'M -1 ) = tr(M'). 

Thus a class function may be viewed as a function on conjugacy classes. 

We shall always extend the domain of definition of a class function to the 
group ring, by linearity. If 


a = £ a„(T, 

<T 6 G 

and/is a class function, we define 

/(«)= Z«,/(a). 

<xe G 

Let a 0 e G. If <r e G, we write cr ~ o 0 if o is conjugate to cr 0 , that is, if there 
exists an element i such that cr 0 = xox~ l . An element of the group ring of type 

y = Z a 

<T~(To 

will also be called a conjugacy class. 


Proposition 4.1. An element of k[G] commutes with every element of G if 
and only ifit is a linear combination of conjugacy classes with coejfìcients in k. 

Proof Let a = £ a a° an d assume at = ta for all x e G. Then 

<T€ G 

X a a U7r~ l = X a a <r. 

<reG <reG 

Hence a ao = a a whenever cr is conjugate to cr 0 , and this means that we can write 

a = Z a ?y 

y 

where the sum is taken over all conjugacy classes y. 

Remark. We note that the conjugacy classes in fact form a basis of the 
center of Z[G] over Z, and thus play a universal role in the theory of rep- 
resentations. 

We observe that the conjugacy classes are linearly independent over /c, 
and form a basis for the center of /c[G] over k. 
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Assume for the rest of this section that k is algebraically closed. Then 

k[G] =f\ R i 

i = 1 

is a direct product of simple rings, and each is a matrix algebra over k. In a 
direct product, the center is obviously the product of the centers of each factor. 
Let us denote by k t the image of kin R h in other words, 

k t = ke i9 

where e { is the unit element of R Then the center of fc[G] is also equal to 

n*. 


which is 5-dimensional over k. 

If is a typical simple left ideal of R i9 then 

R i « End 4 (Li). 

We let 


Then 


di = dim^ L,. 



S 

df = dim A R 

and Yj df = n. 

i= 1 


We also have the direct sum decomposition 

Ri * L| di) 


as a (G, fc)-space. 

The above notation will remain fixed from now on. 

We can summarize some of our results as follows. 

Proposition 4.2. Letkbe algebraically closed. Thenthenumberof conjugacy 
classes of G is equal to the number of simple characters of G, both of these being 
equal to the number s above. The conjugacy classes y ì9 -.. 9 y s and the unit 
elements e l9 ... 9 e s form bases of the center of /c[G]. 

The number of elements in y t will be denoted by h t . The number of elements 
in a conjugacy class y will be denoted by h y . We call it the class number. The 
center of the group algebra will be denoted by Z k (G). 
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We can view /c[G] as a G-module. Its character will be called the regular 
character, and will be denoted by / reg or r G if we need to specify the dependence 
on G. The representation on /c[G] is called theregular representation. From our 
direct sum decomposition of /c[G] we get 



We shall determine the values of the regular character. 
Proposition 4.3. Let x reg be the regular character. Then 
XregM = 0 Ìf (7GG,( 7/1 


Xregí 1 ) = n. 

Proof Let 1 = . .., o n be the elements of G. They form a basis of /c[G] 

over k. The matrix of 1 is the unit n x n matrix. Thus our second assertion 
follows. If (7 / 1, then multiplication by o permutes o í9 ..., o n , and it is im- 
mediately clear that all diagonal elements in the matrix representing o are 0. 
This proves what we wanted. 

We observe that we have two natural bases for the center Z k (G ) of the 
group ring. First, the conjugacy classes of elements of G. Second, the elements 
e l9 ..., e s (i.e. the unit elements of the rings Rf. We wish to find the relation 
between these, in other words, we wish to fìnd the coefficients of when ex- 
pressed in terms of the group elements. The next proposition does this. The 
values of these coefficients will be interpreted in the next section as scalar 
products. This will clarify their mysterious appearance. 

Proposition 4.4. Assume again that k is algebraically closed. Let 

e,' = X a T z, a x ek. 

T E G 


Then 


flr=^Xreg fax ') = ~ Xi(^ ‘)- 


Proof We have for all t e G: 


Xreg( e i T ‘) = Xreg^ 'L“<’ aX 


lu reg(^ T ’)• 
a eG 
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By Proposition 4.3, we find 

W g . T_1 ) = n °r■ 


On the other hand, 

s 

Xreg ( e i t'" 1 ) = Y d i x/e, x " 1 ) = di xtei X - 1 ) = d, Xi(x ~ 1 ). 

j=l 

Hence 


*) = na x 

for all t e G. This proves our proposition. 

Corollary 4.5. Each e x can be expressed in terms of group elements mth 
coefficients which lie in the field generated over the prime field by m-th roots 
of unity , if m is an exponent for G. 

Corollary 4.6. The dimensions d x are not divisible by the characteristic of k. 
Proof Otherwise, e { = 0, which is impossible. 

Corollary 4.7. The simple characters % u . . . , % s are linearly independent 
over k. 

Proof. The proof in Corollary 2.4 applies, since we now know that the 
characteristic does not divide d x . 

Corollary 4.8. Assume in addition that k has characteristic 0. Then d ( |n 
for each i. 

Proof. Multiplying our expression for e t by n/d x , and also by e i9 we find 

j e i= £ 

<T6 G 

Let C be a primitive m-th root of unity, and let M be the module over Z gen- 
erated by the finite number of elements Cve x (v = 0,..., m — 1 and o e G). 
Then from the preceding relation, we see at once that multiplication by n/d, 
maps M into itself. By definition, we conclude that n/d, is integral over Z, 
and hence lies in Z, as desired. 

Theorem 4.9. Let k be algebraically closed. Let Z k (G) be the center of 
k[G], and let X k (G) be the k-space of class functions on G. Then Z k (G) and 
X k (G) are the dual spaces of each other , under the pairing 


(/,a)^/(a). 
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The simple characters and the unit elements e l9 , e s form orthogonal bases 
to each other. We have 

X,<ej) = Sijdf 

Proof. The formula has been proved in the proof of Theorem 2.3. The 
two spaces involved here both have^dimension s, and d t # 0 in k. Our prop- 
osition is then clear. 


§5. ORTHOGONALITY RELATIONS 

Throughout this section, we assume that k is algebraically closed . 

If R is a subring of k , we denote by X R (G) the /?-module generated over R 
by the characters of G. It is therefore the module of functions which are linear 
combinations of simple characters with coefficients in R. If R is the prime ring 
(i.e. the integers Z or the integers mod p if k has characteristic p), then we denote 
X R (G)byX(G). 

We shall now define a bilinear map on X(G) x X(G). If f geX(G\ we 
define 


n oeG 

Theorem 5.1. The symbol </, g ) forfge X(G) takes on values in the prime 
ring. The simple charactersform an orthonormal basisfor X(G), in other xvords 

<Xh Xj> = Sij. 

For each ring R c= k, the symbol has a unique extension to an R-bilinear form 
X r (G) x X r (G) -+ R , given by the same formula as above. 

Proof By Proposition 4.4, we find 

Xjieò = ^ I Xii^'Yxf a)- 

If i # j we get 0 on the left-hand side, so that Xi and Xj are orthogonal. If i = j 
we get d t on the left-hand side, and we know that d t # 0 in /c, by Corollary 4.6. 
Hence Xi) = 1- Since every element of X(G) is a linear combination of 
simple characters with integer coefficients, it follows that the values of our 
bilinear map are in the prime ring. The extension statement is obvious, thereby 
proving our theorem. 
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Assume that k has characteristic 0. Let m be an exponent for G, and let R 
contain the m-th roots of unity. If R has an automorphism of order 2 such that 
its effect on a root of unity is ( i—► 1 , then we shall call such an automorphism 
a conjugation, and denote it by a i—► a. 

Theorem 5.2. Let k have characteristic 0, and let Rbe a subring containing 
the m-th roots of unity , and having a conjugation. Then the bilinear form on 
X(G) has a unique extension to a hermitian form 

X r (G) x X r (G) - R 9 


given by the formula 


</ 0> = - Z ma(o). 

^ (jg G 

The simple characters constitute an orthonormal basis of X R (G) mth respect 

to thisform. 

Proof. The formula given in the statement of the theorem gives the same 
value as before for the symbol </ g ) when/, g lie in X(G). Thus the extension 
exists, and is obviously unique. 

We return to the case when k has arbitrary characteristic. 

Let Z(G) denote the additive group generated by the conjugacy classes 
y j,..., y s over the prime ring. It is ofdimension s. We shall define a bilinear map 
on Z(G) x Z(G). If a = £ a a o has coefficients in the prime ring, we denote by 
a - the element £ a a o ~ l . 

Proposition 5.3. For a, / e Z(G ), we can define a symbol <a, /) by either one 

of the following expressions , which are equal: 

<0, j8> = ^ Xreg(aJÎ') = 1 È 7.Áa)'/.ÂP~)- 

The values of the symbol lie in the prime ring. 

Proof. Each expression is linear in its first and second variable. Hence 
to prove their equality, it will suffice to prove that the two expressions are equal 
when we replace a by e { and / by an element t of G. But then, our equality is 
equivalent to 


s 

Xreg (e,* -1 ) = I XÂ e i)/Á x ~ ’)• 

V —- 1 

Since Xv(*i) = 0 unless v = i , we see that the right-hand side of this last relation 
is equal to d{Xi (r -1 ). Our two expressions are equal in view of Proposition 4.4. 
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The fact that the values lie in the prime ring follows from Proposition 4.3: The 
values of the regular character on group elements are equal to 0 or n, and hence 
in characteristic 0, are integers divisible by n. 

As with X r (G ), we use the notation Z R (G) to denote the fl-module generated 
by yi,...,y s over an arbitrary subring R of k. 

Lemma 5.4. For each ring R contained in k, the pairing of Proposition 5.3 

has a unique extension to a map 

Z r (G) x Z(G) -> R 

which is R-linear in its first variable. If R contains the m-th roots of unity, 

where m is an exponent for G, and also contains 1/n, then e t e Z R (G) for all i. 

The class number h t is not divisible by the characteristic of k, and we have 

s 1 

e,- = £ <«?,-, Tv> 7- Vv 

v= i n x 

Proof We note that h t is not divisible by the characteristic because it is 
the index of a subgroup of G (the isotropy group of an element in y, when G 
operates by conjugation), and hence /i, divides n. The extension of our pairing 
as stated is obvious, since y ì9 ..., y s form a basis of Z(G) over the prime ring. 
The expression of e { in terms of this basis is only a reinterpretation of Proposition 
4.4 in terms of the present pairing. 

Let E be a free module over a subring R of k, and assume that we have a 
bilinear symmetric (or hermitian) form on E. Let {v u ..., v s } be an orthogonal 
basis for this module. If 


v = a x v x + ••• 4- a s v s 

with a { g R, then we call a l9 ..., a s the Fourier coefflcients of v with respect to 
our basis. In terms of the form, these coefficients are given by 

<v , v t > 

<v i9 Vi> 

provided (v if v t > # 0. 

We shall see in the next theorem that the expression for e { in terms of 
y l9 ..., y s is a Fourier expansion. 

Theorem 5.5. The conjugacy classes y l9 ..., y s constitute an orthogonal 
basis for Z(G). We have (y t , y t > = /i f . For each ring R contained in k, the 
bilinear map of Proposition 5.3 has a unique extension to a R-bïlinear map 


Zr(G) x Z r (G) - R. 
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Proof ! We use the lemma. By linearity, the formula in the lemma remains 
valid when we replace R by /e, and when we replace e { by any element of Z k (G\ in 
particular when we replace e { by y { . But {y^,..., y s } is a basis of Z fc (G), over k. 
Hence we find that <y f , and <y,-, y\) = 0 if i # / as was to shown. 

Corollary 5.6. //G is commutative , í/ien 

1 " (0 if <7 is not equal to t 

) = |l if (7 is equal to T. 

Proof When G is commutative, each conjugacy ciass has exactly one ele- 
ment, and the number of simple characters is equal to the order of the group. 

We consider the case of characteristic 0 for our Z(G) just as we did for X(G). 
Let k have characteristic 0, and R be a subring of k containing the m-th roots of 
unity, and having a conjugation. Let a = £ a a o with a a e R. We define 

oeG 

oeG 


Theorem 5.7. Let k have characteristic 0, and let Rhe a subring of k , con- 
taining the m-th roots of unity , and having a conjugation. Then the pairing of 
Proposition 5.3 has a unique extension to a hermitian form 

Z r (G) x Z r (G) -> R 

given by the formulas 

<a, /?> = - n Zreg(a^) = ^ z XÁrìxÂP)- 

The conjugacy classes y u ..., y s form an orthogonal basis for Z R (G). If R 
contains 1 /n, then e \,..., e s lie in Zr(G ) and also form an orthogonal basis for 
Z r (G). We have < e i9 e f ) = df/n. 

Proof. The formula given in the statement of the theorem gives the same 
value as the symbol <a, /> of Proposition 5.3 when a, /? lie in Z(G). Thus the 
extension exists, and is obviously unique. Using the second formula in Propo- 
sition 5.3, defining the scalar product, and recalling that Xv( e d = 0 if v # i, we 
see that 


<<?,, e i) = - Xi( e i)Xi( e il 


whence our assertion follows. 
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We observe that the Fourier coefficients of e { relative to the basis y l9 ..., y s 
are the same with respect to the bilinear form of Theorem 5.5, or the hermitian 
form of Theorem 5.7. This comes from the fact that y l9 ..., y s lie in Z(G), and 
form a basis of Z(G) over the prime ring. 

We shall now reprove and generalize the orthogonality relations by another 
method. Let E be a finite dimensional (G, /c)-space, so we have a representation 

G - Aut*(£). 

After selecting a basis of £, we get a representation of G by d x d matrices. If 
{v í9 ..., v d } is the basis, then we have the dual basis {A l5 ..., X d } such that 
Xì(vj) = Sij. If an element o of G is represented by a matrix (p,/cr)), then each 
coefficient Ptfo) is a function of o 9 called the iy-coefficient function. We can also 
write 


= J-jiaVi). 

But instead of indexing elements of a basis or the dual basis, we may just as 
well work with any functional X on £, and any vector v. Then we get a function 

o\-+À(ov) = Px, v (<j), 

which will also be called acoefficient function. In fact, one can always complete 
v = v { to a basis such that X = is the first element in the dual basis, but using 
the notation p x v is in many respects more elegant. 

We shall constantly use: 

Schur’s Lemma. Let E , F be simple (G, k)-spaces , and let 

cp: E -> F 

be a homomorphism. Then either cp = 0 or cp is an isomorphism. 

Proof. Indeed, the kernel of (p and the image of cp are subspaces, so the 
assertion is obvious. 

We use the same formula as before to define a scalar product on the space of 
all fc-valued functions on G, namely 

<f,9> = ^ Z /( ff )0(^ _1 )- 

U <TE G 

We shall derive various orthogonality relations among coefficient functions. 

Theorem 5.8. Let E , F be simple (G, k)-spaces. Let Xbea k-linear functional 
on E 9 letxeE and yeF. IfE 9 F are not isomorphic , then 

£ X(ox)o~ l y = 0. 

a eG 
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Ifp is a functional on F then the coefficient functions p x x and p^ y are ortho - 
gonal, that is 

X A(<7 x)p((7~ l y) = 0. 

<xeG 

Proof The map x i—► £ A(<tx)<7 _ l y is a G-homomorphism of E into F, so 
Schur’s lemma concludes the proof of the first statement. The second comes by 
applying the functional p. 

As a corollary, we see that if ij/ are distinct irreducible characters of G 
over k, then 

(x< <A> = 0, 

that is the characters are orthogonal. Indeed, the character associated with a 
representation p is the sum of the diagonal coefficient functions, 

d 

x = Z Pu> 

i = 1 

where d is the dimension of the representation. Two distinct characters cor- 
respond to non-isomorphic representations, so we can apply Proposition 5.8. 

Lemma5.9. Let Ehe a simple (G, k)-space. Then any G-endomorphism of 
E is equal to a scalar multiple of the identity . 

Proof. The algebra End c k (E) is a division algebra by Schur’s lemma, 
and is finite dimensional over k. Since k is assumed algebraically closed, it must 
be equal to k because any element generates a commutative subfield over k. 
This proves the lemma. 

Lemma 5.10. Let E be a representation space for G of dimension d. Let X 
be afunctional on E, and let xeE. Let (p XjX e End k (E) be the endomorphism 
such that 


<Px, x (y) = My)x. 

Then tr(<p A x ) = A(x). 

Proof. If x = 0 the statement is obvious. Let x # 0. If A(x) # 0 we pick 
a basis of E consisting of x and a basis of the kernel of X. If A(x) = 0, we pick a 
basis of E consisting of a basis for the kernel of X, and one other element. In 
either case it is immediate from the corresponding matrix representing cp k x that 
the trace is given by the formula as stated in the lemma. 

Theorem 5.11. Let p:G-> Aut k (E) be a simple representation of G, of 
dimensiond. Thenthecharacteristicofkdoesnotdivided. Letx,yeE. Then 
for any functionals X, p on E, 

Z k{ox)n(o~ V) = \ l(y)Kx)- 

<reG U 
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Proof. It suffices to prove that 

Z A((7X)<7~ ‘y = ” À(y)x. 

aeG a 


For fixed y the map 


x i—► £ ^(crx)cr l y 

aeG 

is immediately verified to be a G-endomorphism of £, so is equal to cl for some 
c e k by Lemma 5.9. In fact, it is equal to 

'Zp(o~ 1 )° <Px,y° p(o)- 

a eG 

The trace of this expression is equal to n • tr (cp Xt y ) by Lemma 5.10, and also to dc. 
Taking A, y such that A(y) = 1 shows that the characteristic does not divide d , 
and then we can solve for c as stated in the theorem. 

Corollary 5.12. Let % be the character of the representation of G on the 
simple space E. Then 


<Z, Z> = 1 . 


Proof. This follows immediately from the theorem, and the expression of 
X as 


X = Pn + + p dd . 

We have now recovered the fact that the characters of simple representations 
are orthonormal. We may then recover the idempotents in the group ring, that 
is, if Xi 9 • • • , Xs are the simple characters, we may now define 

e. = - Z Xi(a)a~ l - 

^ a e G 

Then the orthonormality of the characters yields the formulas: 

s 

Corollary 5.13. xfcj) = ôijd, and x reg = I dj A . 

i = 1 

Proof. The first formula is a direct application of the orthonormality of the 
characters. The second formula concerning the regular character is obtained 
by writing 


Xreg = I m jXj 
j 
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with unknown coefficients. We know the values Xre g (0 = n an d Zr eg ( <7 ) = 0 if 
a # 1. Taking the scalar product of / reg with Xi for i = 1, ..., s immediately 
yields the desired values for the coefficients m^. 

Since a character is a class function, one sees directly that each e x is a linear 
combination of conjugacy classes, and so is in the center of the group ring k[G]. 

Now let Ei be a representation space of and let p, be the representation 
of G or k[G] on E^. For a g /c[G] we let p f (a): £, -► £, be the map such that 
p,(a)x = ax for all x g £, . 

Proposition 5.14. We have 

Páed = id and pleỳ = 0 ifi / j. 

Proof. The map x i-> e t x is a G-homomorphism of Et into itself since e t is in 
the center of /c[G]. Hence by Lemma 5.9 this homomorphism is a scalar 
multiple of the identity. Taking the trace and using the orthogonality relations 
between simple characters immediately gives the desired value of this scalar. 

We now find that 


E = 1 

i — 1 

because the group ring /c[G] is a direct sum of simple spaces, possibly with 
multiplicities, and operates faithfully on itself. 

The orthonormality relations also allow us to expand a function in a Fourier 
expression, relative to the characters if it is a class function, and relative to the 
coefficient functions in general. We state this in two theorems. 

Theorem 5.15. Let fbea class function on G. Then 

/= iaxi>L- 

i — 1 

Proof The number of conjugacy class is equal to the number of distinct 
characters, and these are linearly independent, so they form a basis for the class 
functions. The coefficients are given by the stated formula, as one sees by taking 
the scalar product of/ with any character Xj an d using the orthonormality. 

Theorem 5.16. Let p (,) be a matrix representation of G on E t relative to a 
choice ofbasis , and let be the coejfìcientfunctions of this matrix , i = 1,..., s 
and v, p = 1,..., di. Then the functions p { f >tl form an orthogonal basis for the 
space of all functions on G, and hence for any functionf on G we have 

f= i Z j </, 

Ì = 1 V, PL U'i 
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Proof. That the coefficient functions form an orthogonal basis follows from 
Theorems 5.8 and 5.11. The expression of/in terms of this basis is then merely 
the standard Fourier expansion relative to any scalar product. This concludes 
the proof. 

Suppose now for concreteness that k = C is the complex numbers. Recall 
that an effective character x ls an element of X(G), such that if 

5 

* = X m,Xì 

Ì= 1 

is a linear combination of the simple characters with integral coefficients, then 
we have m, ^ 0 for all i. ln light of the orthonormality of the simple characters, 
we get for all elements x e X(G) the relations 

\ìx\\ 2 = (x, x) = X m} and m, = <*, \i)■ 

Ì= 1 

Hence we get (a) of the next theorem. 

Theorem 5.17. (a) Let x be an effective character in X(G). Then x is simple 

over C if and only if ||^|| 2 = 1, or alternatively , 

2 Ix(<7)l 2 = #(G). 

creG 

(b) Let x . be effective characters in X(G ), and let E , F be their representation 

spaces over C. Then 

ÌX< <P)g = dim Hom G (£. F). 

Proof. The first part has been proved, and for (b), let tf/ = 2 q,X,■ Then by 
orthonormality, we get 

(x> 'P)g = 2 m,?,. 

But if Ei is the representation space of Xi over C, then by Schur’s lemma 
dim Hom G (£ z , £,) = 1 and dim Hom^/Zï/ Ej) = 0 for i =A j. 

Hence dim Hom^^E, F) = 2 m^, thus proving (b). 

Corollary 5.18 With the above notation and k = C for simplicity , we have: 

(a) The multiplicity of \ G in F v ® F is dim^ inv G (£ v ® F). 

(b) The (G, k)-space E is simple ifand only if Ì G has multiplicity 1 in E v ® E. 

Proof Immediate from Theorem 5.17 and formula (3) of §1. 

Remark. The criterion of Theorem 5.17(a) is useful in testing whether a 
representation is simple. In practice, representations are obtained by inducing 
from 1-dimensional characters, and such induced representations do have a ten- 
dency to be irreducible. We shall see a concrete case in §12. 
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§6. INDUCED CHARACTERS 

The notation is the same as in the preceding section. However, we don’t need 
all the results proved there; all we need is the bilinear pairing on X(G), and its 
extension to 


X r (G) x X r (G) - R. 

The symbol < , ) may be interpreted either as the bilinear extension, or the 
hermitian extension according to Theorem 5.2. 

Let S be a subgroup of G. We have an R-linear map called the restriction 

resf : X R (G) X R (S) 

which to each class function on G associates its restriction to S. It is a ring- 
homomorphism. We sometimes let f s denote the restriction of/ to S. 

We shall define a map in the opposite direction, 

indj : X x (S) -► X«(G), 

which we call the induction map. If g e X R (S), we extend g to g s on G by 
letting g s ((r) = 0 if cr£ S. Then we define the induced function 

g G (&) = ind^(g)(o-) = - ; 2 g^T(TT~ l ). 

• l > reC 

Then ind s (g) is a class function on G. It is clear that ind^ is /?-linear. 

Since we deal with two groups S and G , we shall denote the scalar product 
by < , > s and < , ) G when it is taken with these respective groups. The next 
theorem shows among other things that the restriction and transfer are adjoint 
to each other with respect to our form. 

Theorem 6.1. Let S be a subgroup of G. Then the follomng rules hold : 

(i) (Frobenius reciprocity) For /e X R (G) y and g e X R (S) we have 

(ind $(g),f) G = (g, Resf(/)>,. 

(ii) Ind£(g)/ = indÇigfs). 

(iii) IfTCSCG are subgroups ofG , then 

inds ° indf = ind^. 

(iv) // cr e G and g a is defined by g^r^) = g(r), where r 0 " = cr _1 rcr, then 

indjíg) = ind ^ig”). 

(v) If i/r is an effective character ofS then indý( f) is ejfective. 
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Proof. Let us first prove (ii). We must show that g G f — (gf s ) G . We have 

(g G f)(r) = 77^-77 E g^ara~ l )f(T) = 1 2 g s ((rra~ x )f(<rra~ x ). 

The last expression just obtained is equal to ( gf s ) G , thereby proving (ii). Let us 
sum over i in G. The only non-zero contributions in our double sum will come 
from those elements of S which can be expressed in the form cxo~ 1 with o 9 tsG. 
The number of pairs ( o , t) such that oxo~ 1 is equal to a fixed element of G is 
equal to n (because for every Ae G, (<rA, X~ l xÁ.) is another such pair, and the 
total number of pairs is n 2 ). Hence our expression is equal to 

(GiD^L; I3(A)/(A). 

• U ÂeS 

Our first rule then follows from the definitions of the scalar products in G and S 
respectively. 

Now let g = \\t be an effective character of 5, and let / = \ be a simple 
character of G'. From (i) we find that the Fourier coefficients of g G are integers 
^ 0 because res£(*) is an effective character of S. Therefore the scalar product 

res£(x)}s 

is = 0. Hence ỳ 0 is an effective character of G, thereby proving (v). 

In order to prove the transitivity property, it is convenient to use the fol- 
lowing notation. 

Let {c} denote the set of right cosets of S in G. For each right coset c, we 
select a fixed coset representative denoted by c. Thus if c u ..., c r are these 
representatives, then 


G = (Jc = USc=U Sc ( . 

c c i = 1 


Lemma 6.2. Let gbe a class function on S. Then 


'nd G ( S m = tgs(c^cr'). 

Ì= 1 

Proof. We can split the sum over all o e G in the definition of the induced 
function into a double sum 


I = I í 
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and observe thateach term g s (erc£c -1 o- -1 ) is equal to é 7 s(c£c - 1 ) if cte 5, because 
g is a class function. Hence the sum over cr e 5 is enough to cancel the factor 
1/(5 : 1) in front, to give the expression in the lemma. 

If T c 5 c G are subgroups of G, and if 

G = (J Scì and 5 = [j Tdj 

aredecompositions into right cosets, then {3,-cJ form a system of representatives 
for the right cosets of T in G. From this the transitivity property (iii) is obvious. 
We shall leave (iv) as an exercise (trivial, using the lemma). 


§7. INDUCED REPRESENTATIONS 


Let G be a group and 5 a subgroup of finite index. Let F be an 5-module. 
We consider the category C whose objects are 5-homomorphisms <p : F —> E of 
F into a G-module E. (We note that a G-module E can be regarded as an 5- 
moduleby restriction.) If <p': F—> E' is another object in C, we define a morphism 
<p' —> ç in (3 to be a G-homomorphism r\: E' —> E making the following diagram 
commutative: 

.E' 

F 



A universal object in (3 is determined up to a unique G-isomorphism. It will 
be denoted by 


indf : F —> ind£(F). 


We shall prove below that a universal object always exists. If cp : F -> E is a 
universal object, we call E an induced module It is uniquely determined, up to a 
unique G-isomorphism making a diagram commutative. For convenience, we 
shall select one induced module such that cp is an inclusion. We shall then call 
this particular module indf (F) the G-module induced by F. In particular, given 
an 5-homomorphism <p: F —> E into a G-module E , there is a unique G-homo- 
morphism <p%: ind^(F) —> E making the following diagram commutative: 



indf(F) 

V* = indJ((o) 

E 
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The association ç i-> indf (<p) then induces an isomorphism 


Hom G (ind 5 (F), E) ~ Hom 5 (F, res 5 (£)), 


for an 5-module F and a G-module E. We shall see in a moment that ind 5 is a 
functor from Mod(5) to Mod(G), and the above formula may be described as 
saying that induction is the adjoint functor of restriction. One also calls this 
relation Frobenius reciprocity for modules, because Theorem 6.1(i) is a 
corollary. 

Sometimes, if the reference to F as an S-module is clear, we shall omit the 
subscript S, and write simply 

ind G (F) 

for the induced module. 

Let/: F' -► F be an S-homomorphism. If 

<P§ ■ F' ind^(F') 


is a G-module induced by F\ then there exists a unique G-homomorphism 
indf(F') —> ind 5 (F) making the following diagram commutative: 


F',———► ind^(F') 


/ 

F 



<Ps 


* 


ind^(/) 


indj(F) 


It is simply the G-homomorphism corresponding to the universal property 
for the S-homomorphism q> s G © f represented by a dashed line in our diagram. 
Thus ind 5 is a functor, from the category of S-modules to the category of G- 
modules. 

From the universality and uniqueness of the induced module, we get some 
formal properties: 

ind 5 commutes with direct sums : // we have an S-direct sum F ® F', then 
ind 5 (F © F') » ind 5 (F) © indg(F'), 

the direct sum on the right being a G-direct sum. 

Iff g:F' -* F are S-homomorphisms , then 

indj(/ + g) = indf(/) + ind£(g). 

IfT cz S cz G are subgroups of G, and F is a T-module, then 

indf o indf-(F) == ind^(F). 
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In all three cases, the equality between the left member and the right member 
of our equations follows at once by using the uniqueness of the universal object. 
We shall leave the verifications to the reader. 

To prove the existence of the induced module, we let Mq(F) be the additive 
group of functions/: G -► F satisfying 

=fm 

for o e S and £, e G. We define an operation of G on Mq(F) by letting 

(o fw 

for (7, £ e G. It is then clear that Mq(F) is a G-module. 


Proposition 7.1. Let q>: F -► Mq(F) be such that cp(x) = (p x is the map 


(p x (ï) = 



if T 4 S 
if t eS. 


Then cp is an S-homomorphism , q>: F -► Mq(F) is universal , and q> is injective. 
The image of q> consists of those elements feM s G (F) such that f(x) = 0 if 
T <£S. 

Proof. Let o e S and xeF. Let zeG. Then 


(<7<P x)(*) = <Px(™)- 


If t eS, then this last expression is equal to q> ax ( t). If t ^ 5, then xo $ 5, and 
hence both q> ax (x) and <p x (xo) are equal to 0. Thus q> is an 5-homomorphism, 
and it is immediately clear that q> is injective. Furthermore, if/e Mq(F) is such 
that f(x) = 0 if t ^ 5, then from the definitions, we conclude that/ = cp x where 

*=f( 1). 

There remains to prove that q> is universal. To do this, we shall analyze more 
closely the structure of M S G (F). 

r 

Proposition 7.2. Let G = (J 5c f be a decomposition of G into right cosets. 

i — 1 

Let Fj be the additive group offunctions in having value 0 at elements 

ÇeG, l; 4 S. Then 

M S C (F)= ©cr 1 ^, 

i = 1 

the direct sum being taken as an abelian group. 

Proof. For each/e Mq(F), let/ be the function such that 


m = 



if £ $ Sci 
if ÇeSâi. 
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For all o e S we have/(<Tc,) = (c i / i X<r)- It is immediately clear that c ; /■ lies in 
Fj, and 


/= X / r 1 (/•/)• 

i= 1 

Thus Mg(F) is the sum of the subgroups c^F^. It is clear that this sum is 
direct, as desired. 

We note that {c f \..., c^ 1 } form a system of representatives for the 
cosets of S in G. The operation of G on M S G (F) is defined by the presceding direct 
sum decomposition. We see that G permutes the factors transitively. The factor 
F x is S-isomorphic to the original module F, as stated in Proposition 7.1. 

Suppose that instead of considering arbitrary modules, we start with a com- 
mutative ring R and consider only R- modules E on which we have a representation 
of G, i.e. a homomorphism G —> Aut r(E), thus giving rise to what we call a 
(G, /?)-module. Then it is clear that all our constructions and definitions can be 
applied in this context. Therefore if we have a representation of S on an R- module 
F, then we obtain an induced representation of G on ind^(F). Then we deal with 
the category C of S-homomorphisms of an (S, F)-module into a (G, F)-module. 
To simplify the notation, we may write “G-module” to mean “(G, F)-module” 
when such a ring R enters as a ring of coefficients. 

Theorem 7.3. Let {X u ..., X r ) be a system of left coset representatives of S in 

G. There exists a G-module E containing F as an S-submodule , such that 

£ = 0 A ( F 

í = 1 

is a direct sum (as R-modules). Let ç : F —> E be the inclusion mapping. Then 

ç is universal in our category C, i.e. E is an induced module. 

Proof. By the usual set-theoretic procedure of replacing Fj by F in M G (F\ 
obtain a G-module E containing F as a S-submodule, and having the desired 
direct sum decomposition. Let cp': F -> E' be an S-homomorphism into a 
G-module E'. We define 

h.E^E' 


by the rule 


h(X^x ! + • • • + À r X r ) = X^Cpfx j) + • ■ • + A r <p'(*r) 

for Xi e F. This is well defined since our sum for E is direct. We must show that 
h is a G-homomorphism. Let ogG. Then 

— ^<t(Î) T <7, Ì 

where <r(/) is some index depending on o and i, and x ati is an element of S, also 
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depending on o 9 i. Then 

hiaXiXi) = hiX^T.'iXi) = ^(p'ÌT^iXi). 

Since cp' is an S-homomorphism, we see that this expression is equal to 

1*0)* *.i<P'(xd = ch(XiXi). 

By linearity, we conclude that h is a G-homomorphism, as desired. 

In the next proposition we return to the case when R is our field k. 

Proposition 7.4. Let xjj be the character of the representation of S on the 
k-space F. Let E be the space of an induced representation. Then the character 
X of E is equal to the induced character \f G y i.e. is given by the formula 

l(0 = Z 'l'oicÇc- 1 ), 

c 

xvhere the sum is tahen over the right cosets cofS in G, c is a fixed coset repre - 
sentative for c, and \jj 0 is the extension ofxjjtoG obtained by setting ý 0 (o) = 0 
ifo^S. 

Proof. Let {w l9 ..., xv m } be a basis for F over k. We know that 

E = @c~ l F. 

Let o be an element of G. The elements {cô~ l Wj} c j form a basis for E over k. 
We observe that cocô~ 1 is an element of S because 

Sco — Sco = Scô. 


We have 


Let 


o(cO l XVj) = C l (cOCO 1 )\Vj. 


(coco l )j 

be the components of the matrix representing the effect of cocô ~ 1 on F with 
respect to the basis {vv^,..., xv m }. Then the action of o on E is given by 

o(cd~ 'iVy) = c" 1 Z (coccf- 

= Z (cocô~ ') w (c _ 'vv^). 


x(o) = I I (coco %. 


By definition, 
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But ca = c if and only if cac 1 € 5. Furthermore, 

l/'(C(7C“ 1 ) = £(C<7C"%. 
j 

Hence 

z(ff) = I 

C 

as was to be shown. 

Remark. Having given an explicit description of the representation space 
for an induced character, we have in some sense completed the more elementary 
part of the theory of induced characters. Readers interested in seeing an application 
can immediately read §12. 

Double cosets 

Let G be a group and let S be a subgroup. To avoid superscripts we use the 
following notation. Let yeG. We write 

[y]S = y.S'y" 1 and 5[y] = y ~ l Sy. 

We shall suppose îhat S has finite index. We let H be a subgroup. A subset of G 
of the form HyS is called a double coset. As with cosets, it is immediately 
verified that G is a disjoint union of double cosets. We let {y} be a family of 
double coset representatives, so we have the disjoint union 

G = U HyS. 
y 

For each y we have a decomposition into ordinary cosets 

H = U tJH n[y]5), 

Ty 

where {r 7 } is a finite family of elements of H, depending on y. 

Lemma 7.5. The elements {r y y} form a family of left coset representatives 
for S in G; that is, we have a disjoint union 

G = U r y yS. 

y,Ty 

Proof. First we have by hypothesis 

G = U U tJH n [y]5)y5, 

y T y 

and so every element of G can be written in the form 

Tyys x y~ x ys 2 = T y ys with s Ì9 s 2 , s e S. 

On the other hand, the elements T y y represent distinct cosets of 5, because if 
T y yS = T y >y'S 9 then y = y\ since the elements y represent distinct double cosets, 
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whence r y and iy represent the same coset of ySy l , and therefore are equal. 
This proves the lemma. 

Let F be an S-module. Given y e G, we denote by [y]F the [y]S-module 
such that for ysy~ l e [y]S, the operation is given by 

ysy~ l • [y]x = [y]s;ï. 

This notation is compatible with the notation that if F is a submodule of a G- 
module E , then we may form yF either according to the formal definition above, 
or according to the operation of G. The two are naturally isomorphic (essentially 
equal). We shall write 

[y] : F —> yF or [y]F 

for the above isomorphism from the 5-module F to the [y]S-module yF. If Sj 
is a subgroup of S, then by restriction F is also an S r module, and we use [y] 
also in this context, especially for the subgroup H fì [y]S which is contained in 

[y]S. 

Theorem 7.6. Applied to the S-module F, we have an isomorphism of H- 
modules 

resg * indíf - 0 indg n[y ]s 0 ™ l $ [y]S ° [y] 
y 

where the direct sum is taken over douhle coset representatives y. 

Proof. The induced module ind^(F) is simply the direct sum 

ind §(F) = 0 r y yF 

y,Ty 

by Lemma 7.5, which gives us coset representatives of S in G, and Theorem 
7.3. On the other hand, for each y, the module 

0 T y y f 

T y 

is a representation module for the induced representation from /ffl[y]S on y F 
to H. Taking the direct sum over y, we get the right-hand side of the expression 
in the theorem, and thus prove the theorem. 

Remark. The formal relation of Theorem 7.6 is one which occurred in 
Artin’s formalism of induced characters and L-functions; cf. the exercises and 
[La 70], Chapter XII, §3. For applications to the cohomology of groups, see 
[La 96]. The formalism also emerged in Mackey’s work [Ma 51], [Ma 53], which 
we shall now consider more systematically. The rest of this section is due 
to Mackey. For more extensive results and applications, see Curtis-Reiner 
[CuR 81], especially Chapter 1. See also Exercises 15, 16, and 17. 

To deal more systematically with conjugations, we make some general func- 
torial remarks. Let E be a G-module. Possibly one may have a commutative ring 
R such that E is a (G, R)-module. We shall deal systematically with the functors 
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Hom G , £ v , and the tensor product. Let 

A : E \E 

by a /?-isomorphism. Then interpreting elements of G as endomorphisms of E 
we obtain a group ÀGÀ -1 operating on À£. We shall also write [À]G instead of 
ÀGÀ" 1 . Let E i, E 2 be (G, /?)-modules. Let À^ : E t —> À z £ z be /?-isomorphisms. 
Then we have a natural /?-isomorphism 

(1) À 2 Hom G (£!, £ 2 )À^ 1 = Hom A2GA -i(Ài£i, À 2 £ 2 ), 
and especially 

[À]Hom G (£, E) = Hom [A]G (À£, À£). 

As a special case of the general situation, let //, S be subgroups of G, and let 
£j, F 2 be (//, /?)- and (5, £)-modules respectively, and let cr, r e G. Suppose 
that (t~ 1 t lies in the double coset D = //y5. Then we have an /?-isomorphism 

(2) Hom [CT j //n [r]s([ ír ]^i > [t]£ 2 ) ~ Hom//n[*y]s(^i, \yWi)' 

This is immediateby conjugation, writing r = ahys with heH,seS, conjugating 
first with [oh}~\ and then observing that for seS, and an S-module £, we 
have [s]S = 5, and [s -1 ]£ is isomorphic to £. In light of (2), we see that the 
£-module on the left-hand side depends only on the double coset. Let D be a 
double coset. We shall use the notation 

M d (£i, £ 2 ) = Hom/zn^s (^i> [yWi) 
where y represents the double coset D. With this notation we have: 

Theorem 7.7. Let H , S be subgroups of finite index in G. Let F x , F 2 be 
( H, R) and (S, R)-modules respectively. Then xve have an isomorphism of R- 
modules 

HomcOndgCF,), ind£(F 2 )) - 0 M D (F U F 2 ), 

D 

where the direct sum is tahen over all double cosets HyS = D. 

Proof. We have the isomorphisms: 

Hom c (indg(F,), indg(F 2 )) » Hom w (F,, resg « indf(F 2 )) 

* 0 Hom w (F], ind^fn[yjs ° res^f [y]S ° [y]F 2 ) 

y 

~ @ Hom // n[ T ]5(^i > [yWi) 
y 

by applying the definition of the induced module in the first and third step, and 
applying Theorem 7.6 in the second step. Each term in the last expression is 
what we denoted by M D (F U F 2 ) if y is a representative for the double coset D. 
This proves the theorem. 
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Corollary 7.8. Let R = k = C. Let S, H be subgroups of the finite group 
G. Let D = HyS range over the double cosets, with representatives y. Let \ 
be an ejfective character of H and i/j an effective character of S. Then 

<indg((*), ind<?(i//)) G = 'Z(x, [y]<//>//n^js- 

Proof. Immediate from Theorem 5.17(b) and Theorem 7.7, taking dimen- 
sions on the left-hand side and on the right-hand side. 


CoroIIary 7.9. (Irreducibility of the induced character). Let S be a 

subgroup of thefinite group G. Let R = k = C. Let ỳbe an effective character 

of S. Then ind^i/O is irreducible ifand only if i/j is irreducible and 

<</', [y]</ , )sn [y ]s = 0 

for all y e G, y £ S. 

Proof. Immediate from Corollary 7.8 and Theorem 5.17(a). It is of course 
trivial that if i/j is reducible, then so is the induced character. 

Another way to phrase Corollary 7.9 is as follows. LetF, F’ be representation 
spaces for S (over C). We call F, F’ disjoint if no simple S-space occurs both 
in F and F’. Then Corollary 7.9 can be reformulated: 


Corollary 7.9'. Let S be a subgroup of the finite group G. Let F be an 
(S, k)-space (xvith k = C). Then indf(F) is simple if and only if F is simple 
and for all y e G and y £ S, the S fì [y]S-modules F and [y]F are disjoint. 

Next we have the commutation of the dual and induced representations. 


Theorem 7.10. Let S be a subgroup ofG and let F be afinitefree R-module. 
Then there is a G-isomorphism 

indf(F v ) = (indf(F)) v . 

Proof. Let G = LJ À,S be a left coset decomposition. Then, as in Theorem 
7.3, we can express the representation space for ind^ (F) as 

indf(F) = 0À,F. 

We may select Àj = 1 (unit element of G). There is a unique /?-homomorphism 

f ■ F v (ind£(F)) v 

such that for <p e F v and x e F we have 


/(<p)(À,x) = 


0 if i* 1 
<p(x) if i = 1, 


which is in fact an F-isomorphism of F v on (À]F) V . We claim that it is an S- 
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homomorphism. This is a routine verification, which we write down. We have 

fO if i * 1 

/([<r]*)(M , 

Lcr(<p(cr 'x)) ìf i = 1. 

On the other hand, note that if cr e 5 then cr~ 'à, e S so cr" 'à^.ï e À]F for 
jef; but if cr ^ 5, then cr _1 À, £ 5 for i ¥= 1 so ct _ 1 à,a: £ À,F. Hence 

_, í 0 if í + 1 

[«■](/( <P))(A]T) = cr/(<p)(cr À, jc) = | , 

lcr(<p(er ‘x)) ìf i = 1. 

This proves that / commutes with the action of S. 

By the universal property of the induced module, it follows that there is a 
unique (G, /?)-homomorphism 

indŷ(/):indF(F v )^(ind^(F)) v , 

which must be an isomorphism because /was an isomorphism on its image, the 
À r component of the induced module. This concludes the proof of the theorem. 

Theorems and definitions with Hom have analogues with the tensor product. 
We start with the analogue of the definition. 

Theorem 7.11. Let S be a subgroup of finite index in G. Let F be an S- 
module, and E a G-module (over the commutative ring R). Then there is an 
isomorphism 

ind£(res 5 (E) ® F) « E ® ind£(F). 

Proof. The G-module ind 5 (F) contains Fas a summand, because it is the 
direct sum (j^À z F with left coset representatives À z as in Theorem 7.3. Hence 
we have a natural 5-isomorphism 

/: res 5 (E) ® F E ®Àj F C E® ind£(F). 

taking the representative À^ to be 1 (the unit element of G). By the universal 
property of induction, there is a G-homomorphism 

ind 5 (/) : ind 5 (res s (F) ® F) -► E ® ind^(F), 

which is immediately verified to be an isomorphism, as desired. (Note that here 
it only needed to verify the bijectivity in this last step, which comes from the 
structure of direct sum as /?-modules.) 

Before going further, we make some remarks on functorialities. Suppose we 
have an isomorphism G G', a subgroup H of G corresponding to a subgroup 
H' of G’ under the isomorphism, and an isomorphism F ~ F' from an H -module 
F to an H ’-module F' commuting with the actions of H , H'. Then we get an 
isomorphism 


indg(F) = indg/F'). 
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In particular, we could take cr e G, let G' = [cr]G = G, H' = [cr]H and 
F' = [cr]F. 

Next we deal with the analogue of Theorem 7.7. We keep the same notation 
as in that theorem and the discussion preceding it. With the two subgroups H 
and 5, we may then form the tensor product 

l<r]F { ® [t]F 2 

with o-, r e G. Suppose ! r e D for some double coset D = HyS. Note that 
[o-]F x ® [t]F 2 is a [cr]// fl [r]5-module. By conjugation we have an isomorphism 

(3) ind^ ]Hn[ T] 5 ([o’]^i ® [t]F 2 ) = ind# n[y]S (F, ® [y]F 2 ). 

Theorem 7.12. There is a G-isomorphism 

indg(F,) ® ind s (F 2 ) == 0 indg n [ y] s(F\ ® [y]F 2 ), 
y 

where the sum is talcen over double coset representatives y. 

Proof. We have: 

indg(F!) ® inds(F 2 ) ~ indg(F! ® res^ indf(F 2 )) by Theorem 7.11 

“ 0 indg(F, ® indg nMS res wn [^([y]F 2 ) by Theorem 7.6 

y 

« 0 indg^indg n [y]s (res# n[y]s (F,) ® res|^ [7]s ([y]F 2 )^ by Theorem 7.7 

« (J) indg n [<y]s(^j ® [y]F 2 ) by transitivity of induction 

where we view Fj fì [y]F 2 as an H D [y]5-module in this last line. This proves 
the theorem. 

General comment. This section has given a lot of relations for the induced 
representations. In light of the cohomology of groups, each formula may be 
viewed as giving an isomorphism of functors in dimension 0, and therefore gives 
rise to corresponding isomorphisms for the higher cohomology groups H q . The 
reader may see this developed further than the exercises in [La 96]. 
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The next three sections , xvhich are essentially independent of each other , give 
examples of induced representations. In each case , we show that certain 
representations are either induced from certain well-known types , or are linear 
combinations with integral coeffcients of certain well-known types. The most 
striking feature is that we obtain all characters as linear combinations of in- 
duced characters arising from ì-dimensional characters. Thus the theory of 
characters is to a large extent reduced to the study of 1 - dimensional , or abelian 
characters. 


§8. POSITIVE DECOMPOSITION OF THE 
REGULAR CHARACTER 

Let G be a finite group and let k be the complex numbers. We let 1 G be the 
trivial character, and r G denote the regular character. 

Proposition 8.1. Let H be a subgroup of G, and let i/j be a character of H. 
Let il/ G be the induced character. Then the multiplicity of 1 H in i/j is the same 
as the multiplicity of \ G in i \j g . 

Proof. By Theorem 6.1 (i), we have 


1 h)h ~ (*A G > 1 g)g • 

These scalar products are precisely the multiplicities in question. 

Proposition 8.2. The regular representation is the representation induced 
by the trivial character on the trivial subgroup of G. 

Proof This follows at once from the defìnition of the induced character 
<A G (t) = 2 «/'//(crro-' 1 ), 

creC 

taking \/j = 1 on the trivial subgroup. 

Corollary 8.3. The multiplicity ofì G in the regular character r G is equal to 1. 
We shall now investigate the character 

U G ~ r G ~ 1 G ’ 

Theorem 8.4. (Aramata). The character nu G is a linear combination with 
positive integer coefficients of characters induced by 1 -dimensional characters 
of cyclic subgroups of G. 

The proof consists of two propositions, which give an explicit description of 
the induced characters. I am indebted to Serre for the exposition, derived from 
Brauer’s. 
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If A is a cyclic group of order a, we define the function d A on A by the condi- 
tions: 


8a(<?) = 


if o is a generator of A 
otherwise. 


We let Â A = cp(a)r A — d A (where q> is the Euler function), and X A = 0 if a = 1. 
The desired result is contained in the following two propositions. 


Proposition 8.5. Let Gbe a finite group of order n. Then 

nu G = Yj A£, 

the sum being taken over all cyclic subgroups of G. 

Proof Given two class functions x, °n G, we have the usual scalar 
product: 

<'/'» z>g = - £ ' l '( a M °)- 
n ae c 

Let ip be any class function on G. Then: 

<i/í, nu G > = <i/í, nr G ) - (\j/, nl G > 

= n<A(l) - £ 

(T 6 G 

On the other hand, using the fact that the induced character is the transpose of 
the restriction, we obtain 

£<i/f, Aj) = £<|/<|A, a m ) 

= £<'/'U, <P(a)^ ~ 0<> 

= £ <?(<#( 1) - £ - £ ai//((T) 

A A n g en A 

= ml/(l) - £ i/<(<7). 

<T E G 


Since the functions on the right and left of the equality sign in the statement of our 
proposition have the same scalar product with an arbitrary function, they are 
equal. This proves our proposition. 

Proposition 8.6. // A # {1}, the function À A is a linear combination of ir- 

reducible nontrivial characters of A with positive integral coefficients. 
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Proof. If A is cyclic of prime order, then by Proposition 8.5, we know that 
X A = nu A , and our assertion follows from the standard structure of the regular 
representation. 

In order to prove the assertion in general, it suffices to prove that the Fourier 
coefficients of X A with respect to a character of degree 1 are integers ^ 0. Let 
\j/ be a character of degree 1. We take the scalar product with respect to A, and 
obtain: 

<<P, )-a> = <p(a)<p(l) - X l /'( <7 ) 

o gen 

= <p(a) - X <P(<t) 

o gen 

= £ (1 - H<t)). 

o gen 

The sum £ ^(<r) taken over generators of A is an algebraic integer, and is in fact 
a rational number (for any number of elementary reasons), hence a rational 
integer. Furthermore, if i// is non-trivial, all real parts of 

1 - ìp((j) 

are > 0 if o ^ id and are 0 if o = id. From the last two inequalities, we conclude 
that the sums must be equal to a positive integer. If ip is the trivial character, 
then the sum is clearly 0. Our proposition is proved. 

Remark. Theorem 8.4 and Proposition 8.6 arose in the context of zeta 
functions andL-functions, in Aramata’s proof that the zeta function of a number 
field divides the zeta function of a finite extension [Ar 31], [Ar 33]. See also 
Brauer [Br 47a], [Br 47b]. These results were also used by Brauer in showing 
an asymptotic behavior in algebraic number theory, namely 

log (hR) ~ log D 1/2 for [k : Q]/log D —» 0, 

where h is the number of ideal classes in a number field k , R is the regulator, 
and D is the absolute value of the discriminant. For an exposition of this appli- 
cation, see [La 70], Chapter XVI. 
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§9. SUPERSOLVABLE GROUPS 

Let G be a finite group. We shall say that G is supersolvable if there exists a 
sequence of subgroups 


{1}cG,cGjC...cG, = G 

such that each G f is normal in G, and G i+ JGi is cyclic of prime order. 

From the theory of p-groups, we know that every p-group is super-solvable, 
and so is the direct product of a p-group with an abelian group. 

Proposition 9.1. Every subgroup and every factor group of a super-solvable 
group is supersolvable. 

Proof Obvious, using the standard homomorphism theorems. 

Proposition 9.2. Let G be a non-abelian supersolvable group. Then there 
exists a normal abelian subgroup which contains the center properly. 

Proof Let C be the center of G, and let G = G/C. Let H be a normal 
subgroup of prime order in G and let H be its inverse image in G under the 
canonical map G -► G/C. If o is a generator of //, then an inverse image o of ô, 
together with C, generate H. Hence H is abelian, normal, and contains the 
center properly. 

Theorem 9.3. (Blichfeldt). Let G be a supersolvable group, let k be alge- 
braically closed. Let E be a simple (G, k)-space. If dim* E > 1, then there 
exists a proper subgroup H of G and a simple H-space F such that E is induced 
by F. 

Proof. Since a simple representation of an abelian group is 1-dimensional, 
our hypothesis implies that G is not abelian. 

We shall first give the proof of our theorem under the additional hypothesis 
that E is faithful. (This means that ox = x for all xeE implies o = 1.) It will 
be easy to remove this restriction at the end. 

Lemma 9.4. Let G be a finite group, and assume k algebraically closed. Let 
Ebea simple, faithful G-space over k. Assume that there exists a normal abelian 
subgroup H of G containing the center of G properly. Then there exists a 
proper subgroup H\ of G containing H , and a simple H\-space Fsuch that E 
is the induced module of F from H\ to G. 

Proof. We view E as an H- space. It is a direct sum of simple H- spaces, and 
since H is abelian, such simple H- space is 1-dimensional. 

Let veE generate a 1-dimensional H- space. Let \j/ be its character. If 
we E also generates a 1-dimensional H- space, with the same character then 
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for all a 9 bek and x e H we have 

t (av + bw ) = \j/(x)(av + bw). 

If we denote by the subspace of E generated by all 1-dimensional H- sub- 
spaces having the character \j/ 9 then we have an H -direct sum decomposition 

E = ®F,. 

* 

We contend that E ^ F^. Otherwise, let v e E, v # 0, and oeG. Then o~ l v 
is a 1-dimensional JT-space by assumption, and has character \j/. Hence for 
t g H, 

x(o~ l v) = \j/(x)o~ l v 


(oxo Jy = o\j/(x)o l v = \//(x)v. 

This shows that oxo~ 1 and x have the same effect on the element v of E. Since 
H is not contained in the center of G, there exist xeH and oeG such that 
oxo~ 1 / t, and we have contradicted the assumption that E is faithful. 

We shall prove that G permutes the spaces F^ transitively. 

Let v e Fỳ. For any xeH and o e G, we have 

x(ov) = o(o~ l xo)v = <7i/<(<7 -1 T<7)z; = \l/ a (x)ov 9 

where \l/ a is the function on H given by i/^( t) = \j/(o~ l xo). This shows that o 
maps Fý into F^. However, by symmetry, we see that <r“ 1 maps F^ a into F^, 
and the two maps <r, <x“ 1 give inverse mappings between F^ and F^. Thus G 
permutes the spaces {F 

Let E' = GF^ = 2 vF^ for some fixed \f/ 0 . Then E' is a G-subspace of E , 
and since E was assumed to be simple, it follows that E' = E. This proves that 
the spaces {F^} are permuted transitively. 

Let F = F^ x for some fixed \f/ v Then F is an H-subspace of E. Let H x be 
the subgroup of all elements xeG such that xF = F. Then H^ # G since 
E # Fỳ. We contend that F is a simple H ^-subspace, and that E is the induced 
space of F from H x to G. 

To see this, let G = (J H x c be a decomposition of G in terms of right cosets 
of H v Then the elements {c -1 } form a system of left coset representatives of 
H x . Since 

E= Y, aF 

<reG 

it follows that 


E = Y ê~ lF - 

C 


We contend that this last sum is direct, and that F is a simple H x - space. 
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Since G permutes the spaces {F^}, we see by definition that H x is the isotropy 
group of F for the operation of G on this set of spaces, and hence that the elements 
of the orbit are precisely {c~ l F} 9 as c ranges over all the cosets. Thus the spaces 
{c~ 'F} are distinct, and we have a direct sum decomposition 

£ = ©c-'F. 

C 

If W is a proper JT^-subspace of F, then © c~ 1 W is a proper G-subspace of F, 
contradicting the hypothesis that E is simple. This proves our assertions. 

We can now apply Theorem 7.3 to conclude that E is the induced module 
from F, thereby proving Theorem 9.3, in case E is assumed to be faithful. 

Suppose now that E is not faithful. Let G 0 be the normal subgroup of G 
which is the kernel of the representation G -> Aut k (E). Let G = G/G 0 . Then 
E gives a faithful representation of G. As E is not 1-dimensional, then G is not 
abelian and there exists a proper normal subgroup H of G and a simple H- space 
F such that 

E = ind|(F). 

Let H be the inverse image of H in the natural map G -► G. Then H z> G 0 , 
and F is a simple H- space. In the operation of G as a permutation group of the 
fc-subspaces {oF} aeG9 we know that H is the isotropy group of one component. 
Hence H is the isotropy group in G of this same operation, and hence applying 
Theorem 7.3 again, we conclude that E is induced by F in G, i.e. 

E = indg(F), 


thereby proving Theorem 9.3. 

Corollary 9.5. Let Gbe a product of a p-group and a cyclic group , and let k 
be algebraically closed. IfE is a simple (G, k)-space and is not \-dimensional y 
then E is induced by a \-dimensional representation of some subgroup. 

Proof. We apply the theorem step by step using the transitivity of induced 
representations until we get a 1-dimensional representation of a subgroup. 


§10. BRAUER'S THEOREM 

We let k = C be the field of complex numbers. We let R be a subring of k. 
We shall deal withX^(G), i.e. the ring consisting of all linear combinations with 
coefficients in R of the simple characters of G over k. (It is a ring by Proposition 
2 . 1 .) 
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Let H = {H a } be a fixed family of subgroups of G, indexed by indices {a}. 
We lçt V r (G) be the additive subgroup of X R (G) generated by all the functions 
which are induced by functions in X R (H a ) for some H a in our family. In other 
words, 

V r (G) = s ind § a (X R (H a )). 

a 

We could also say that V R (G) is the subgroup generated over R by all the char- 
acters induced from all the H a . 

Lemma 10.1. V R (G) is an ideal in X R (G). 

Proof. This is immediate from Theorem 6.1. 

For many applications, the family of subgroups will consist of “elementary” 
subgroups: Let p be a prime number. By a />-elementary group we shall mean 
the product of a p-group and a cyclic group (whose order may be assumed prime 
to p, since we can absorb the p-part of a cyclic factor into the p-group). An 
element o e G is said to be /?-regular if its period is prime to p, and p-singular 
if its period is a power of p. Given xeG, we can write in a unique way 

X = CJT 

where o is p-singular, t is p-regular, and o , t commute. Indeed, if p r m is the period 
of x, with m prime to p, then 1 = vp r -f pm whence x = (x m Y(x pr y an d we g et our 
factorization. It is clearly unique, since the factors have to lie in the cyclic 
subgroup generated by x. We call the two factors the p-singular and /^-regular 
factors of x respectively. 

The above decomposition also shows: 

Proposition 10.2. Every subgroup and every factor group of a p-elementary 

group is p-elementary. If S is a subgroup of the p-elementary group P x C, 

xvhere P is a p-group , and C is cyclic , of order prime to p, then 

S = (SnP) x (Sn C). 

Proof. Clear. 

Our purpose is to show , among other things , that if ourfamily {H a } is such that 
every p-elementary subgroup of G is contained in some /í a , then V R (G) = X R (G) 
for every ring R . It would of course suffice to do it for R = Z, but for our pur- 
poses, it is necessary to prove the result first using a bigger ring. The main result 
is contained in Theorems 10.11 and 10.13, due to Brauer. We shall give an 
exposition of Brauer-Tate (Annals of Math ., July 1955). 

We let R be the ring Z[(] where C is a primitive n-th root of unity. There 
exists a basis of R as a Z-module, namely 1, C, ..., 1 f° r some integer N. 

This is a trivial fact, and we can take N to be the degree of the irreducible poly- 
nomial of C over Q. This irreducible polynomial has leading coefficient 1, and 
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has integer coefficients, so the fact that 

form a basis of Z[C] follovvs from the Euclidean algorithm. We don’t need to 
know anything more about this degree N. 

We shall prove our assertion first for the above ring R. The rest then follows 
by using the following lemma. 

Lemma 10.3. If deZ and the constant function d. 1 G belongs to V R then 
d.ì G belongs to V z . 

Proof. We contend that 1, C, •.., Ç N ~ 1 are linearly independent over X Z (G). 
Indeed, a relation of linear dependence would yield 

s N- 1 

É Z c *jXví j = o 

v= 1 j =0 

with integers c vj not all 0. But the simple characters are linearly independent 
over k. The above relation is a relation between these simple characters with 
coefficients in R , and we get a contradiction. We conclude therefore that 

Vr = Vz® VzC®---® Vzí N ~ l 

is a direct sum (of abelian groups), and our lemma follows. 

If we can succeed in proving that the constant function 1 G lies in V R (G\ 
then by the lemma, we conclude that it lies in V Z (G\ and since V Z (G) is an ideal, 
that X Z (G) = V Z (G). 

To prove our theorem, we need a sequence of lemmas. 

Two elements x, x' of G are said to be .p-conjugate if their p-regular factors 
are conjugate in the ordinary sense. It is clear that p-conjugacy is an equivalence 
relation, and an equivalence class will be called a p-conjugacy class, or simply a 
p-class. 

Lemma 10.4. Let f e X R (G), and assume that f(c) e Z for all a e G. Then 
f is constant mod p on every p-class. 

Proof. Let x = crt, where o is p-singular, and t is p-regular, and cr, t com- 
mute. It will suffice to prove that 

/(x) =/(t) (mod p). 

Let H be the cyclic subgroup generated by x. Then the restriction of / to H 
can be written 


Íh — Z 
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with dj £ R , and \j/j being the simple characters of H , hence homomorphisms of 
H into /c*. For some power p r we have x pr = t pr , whence \I/ J (x) pr = ^/t) pr , and 
hence 

f{x) pr =f{T) pr (mod pR). 

We now use the following lemma. 

Lemma 10.5. Let R = Z[£] be as before. Ifa e Z and aepR then a e pZ. 

Proof. This is immediate from the fact that R has a basis over Z such that 
1 is a basis element. 

Applying Lemma 10.5, we conclude that /(x)=/(t) (mod p), because 
b pr = b (mod p) for every integer b. 

Lemma 10.6. Let t be p-regular in G, and let T be the cyclic subgroup 
generated by t. Let C be the subgroup of G consisting of all elements com- 
muting mth t. Let Pbea p-Sylow subgroup of C. Then there exists an element 
\j/ E X r (T X P) such that the inducedfunctionf= \\P has thefollowing properties : 

(i) f(o)e Z for all oeG. 

(ii) f(o) = 0 if o does not belong to the p-class of t. 

(iii) /(t) = (C:P)# 0(mod p). 

Proof. We note that the subgroup of G generated by T and P is a direct pro- 
duct T x P. Let \f/ u ..., \// r be the simple characters of the cyclic group T, and 
assume that these are extended to T x P by composition with the projection: 

T x P- T-/c*. 

We denote the extensions again by \j/ ... ,\j/ r . Then we let 

= Z 


The orthogonality relations for the simple characters of T show that 

^(ty) = lA(t) = (T: 1) for yeP 
^(cr) = 0 if o £ TP, and <j ^ tP. 


We contend that ij/ 0 satisfies our requirements. 
First, it is clear that \j/ lies in X R (TP). 
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We have for cr e G : 

^ a) = WTÏ) L +”* xax ~ l) = JpTÎ)^ a) 

where /ll(ct) is the number of elements x e G such that xox~ l lies in rP. The 
number n(o) is divisible by (F: 1) because if an element x of G moves o into xP 
by conjugation, so does every element of Px. Hence the values of lie in Z. 

Furthermore, n(o) # 0 only if o is p-conjugate to t, whence our condition 
(ii) follows. 

Finally, we can have xxx ~ 1 = xy with yeP only if y = 1 (because the period 
of x is prime to p). Hence p(x) = (C: 1), and our condition (iii) follows. 

Lemma 10.7. Assume that the family of subgroups {H a } covers G (i.e. every 
element of G lies in some H a ). Iff is a class function on G taking its values in 
Z, and such that all the values are divisible by n = (G: 1), then f belongs to 
Vr(G). 

Proof. Let y be a conjugacy class, and let p be prime to n. Every element 
of G is p-regular, and all p- subgroups of G are trivial. Furthermore, p-conjugacy 
is the same as conjugacy. Applying Lemma 10.6, we find that there exists in 
V r (G) a function taking the value 0 on elements o 4 7, and taking an integral 
value dividing n on elements of y. Multiplying this function by some integer, we 
fìnd that there exists a function in V r (G) taking the value n for all elements of y, 
and the value 0 otherwise. The lemma then follows immediately. 

Theorem 10.8. (Artin). Every character of G is a linear combination with 
rational coefficients of induced characters from cyclic subgroups. 

Proof. In Lemma 10.7, let {H a } be the family of cyclic subgroups of G. The 
constant function n.l G belongs to V R (G). By Lemma 10.3, this function belongs 
to V Z (G\ and hence nX z (G) <= V Z (G). Hence 

X Z (G) <= 1 V Z (G), 

thereby proving the theorem. 

Lemma 10.9. Let pbe a prime number , and assume that every p-elementary 
subgroup of G is contained in some H a . Then there exists afunctionf e V R (G) 
whose values are in Z, and = 1 (mod p r ). 

Proof. We apply Lemma 10.6 again. For each p-class y, we can find a func- 
tion f y in V R (G\ whose values are 0 on elements outside y, and ^ 0 mod p for 
elements of y. Let/ = £/ y , the sum being taken over all p-classes. Then 

f(o) # 0 (modp) for all o e G. Taking f^~ l )P r 1 gj ves w hat we want. 
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Lemma 10.10. Let pbe a prime number and assume that every p-elementary 
subgroup of G is contained in some H a . Let n = n 0 p r where n 0 is prime to p. 
Then the constant function n 0 .1 G belongs to K Z (G). 

Proof. By Lemma 10.3, it suffices to prove that n 0 .ì G belongs to V R (G). 
Let/be as in Lemma 10.9. Then 


n 0 A G = n 0 (ì G -/) + n of 


Since n 0 (\ G -/) has values divisible by n 0 p r = n, it lies in V R (G) by Lemma 
10.7. On the other hand, n 0 f e Vr(G) because/ e V R (G). This proves our lemma. 


Theorem 10.11. (Brauer). Assume that for every prime number p, every 
p-elementary subgroup of G is contained in some H a . Then X(G) = V Z (G). 
Every character of G is a linear combination, with integer coefficients, of 
characters induced from subgroups H a . 

Proof. Immediate from Lemma 10.10, since we can find functions n 0 lc i n 
V Z (G) with n 0 relatively prime to any given prime number. 

CoroIIary 10.12. A class function f on G belongs to X(G) if and only if its 
restriction to H a belongs to X(H a ) for each a. 

Proof. Assume that the restriction of/to H a is a character on H a for each a. 
By the theorem, we can write 

1 c = S c a indg a (^ a ) 

a 

where c a e Z, and {j/ a e X(H a ). Hence 

/ = 2 C a ^^Ha^afHc)^ 


using Theorem 6.1. If f fí<x eX(H a ), we conclude that/belongs to X(G). The 
converse is of course trivial. 

Theorem 10.13. (Brauer). Every character of G is a linear combination 
with integer coejficients of characters induced by ì-dimensional characters of 
subgroups. 

Proof. By Theorem 10.11, and the transitivity of induction, it suffices to 
prove that every character of a p-elementary group has the property stated in 
the theorem. But we have proved this in the preceding section, Corollary 9.5. 
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§11. FIELD OF DEFINITION OF A 
REPRESENTATION 

We go back to the general case of k having characteristic prime to #G. Let 
E be a £-space and assume we have a representation of G on E. Let k' be an 
extension field of k . Then G operates on k' ® k E by the rule 

o(a ® x) = a ® ox 

for a g k' and xe E. This is obtained from the bilinear map on the product 
k' x E given by 


(a, x) hh► a (x) (jx. 

We view E' = k' ® k E as the extension of E by k\ and we obtain a representation 
of G on E'. 

Proposition 11.1. Let the notation he as ahove. Then the characters of the 
representations of G on E and on E' are equal. 

Proof. Let {v u ..., v m } be a basis of E over k. Then 
{1 ® v u ..., 1 ® v m ) 

is a basis of E' over k'. Thus the matrices representing an element o of G with 
respect to the two bases are equal, and consequently the traces are equal. 

Conversely, let k' be a field and k a subfìeld. A representation of G on a 
/c'-space E' is said to be definable over k if there exists a /e-space E and a repre- 
sentation of G on E such that E' is G-isomorphic to k' ® k E. 

Proposition 11.2. Let E , F be simple representation spaces for the finite 
group G over k. Let k' be an extension of k. Assume that E , F are not G- 
isomorphic. Then no k'-simple component of E w appears in the direct sum 
decomposition of F k > into k'-simple subspaces. 

Proof Consider the direct product decomposition 

s(k) 

fc[G] = n w 

n= 1 

over /c, into a direct product of simple rings. Without loss of generality, we may 
assume that E , F are simle left ideals of fe[G], and they will belong to distinct 
factors of this product by assumption. We now take the tensor product with 
fe', getting nothing else but fe'[G]. Then we obtain a direct product decomposi- 
tion over fe'. Since R v (k)R^(k) = 0 if v # p 9 this will actually be given by a direct 
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product decomposition of each factor Rfk): 

s(k) m(n) 

k'i g] = n n 

A/=l i=l 

Say E = L v and F = with v # /x. Then R^E = 0. Hence R^E k . = 0 for 
each i = 1,..., m(jj). This implies that no simple component of E k . can be 
G-isomorphic to any one of the simple left ideals of and proves what we 
wanted. 

Corollary 11.3. The simple characters / j,..., x s(k) of G over k are linearly 
independent over any extension k' of k. 

Proof. This follows at once from the proposition, together with the linear 
independence of the k f -simple characters over k'. 

Propositions 11.1 and 11.2 are essentially general statements of an abstract 
nature. The next theorem uses Brauer’s theorem in its proof. 

Theorem 11.4. (Brauer). Let G be a fìnite group of exponent m. Every 
representation of G over the complex numbers (or an algebraically closed field 
of characteristic 0) is definable over the field Q(C m ) where C m is a primitive 
m-th root of unity. 

Proof. Let x he the character of a representation of G over C, i. e. an effective 
character. By Theorem 10.13, we can write 

X = 2cj ind£.(</<,), Cj e Z, 

the sum being taken over a finite number of subgroups Sj, and 0, being a 1- 
dimensional character of Sj. It is clear that each 0 y is definable over Q(£ m ). Thus 
the induced character 0p is definable over Q(Cm)- E ac h can be written 

= X d j/xX^ 4> e z 

where {/^} are the simple characters of G over Q(C m ). Hence 



The expression of x as a linear combination of the simple characters over k is 
unique, and hence the coefïìcient 

Yj c jdjn 


is ^ 0. This proves what we wanted. 
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§12. EXAMPLE: GL 2 OVER A FINITE FIELD 

Let F be a field. We view GL 2 (F) as operating on the 2-dimensional 
vector space V = F 2 . We let F a be the algebraic closure as usual, and we let 
y a = f a xF a = p®y (tensor product over F). By semisimple, we always 
mean absolutely semisimple, i.e. semisimple over the algebraic closure F a . An 
element a e GL 2 (F) is called semisimple if V a is semisimple over F a [a]. A sub- 
group is called semisimple if all its elements are semisimple. 

Let K be a separable quadratic extension of F. Let {ío^, ù) 2 } be a basis of K. 
Then we have the regular representation of K with respect to this basis, namely 
multiplication representing K* as a subgroup of GL 2 (F). The elements of norm 
1 correspond precisely to the elements of SL 2 (F) in the image of K*. A different 
choice of basis of K corresponds to conjugation of this image in GL 2 (F). Let C K 
denote one of these images. Then C K is called a non-split Cartan subgroup. 
The subalgebra 

F[C k \ C Mat 2 (F) 

is isomorphic to K itself, and the units of the algebra are therefore the elements 
of C K ~ K*. 

Lemma 12.1. The subgroup C K is a maximal commutative semisimple 

subgroup. 

Proof. If a e GL 2 (F) commutes with all elements of C K then a must lie in 
F[C k \, for otherwise {1, a} would be linearly independent over F[C K ], whence 
Mat 2 (F) would be c.ommutative, which is not the case. Since a is invertible, a 
is a unit in F[C K \, soae C K , as was to be shown. 

By the split Cartan subgroup we mean the group of diagonal matrices 


(a 0\ 

with a, d e F*. 

\0 dì 

We denote the split Cartan by A, or A(F) if the reference to F is needed. 

By a Cartan subgroup we mean a subgroup conjugate to the split Cartan or 
to one of the subgroups C K as above. 

Lemma 12.2. Every maximal commutative semisimple subgroup of GL 2 (F) 
is a Cartan subgroup , and conversely. 

Proof. It is clear that the split Cartan subgroup is maximal commutative 
semisimple. Suppose that// is a maximal commutative semisimple subgroup of 
GL 2 (F). If H is diagonalizable over F , then H is contained in a conjugate of the 
split Cartan. On the other hand, suppose H is not diagonalizable over F. It is 
diagonalizable over the separable closure of F, and the two eigenspaces of 
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dimension 1 give rise to two characters 

</r, </r' : H -> F s * 

of // in the multiplicative group of the separable closure. For each element 
a e H the values </<a) and (A'(a) are the eigenvalues of a, and for some element 
a e H these eigenvalues are distinct, otherwise H is diagonalizable over F. 
Hence the pair of elements i//(a), \fj\a) are conjugate over F. The image i/^(//) 
is cyclic, and if \l/(a) generates this image, then we see that i/<a) generates a 
quadratic extension K of F. The map 

a i-> \f/(a) with a e H 

extends to an F-linear mapping, also denoted by i/>, of the algebra F[H] into K. 
Since F[H] is semisimple, it follows that ij/ : F[H] —> K is an isomorphism. 
Hence \f/ maps H into K *, and in fact maps H onto F* because H was taken to 
be maximal. This proves the lemma. 

In the above proof, the two characters i//, i//' are called the (eigen)characters 
of the Cartan subgroup. In the split case, if a has diagonal elements, a , d then 
we get the two characters such that i//(a) = a and i //'(a) = d. In the split case, 
the values of the characters are in F. In the non-split case, these values are 
conjugate quadratic over F, and lie in K. 

Proposition 12.3. Let H be a Cartan subgroup ofGL 2 (F) (split or not). Then 

H is of index 2 in its normalizer N(H ). 

Proof. We may view GL 2 (F) as operating on the 2-dimensional vector space 
V* = F a © F a , over the algebraic closure F a . Whether H is split or not, the 
eigencharacters are distinct (because of the separability assumption in the non- 
split case), and an element of the normalizer must either fix or interchange the 
eigenspaces. If it fixes them, then it lies in H by the maximality of H in Lemma 
12.2. If it interchanges them, then it does not lie in //, and generates a unique 
coset of N/H, so that H is of index 2 in N. 

In the split case, a representative of N/A which interchanges the eigenspaces 
is given by 



In the non-split case, let cr\ K —> K be the non-trivial automorphism. Let 
{a, cra} be a normal basis. With respect to this basis, the matrix of a is precisely 
the matrix 


vv = 



Therefore again in this case we see that there exists a non-trivial element in the 
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normalizer of A. Note that it is immediate to verify the relation 

M(o-)M(x)M((t~ { ) = M(ox), 

if M(x) is the matrix associated with an element x e K. 

Since the order of an element in the multiplicative group of a field is prime 
to the characteristic, we conclude: 

IfF has characteristic p, then an element offinite order in GL 2 (F) is semisimple 
ifand only if its order is prime to p. 

Conjugacy classes 

We shall determine the conjugacy classes explicitly. We specialize the sit- 
uation, and from now on we let: 

F = finite field with q elements; 

G = GL 2 (F)\ 

Z = center of G; 

A = diagonal subgroup of G; 

C * K* = a non-split Cartan subgroup of G. 

Up to conjugacy there is only one non-split Cartan because over a finite field 
there is only one quadratic extension (in a given algebraic closure F a ) (cf. 
Corollary 2.7 of Chapter XIV). Recall that 

#(G) = (q 2 - 1 )(q 2 - q) = q(q + 1 )(q ~ l) 2 . 

This should have been worked out as an exercise before. Indeed, F x F has q 2 
elements, and #(G) is equal to the number of bases of F x F. There are q 2 — 1 
choices for a first basis element, and then q 2 - q choices for a second (omitting 
(0, 0) the first time, and all chosen elements the second time). This gives the 
value for #(G). 

There are two cases for the conjugacy classes of an element a. 

Case 1. The characteristic polynomial is reducible, so the eigenvalues lie 
in F. In this case, by the Jordan canonical form, such an element is conjugate 
to one of the matrices 

c :)• c :)• c:) 

These are called central, unipotent, or rational not central respectively. 

Case 2. The characteristic polynomial is irreducible. Then a is such that 
F[a\ ~ E , where E is the quadratic extension of F of degree 2. Then {1, a) is 
a basis of F[a\ over F, and the matrix associated with a under the representation 
by multiplication on F[a] is 


0 —b 
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where a , b are the coefficients of the characteristic polynomial X 2 -F ax + b. 
We then have the following table. 

Table 12.4 


class 

# of classes 

# of elements in the class 

(; °) 

q - 1 

1 

\0 a/ 



C ‘) 

q- 1 

q 2 - 1 

V0 aì 


\0 d) 

\{q - \){q - 2) 

q 2 + q 

with a ¥= d 



aeC - F* 


q 2 - q 


In each case one computes the number of elements in a given class as the index 
of the normalizer of the element (or centralizer of the element). Case 1 is trivial. 
Case 2 can be done by direct computation, since the centralizer is then seen to 
consist of the matrices 


with x =£ 0. The third and fourth cases can be done by using Proposition 12.3. 

As for the number of classes of each type, the first and second cases correspond 
to distinct choices of a e F* so the number of classes is q — 1 in each case. In 
the third case, the conjugacy class is determined by the eigenvalues. There are 
q — 1 possible choices for a, and then q — 2 possible choices for d. But the 
non-ordered pair of eigenvalues determines the conjugacy class, so one must 
divide (q — l)(q — 2) by 2 to get the number of classes. Finally, in the case 
of an element in a non-split Cartan, we have already seen that if o generates 
Gal(A’/F), then M(ox) is conjugate to M(x) in GL^^F). But on the other 
hand, suppose x , x' e K * and M(x ), M(x ') are conjugate in GL 2 (F) under a given 
regular representation of K* on K with respect to a given basis. Then this 
conjugation induces an F-algebra isomorphism on F[C K ], whence an automor- 
phism of K , which is the identity, or the non-tri vial automorphism cr. Consequently 
the number of conjugacy classes for elements of the fourth type is equal to 

#{K) ~ #(F) qLz_í 

2 2 ’ 


which gives the value in the table. 
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Borel subgroup and induced representations 

We let: 

U = group of unipotent elements 
B = Borel subgroup = UA = AU. 

Then #{B) = q(q - \) 2 = (q — 1 )(q 2 - q). We shall construct representations 
of G by inducing characters from B , and eventually we shall construct all irre- 
ducible representations of G by combining the induced representations in a suitable 
way. We shall deal with four types of characters. Except in the fìrst type, which 
is 1-dimensional and therefore obviously simple, we shall prove that the other 
types are simple by computing induced characters. In one case we need to subtract 
a one-dimensional character. In the other cases, the induced character will turn 
out to be simple. The procedure will be systematic. We shall give a table of 
values for each type. We verify in each case that for the character x which we 
want to prove simple we have 

2 1*03)1 2 = #(G), 

jSeG 

and then apply Theorem 5.17(a) to get the simplicity. Once we have done this 
for all four types, from the tables of values we see that they are distinct. Finally, 
the total number of distinct characters which we have exhibited will be equal to 
the number of conjugacy classes, whence we conclude that we have exhibited 
all simple characters. 

We now carry out this program. I myself learned the simple characters of 
GL 2 (F) from a one-page handout by Tate in a course at Harvard, giving the 
subsequent tables and the values of the characters on conjugacy classes. I filled 
out the proofs in the following pages. 

First type 

: F* —> C* denotes a homomorphism. Then we obtain the character 
fji ° det: G —> C*, 

which is 1-dimensional. Its values on representatives of the conjugacy classes 
are given in the following table. 



Table 12.5(1) 



(a 0\ 

(a 1\ 

(a 0\ 


X 

\0 a) 

VO a) 

Vo d)d # a 

a E C — F* 

T: 

o 

O- 

fD 

H(a) 2 

ix(a) 2 

li(ad) 

fJL ° det(a) 
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The stated values are by definition. The last value can also be written 

)it(det á) = i(N KIF (a)), 

viewing a as an element of /f*, because the reader should know from field theory 
that the determinant gives the norm. 

A character of G will be said to be of fìrst type if it is equal to fL° det for 
some fJL. There are q - 1 characters of first type, because #(F*) = q - 1 . 

Secondtype 

Observe that we have B/U = A. A character of A can therefore be viewed 
as a character on B via B/U. We let: 

ifjjjL = res A (/Lt ° det), and view therefore as a character on B. Thus 

We obtain the induced character 

'I'Z = ind f(^). 

Then is not simple. It contains /1 ° det, as one sees by Frobenius reciprocity: 

<ind^,/iodet> G = {\j/ n o det> s = —£ \n ° det(/?)| 2 = 1 . 
Characters \ ~ M ° det will be called of second type. 

The values on the representatives of conjugacy classes are as follows. 


Table 12.5(11) 



Actually, one computes the values of and one then subtracts the value of 
6 ° det. For this case and the next two cases, we use the formula for the induced 
function: 

indg((p)(a) = ^ (p H (i 3a/3“‘) 

where ç H is the function equal to cp on H and 0 outside H. An element of the 
center commutes with all /3 e G, so for <p = the value of the induced character 
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on such an element is 


^^/x(a) 2 = (q + 1 )fi(a) 2 . 


which gives the stated value. 

For an element u = ^ the only elements /3eG such that lies 

in B are the elements of B (by direct verification). It is then immediate that 



= 


which yields the stated value for the character x • Using Table 12.4, one finds 
at once that 2 \x(P)\ 2 ~ #(G)> and hence; 

A character x of second type is simple. 

The table of values also shows that there are q - 1 characters of second type. 
The next two types deal especially with the Cartan subgroups. 


Third type 

i/r: A —> C* denotes a homomorphism. 

As mentioned following Proposition 12.3, the representative w = w A = w _1 for 
N(A)/A is such that 



Thus conjugation by w is an automorphism of order 2 on A. Let [w]ij/ be the 
conjugate character; that is, ([w]i/f)(a) = i//(waw) = i j/(a w ) for a e A. Then 
[w\(p o det) = po det. The characters p o det on A are precisely those which are 
invariant under [w]. The others can be written in the form 

with distinct characters i// Ì9 il/ 2 : F* —> C*. In light of the isomorphism 
B/U ~ A 9 we view ij/ has a character on B. Then we form the induced character 

ij/ G = ind §(if/) = ind ^([w]\f/). 

With ij/ such that [w]ij/ i/>, the characters x ~ sa i^ t0 b e of the 

third type. Here is their table of values. 
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Table 12.5(111) 


X 

Co°) 

n 

*-(“ °) 

VO dìd ¥= a 

a e C - F* 

\j> G 

(/» + [w]lp 

(q + 1 Ma) 

4>(a) 

ij/(a) + if/(a w ) 

0 


The first entry on central elements is immediate. For the second, we have already 
seen that if /3 e G is such that conjugating 


/3 


r 1 efl, 


then /3 e B, and so the formula 
* a{eí > ~ 

immediately gives the value of ifP on unipotent elements. For an element of A 
with a ± d, there is the additional possibility of the normalizer of A with the 
elements w , and the value in the table then drops out from the formula. For 
elements of the non-split Cartan group, there is no element of G which conjugates 
them to elements of B, so the value in the last column is 0. 


We claim that a character \ = \\fi of third type is simple. 

Theproofagainusesthetestforsimplicity,i.e. that^ \x(P)\ 2 = #(G). Observe 
that two elements a, a'eA are in the same conjugacy class in G if and only if 
a = a or a' = [ w]a . This is verified by brute force. Therefore, writing the 
sum X | ý G (p)\ 2 for /3 in the various conjugacy classes, and using Table 12.4, 
we find: 

2 I <A°0) 1 2 = (q + 1 )\q - 1) 

/3eG 

+ (q - l)(q 2 - 1) + (q 2 + q) 2 !</'(«) + <A(« M )I 2 - 

ae(A-F*)/w 

The third term can be written 

\(q 2 + q) 2 (<K«) + iK« w )MeT') + 4>(a ~ w )) 

^ a&A-F* 

= \(q 2 + q) 2 (1 + 1 + < l/(a'~ w ) + i/»(a* v_1 )). 

^ a^A-F* 

We write the sum over a e A — F* as a sum for a e A minus the sum for 
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a e F*. If a e F* then a x ~ w = a w ~ l = 1. By assumption on i/f, the character 

a i-> (//(a 1_vv ) for aeA 

is non-trivial, and therefore the sum over aeA is equal to 0. Therefore, putting 
these remarks together, we find that the third term is equal to 

\(q 2 + q)[2(q - l) 2 - 2 (q - 1) - 2 (q - 1)] = q(q 2 - 1 )(q - 3). 

Hence finally 

2 I^OS)! 2 = (q + 1 )(q 2 - 1 ) + (q~ D (q 2 ~ D + q(q 2 ~ D(q ~ 3) 

/3 eG 

= <?(<7 - D(<? 2 " 1) = #(G), 
thus proving that i/j g is simple. 

Finally we observe that there are \(q — 1 )(q - 2) characters of third type. 
This is the number of characters i/> such that [w]i/> ^ i/f, divided by 2 because 
each pair i/í and [w]i/f yields the same induced character i/f G . The table of values 
shows that up to this coincidence, the induced characters are distinct. 

Fourth type 

0 : K* —> C* denotes a homomorphism, which is viewed as a character on 
C = C K . 

By Proposition 12.3, there is an element vv e N(C) but vv ^ C, w = w _1 . Then 

a h-> waw = [w]a 

is an automorphism of C, but x »-> wxw is also a field automorphism of 
F[C] ~ K over F. Since [K : F] = 2, it follows that conjugation by w is the auto- 
morphism a »-> a 9 . As a result we obtain the conjugate character [w]0 such that 

([w]6)(a) = 0([w) a) = 6(a w ), 

and we get the induced character 

0 G = ind g(0) = indg([w]0). 

Let /jl : F* — > C* denote a homomorphism as in the first type. Let: 

À : F + —> C* be a non-trivial homomorphism. 

(fj t, À) = the character on ZU such that 

<( *' «(C D ) =Mawx) ’ 


(H, A) g = ind^C/tt, A). 
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A routine computation of the same nature that we have had previously gives the 
following values for the induced characters S G and (/x, À) G . 


X 

CD 

c:) 

(• °) 

VO dìd + a 

a e C — F* 

e G 

(q 2 - q)S(a) 

0 

0 

6(a) + 6(a w ) 

(/*, A) G 

(q 2 - Dm(û) 

-fi(a) 

0 

0 


These are intermediate steps. Note that a direct computation using Frobenius 
reciprocity shows that 6 G occurs in the character (res 0 , À) G , where the restriction 
res0 is to the group F*, so res0 is one of our characters p. Thus we define: 

6' = (res 0, À) G - 6 G = ([w]6)\ 


which is an effective character. A character 6' is said to be of fourth type if 6 
is such that 6 # [w]6. These are the characters we are looking for. Using the 
intermediate table of values, one then finds the table of values for those characters 
of fourth type. 


Table 12.5(IV) 


X 

C3 

c:) 

f °) 

\0 d/d # a 

a e C — F* 

6' 

6 + [w]0 

(q - Dfi(«) 

~6(a) 

0 

-6(a) - 6(a w ) 


We claim thaî the characters 6' offourth type are simple. 

To prove this, we evaluate 

2 | 6*'0)I 2 = (q - 1 )\q - 1 ) + (q- D (q 2 ~ D 

/3eG 

+ \(q 2 - q) 2 I 6(a) + 0(a M )| 2 

We use the same type of expansion as for characters of third type, and the final 
value does turn out to be #(G), thus proving that 6' is simple. 

The table also shows that there are|#(C - F*) =\(q 2 ~ q) distinct characters 
of fourth type. We thus come to the end result of our computations. 
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Theorem 12.6. The irreducible characters of G = GL 2 (F) are as follows. 


type 

number of 
that type 

dimension 

i 

I 

/x ° det 

q - 1 

1 

II 

det 

q - 1 

<1 

n 


\{q - 1)0? - 2) 

q -F 1 

IV 

6' from pairs 0 4= [w]d 

\{q ~ 1)<7 

q - 1 


Proof We have exhibited characters of four types. In each case it is imme- 
diate from our construction that we get the stated number of distinct characters 
of the given type. The dimensions as stated are immediately computed from the 
dimensions of induced characters as the index of the subgroup from which we 
induce, and on two occasions we have to subtract something which was needed 
to make the character of given type simple. The end result is the one given in 
the above table. The total number of listed characters is precisely equal to the 
number of classes in Table 12.4, and therefore we have found all the simple 
characters, thus proving the theorem. 


EXERCISES 

1. The group S 3 . Let S 3 be the symmetric group on 3 elements, 

(a) Show that there are three conjugacy classes. 

(b) There are two characters of dimension 1, on S 3 /A 3 . 

(c) Let di (i = 1, 2, 3) be the dimensions of the irreducible characters. Since 

= 6, the third irreducible character has dimension 2. Show that 
the third representation can be realized by considering a cubic equation 
X 3 -I- aX -I- b = 0, whose Galois group is S 3 over a field k . Let V be the k- 
vector space generated by the roots. Show that this space is 2-dimensional 
and gives the desired representation, which remains irreducible after tensoring 
with k a . 

(d) Let G = S 3 . Write down an idempotent for each one of the simple components 
of C [G]. What is the multiplicity of each irreducible representation of G in 
the regular representation on C[G]? 
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2. The groups S 4 and A 4 . Let S 4 be the symmetric group on 4 elements. 

(a) Shovv that there are 5 conjugacy classes. 

(b) Show that A 4 has a unique subgroup of order 4, which is not cyclic, and 
which is normal in S 4 . Show that the factor group is isomorphic to S 3 , so 
the representations of Exercise 1 give rise to representations of S 4 . 

( c ) Using the relation 2 dj = #(S 4 ) = 24, conclude that there are only two other 
irreducible characters of S 4 , each of dimension 3. 

(d) Let X 4 + a 2 X 2 + a { X + a 0 be an irreducible polynomial over a field k, with 
Galois group S 4 . Show that the roots generate a 3-dimensional vector space 
V over k, and that the representation of S 4 on this space is irreducible, so 
we obtain one of the two missing representations. 

(e) Let p be the representation of (d). Define p' by 

p'(o) = p(a) if a is even; 
p'(a) = —p(a) if a is odd. 

Show that p' is also irreducible, remains irreducible after tensoring with k & , 
and is non-isomorphic to p. This concludes the description of all irreducible 
representations of S 4 . 

(f) Show that the 3-dimensional irreducible representations of S 4 provide an 
irreducible representation of A 4 . 

(g) Show that all irreducible representations of A 4 are given by the representations 
in (f) and three others which are one-dimensional. 

3. The quaternion group. Let Q = {±1, ±x, ±y, ±z} be the quaternion group, with 
x 2 = y 2 = z 2 = —1 and x y = —yx, xz = —zx, yz = —zy. 

(a) Show that Q has 5 conjugacy classes. 

Let A = {± 1}. Then Q/A is of type (2,2), and hence has 4 simple characters, 
which can be viewed as simple characters of Q. 

(b) Show that there is only one more simple character of Q, of dimension 2. 
Show that the corresponding representation can be given by a matrix rep- 
resentation such that 



(c) Let H be the quaternion field, i.e. the algebra over R having dimension 4, 
with basis {1, x, y, z) as in Exercise 3, and the corresponding relations as 
above. Show that C ® R H =» Mat 2 (C) (2x2 complex matrices). Relate this 
to (b). 

4. Let S be a normal subgroup of G. Let ý be a simple character of S over C. Show 
that inâs(^) IS simple if and only if ij/ = [a]i// for all cr E S. 

5. Let G be a finite group and 5 a normal subgroup. Let p be an irreducible representation 
of G over C. Prove that either the restriction of p to S has all its irreducible components 
5-isomorphic to each other, or there exists a proper subgroup H of G containing S 
and an irreducible representation 6 of H such that p ~ ind g(0)- 

6. Dihedral group D ln . There is a group of order 2 n (n even integer ^ 2) generated 
by two elements cr, t such that 
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a n = 1, r 2 = 1, and rar = a 1 . 

It is called the dihedral group. 

(a) Show that there are four representations of dimension 1, obtained by the four 
possible values ±1 for a and r. 

(b) Let C„ be the cyclic subgroup of D ln generated by <r. For each integer 
r = 0,. .., n — 1 let i// r be the character of C„ such that 

\l/ r (o r) = C (í = prim. n-th root of unity) 

Let Xr be the induced character. Show that \ r = Xn-r ■ 

(c) Show that for 0 < r < n/2 the induced character \ r 1S simple, of dimension 

2, and that one gets thereby ^ distinct characters of dimension 2. 

(d) Prove that the simple characters of (a) and (c) give all simple characters of 

D 2 n- 

7. Let G be a finite group, semidirect product of A, H where A is commutative and 
normal. Let A A = Hom(A, C*) be the dual group. Let G operate by conjugation on 
characters, so that for a e G, a 6 A, we have 

[a]il/(a) = ijj(a~ l aa). 

Let ip r be representatives of the orbits of H in A A , and let H t (i = 1,. . . , r) 

be the isotropy group of i fa. Let G, = AH r 

(a) For aeA and h E //,, define ipi(ah) = ipi(a). Show that </<, is thus extended 
to a character on G t . 

Let 6 be a simple representation of //, (on a vector space over C). From 
Hi = G f -/A, view 0 as a simple representation of G r Let 

Pi,e = indg/i/», 0). 

(b) Show that p t e is simple. 

(c) Show that p, d ~ p i e > implies i = i' and 6 ~ 0'. 

(d) Show that every irreducible representation of G is isomorphic to some p t e 

8. Let G be a finite group operating on a finite set S. Let C [S] be the vector space 
generated by S over C. Let ip be the character of the corresponding representation 
of G on C[5]. 

(a) Let a 6 G. Show that ip(a) = number of fixed points of cr in S. 

(b) Show that (</<, 1 G ) G is the number of G-orbits in S. 

9. Let A be a commutative subgroup of a finite group G. Show that every irreducible 
representation of G over C has dimension ^ (G : A). 

10. Let F be a finite field and let G = SL 2 ( F). Let B be the subgroup of G consisting of 
all matrices 

E SL 2 ( F), so d = a~ x . 

Let p, : F* —» C* be a homomorphism and let ip^ : B —> C* be the homomorphism 
such that <A M (a) = p(a). Show that the induced character ind^(<A M ) is simple if 

p? ± 1. 
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11. Determine all simple characters of SL 2 ( F), giving a table for the number of such 
characters, representatives for the conjugacy classes, as was done in the text for GL 2 , 
over the complex numbers. 

12. Observe that A s ~ SL 2 ( F 4 ) ~ PSL 2 ( F 5 ). As a result, verify that there are 5 conjugacy 
classes, whose elements have orders 1, 2, 3, 5, 5 respectively, and write down 
explicitly the character table for A 5 as was done in the text for GL 2 . 

13. Let G be a p-group and let G —> Aut(V) be a representation on a finite dimensional 
vector space over a field of characteristic p. Assume that the representation is irre- 
ducible. Show that the representation is trivial, i.e. G acts as the identity on V. 

14. Let G be a finite group and let C be a conjugacy class. Prove that the following two 
conditions are equivalent. They define what it means for the class to be rational 

RAT 1. For all characters x of G, *(<j) E Q for cr e C. 

RAT 2. For all a E C, and j prime to the order of cr, we have cr J E C. 

15. Let G be a group and let H x , H 2 be subgroups of finite index. Let p,, p 2 be repre- 
sentations of H x , H 2 on R- modules F x , F 2 respectively. Let M G (F X , F 2 ) be the R- 
module of functions f: G —> Hom^(F!, F 2 ) such that 

f(h x ah 2 ) = p 2 (h 2 )f(o)p\(h\) 

for all cr E G, h t E //, (i = 1, 2). Establish an /?-module isomorphism 
Hom r (F?,FG)^M g (F x ,F 2 ). 

By Ff we have abbreviated indg (F t ). 

16. (a) Let G x , G 2 be two finite groups with representations on C-spaces E x , E 2 . Let 

E x 0 E 2 be the usual tensor product over C, but now prove that there is an action 
of G x x G 2 on this tensor product such that 

(a x , o- 2 )( x 0 y) = (T x x 0 cr 2 y for a x E Gj, <r 2 E G 2 . 

This action is called the tensor product of the other two. If p x , p 2 are the 
representations of G x , G 2 on E x , E 2 respectively, then their tensor product is 
denoted by p x 0 p 2 . Prove: If p x , p 2 are irreducible then p 2 0 p 2 is also irreducible. 
[Hint: Use Theorem 5.17.] 

(b) Let x\, \2 b e the characters of p x , p 2 respectively. Show that X\ ® Xi 1S the 
character of the tensor product. By definition, 

Xl 0 X2(<7\’ °2) = X\(<7\) X2(V2)- 

17. With the same notation as in Exercise 16, show that every irreducible representation 
of G x x G 2 over C is isomorphic to a tensor product representation as in Exercise 
16. [Hint: Prove that if a character is orthogonal to all the products X\ 0 X 2 °f 
Exercise 16(b) then the character is 0.] 

Tensor product representations 

18. Let P be the non-commutative polynomial algebra over a field k, in n variables. Let 
Xj,..., x r be distinct elements of P x (i.e. linear expressions in the variables t x ,...,t n ) 
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and let a u ..., a r e k. If 


a { x\ -F • • ■ 4- a r x v r = 0 

for all integers v = 1,..., r show that a x = 0 for i = 1,..., r. [Hint: Take the 
homomorphism on the commutative polynomial algebra and argue there.] 

19. Let G be a finite set of endomorphisms of a finite-dimensional vector space £ over the 
field k. For each o e G, let c a be an element of k. Show that if 

IcJV) = û 

a e G 

for all integers r ^ 1, then c a = 0 for all oeG. [ Hint : Use the preceding exercise, and 
Proposition 7.2 of Chapter XVI.] 

20. (Steinberg). Let G be a finite monoid, and /:[G] the monoid algebra over a field k. Let 
G -*■ End*(£) be a faithful representation (i.e. injective), so that we identify G with a 
multiplicative subset of End fc (£). Show that T r induces a representation of G on T r (£), 
whence a representation of /c[G] on T r (£) by linearity. If a e /c[G] and if T r (a) = 0 for 
all integers r ^ 1, show that a = 0. [Hint : Apply the preceding exercise.] 

21. (Burnside). Deduce from Exercise 20 the following theorem of Burnside: Let G be 
a finite group, k a field of characteristic prime to the order of G, and £ a finite 
dimensional (G, k)~ space such that the representation of G is faithful. Then every 
irreducible representation of G appears with multiplicity ^ 1 in some tensor power 
T r (£). 

22. Let X(G) be the character ring of a finite group G, generated over Z by the simple 
characters over C. Show that an element/E X(G) is an effective irreducible character 
if and only if (/, f) G = 1 and /(1) ^ 0. 

23. In this exercise, we assume the next chapter on alternating products. Let p be an 
irreducible representation of G on a vector space £ over C. Then by functoriality we 
have the corresponding representations S r (p) and /\ r (p) on the r-th symmetric power 
and r-th alternating power of £ over C. If x is the character of p, we let S r (x) and 
/\ r (x) be the characters of S r (p) and /\ r (p) respectively, on 5 r (£) and /\ r (E). Let t 
be a variable and let 


a,(X) = 2 S r (x)t r , A,(*) = 2 A '(X)t r - 

r =0 r^= 0 

(a) Comparing with Exercise 24 of Chapter XIV, prove that for x E G we have 

<r,(x)(x) = det(/ - p(x)t)~' and \,(x)(x) = det(/ + p(x)t). 

(b) For a function /on G define ^ n (f) by ty n (f)(x) = f(x n ). Show that 

-j log <T,(x) = Ê 'ì' n (x)t n and - jlog A_,(^) = £ Vix)!". 

Ul n = 1 Ul n =1 

(c) Show that 

n 00 

nS n (x) = 2 'f rr (x)S n ~ r (x) and n/\"(x) = 2 (~O r ~'^ r (x)/\ n ~ r (x)- 

r= 1 r= 1 
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24. Let x be a simple character of G. Prove that y P n (x) is also simple. (The characters 
are over C.) 

25. We now assume that you know §3 of Chapter XX. 

(a) Prove that the Grothendieck ring defined there for Mod c (G) is naturally 
isomorphic to the character ring X(G). 

(b) Relate the above formulas with Theorem 3.12 of Chapter XX. 

(c) Read Fulton-Lang’s Riemann-Roch Algebra, Chapter I, especially §6, and 
show that X(G) is a À-ring, with y P n as the Adams operations. 

Note. For further connections with homology and the cohomology of groups, see 
Chapter XX, §3, and the references given at the end of Chapter XX, §3. 

26. The following formalism is the analogue of Artin’s formalism of L-series in number 
theory. Cf. Artin’s “Zur Theorie der L-Reihen mit allgemeinen Gruppenchar- 
akteren”, Collected papers, and also S. Lang, “L-series of a covering”, Proc. Nat. 
Acad. Sc. USA (1956). For the Artin formalism in a context of analysis, see J. Jor- 
genson and S. Lang, “Artin formalism and heat kemels”, J. reine angew. Math. 447 
(1994) pp. 165-200. 

We consider a category with objects {U}. As usual, we say that a finite group G 
operates on U if we are given a homomorphism p:G -+ Aut((7). We then say that U is a 
G-object, and also that p is a representation of G in U. We say that G operates trivially 
if p(G) = id. For simplicity, we omit the p from the notation. By a G-morphism 
f:U->V between G-objects, one means a morphism such that fo G = c °/for all oeG. 

We shall assume that for each G-object U there exists an object U/G on which G 
operates trivially, and a G-morphism n U G : U -> U/G having the following universal 
property: If/: U -> U' is a G-morphism, then there exists a unique morphism 

f/G : U/G - U'/G 

making the following diagram commutative: 

U - *—* U' 


U/G-^U'/G 

In particular, if H is a normal subgroup of G, show that G/H operates in a natural way 
on U/H. 

Let k be an algebraically closed fìeld of characteristic 0. We assume given a functor 
E from our category to the category of finite dimensional k- spaces. If U is an object in 
our category, and /: U -* U' is a morphism, then we get a homomorphism 

E(f)=f*:E(U)^E(U'). 

(The reader may keep in mind the special case when we deal with the category of 
reasonable topological spaces, and E is the homology functor in a given dimension.) 

If G operates on U, then we get an operation of G on E(U) by functoriality. 

Let U be a G-object, and F: U -+ U a G-morphism. If P F (t) = ]”[ ( l ~ a .) ÌS the 
characteristic polynomial of the linear map : E(U) -> E(U), we define 

z F (t) =n (i - “.0. 
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and call this the zeta function of F. If F is the identity, then Z F (t) = (1 — t) B{U) where 
we define B(U) to be dim* E(U). 

Let x be a simple character of G. Let d x be the dimension of the simple representation 
of G belonging to and n = ord(G). We define a linear map on E(U) by letting 

e t = 7 

” oeG 

Show that e\ = e x , and that for any positive integer \i we have (e x ° F t Y = e x ° F*. 
ifp,(o=n (t — Pj(x)) is the characteristic polynomial of e x ° F*, define 

L F (t, X, U/G) = n (> - PÁ/n 

Show that the logarithmic derivative of this function is equal to 

î*e x o FÏP-'. 

/ 1=1 

Define L f (í, *, U/G) for any character x by linearity. If we write V = U/G by abuse of 
notation, then we also write L F (t, x , U/V). Then for any x> x' we have by definition, 

L f (L X + X\ U/V) = L F (t, x, U/V)L F (t, X \ U/V). 

We make one additional assumption on the situation: 

Assume that the characteristic polynomial of 


is equal to the characteristic polynomial of F/G on E(U/G). Prove the following statement: 

(a) IfG = {1} then 

L F (t, 1, U/U) = Z F (t). 


(b) Let V = U/G. Then 

L F (t, 1, U/V) = Z F (t). 

(c) Let H be a subgroup of G and let \j/ be a character of H. Let W = U/H, and let 
\jj G be the induced character from H to G. Then 

L F (t, ỳ, U/W) = L F (t, il/ G , U/V). 

(d) Let H be normal in G. Then G/H operates on U/H = W. Let ij/ be a character 
of G/H, and let x be the character of G obtained by composing \j/ with the 
canonical map G G/H. Let q> = F/H be the morphism induced on 

U/H = W. 

Then 

L<p(t, \l/, W/V) = L F (t, X , U/V). 

(e) If V= U/G and B(V) = dim k E(V), show that (1 - t) B{V) divides (1 - t) B{U) . 
Use the regular character to determine a factorization of (1 - t) B{U) . 
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27. Do this exercise after you have read some of Chapter VII. The point is that for fields 
of characteristic not dividing the order of the group, the representations can be obtained 
by “reducing modulo a prime”. Let G be a finite group and let p be a prime not 
dividing the order of G. Let F be a finite extension of the rationals with ring of 
algebraic integers o F . Suppose that F is sufficiently large so that all F-irreducible 
representations of G remain irreducible when tensored with Q a = F a . Let p be a 
prime of o F lying above p , and let o p be the corresponding local ring. 

(a) Show that an irreducible (G, F)-space V can be obtained from a (G, o p )- 
module E free over o p , by extending the base from o p to F, i.e. by tensoring 
so that V = E <8) F (tensor product over o p ). 

(b) Show that the reduction mod p of E is an irreducible representation of G in 
characteristic p. In other words, let k = o/p = o p /m p where m p is the maximal 
ideal of o p . Let E(p) = E ® k (tensor product over o p ). Show that G operates 
on E(p) in a natural way, and that this representation is irreducible. In fact, 
if x is the character of G on V, show that \ is also the character on E , and 
that x m0( Ì m p is the character on £(p). 

(c) Show that all irreducible characters of G in characteristic p are obtained as 
in (b). 




CHAPTER XIX _ 

The Alternating Product 


The alternating product has applications throughout mathematics. In differ- 
ential geometry, one takes the maximal alternating product of the tangent space 
to get a canonical line bundle over a manifold. Intermediate alternating products 
give rise to differential forms (sections of these products over the manifold). In 
this chapter, we give the algebraic background for these constructions. 

For a reasonably self-contained treatment of the action of various groups of 
automorphisms of bilinear forms on tensor and alternating algebras, together 
with numerous classical examples, I refer to: 

R. Howe, Remarks on classical invariant theory, Trans. AMS 313 (1989), 
pp. 539-569 


§1 DEFINITION AND BASIC PROPERTIES 

Consider the category of modules over a commutative ring R. 

We recall that an r-multilinear map /: E {r) -> F is said to be alternating 
if /(xj,..., x r ) = 0 whenever x, = Xj for some i / j. 

Let a r be the submodule of the tensor product T r (E) generated by all elements 
of type 

Xj (g) • • • ® x r 

where x, = Xj for some i # j. We define 

A '(£) = T'(E)/a 

Then we have an /•-multilinear map E {r) -> / \ r (E) (called canonical) obtained 


731 
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from the composition 


E (r) - T\E) - T r (E)/a r = [\ r (E). 

It is clear that our map is alternating. Furthermore , it is universal mth respect 
to r-multilinear alternating maps on E. In other words, if /: E (r) -> F is such a 
map, there exists a unique linear map / + : /\ r (E) -> F such that the following 
diagram is commutative: 


A r (£) 



Our map /* exists because we can first get an induced map T r (E) -► F making 
the following diagram commutative: 


T r (E) 



F 

and this induced map vanishes on a r , hence inducing our /*. 

The image of an element (x l9 ..., x r ) e E (r) in the canonical map into 
[\ r (E) will be denoted by x^ a • • • a x r . It is also the image of x^ ® ® x r in 

the factor homomorphism T r (E) -► [\ r (E). 

In this way, A r becomes a functor, from modules to modules. Indeed, let 
u: E —> F be a homomorphism. Given elements jq,..., x r e £, we can map 

(*i, ...,x r )i-+ u(xi ) A • • • a u(x r ) e A r (0' 

This map is multilinear alternating, and therefore induces a homomorphism 

A r («): /\ r (E) ^ Mn. 

The association u i-> A r (“) i s obviously functorial. 

Example. Open any book on differential geometry (complex or real) and 
you will see an application of this construction when E is the tangent space of 
a point on a manifold, or the dual of the tangent space. When taking the dual, 
the construction gives rise to differential forms. 

We let A (E) be the direct sum 

A(£) = © A r ( £ )- 

r—0 
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We shall make A(£) i nt0 a graded /?-algebra and call it the alternating algebra 
of /s, or also the exterior algebra, or the Grassmann algebra. We shall first 
discuss the general situation, with arbitrary graded rings. 

Let G be an additive monoid again, and let A = 0) A r be a G-graded 

reG 

R- algebra. Suppose given for each A r a submodule a r , and let a = © a r . 

reG 

Assume that a is an ideal of A. Then a is called a homogeneous ideal, and we can 
define a graded structure on >4/a. Indeed, the bilinear map 

A r x A s -> A r+S 

sendsa r x A s into a r + s and similarly, sends A r x a s intoa r+s . Thus using repre- 
sentatives in >4 r , A s respectively, we can define a bilinear map 

A r /& r x A s /d s ► A r+s /a r + sì 

and thus a bilinear map A/a x A/a -> A/a, which obviously makes >4/a into a 
graded /?-algebra. 

We apply this to T r (E) and the modules a r defìned previously. If 

Xi = Xj (i / j) 

in a product a • • • a x r , then for any y u ...,y s eE we see that 
X^ A • • • A X r A y x A • - a y s 

lies in a r + s , and similarly for the product on the left. Hence the direct sum 0) a r 
is an ideal of T(E ), and we can define an /?-algebra structure on T(E)/a. The 
product on homogeneous elements is given by the formula 

((x j A * A X r ), (y^ A — • A .y s )) h-> X X A " • A X r A y x A •• • A y s . 

We use the symbol a also to denote the product in /\(E). This product is called 
the alternating product or exterior product. If x e E and y e E, then 

x A y = -y A x, as follows from the fact that (x + y) A (x + y) = 0. 

We observe that /\ is afunctor from the category of modules to the category 
of graded R-algebras. To each linear map f : E -> F we obtain a map 

A(/):A(£)-A(0 

which is such that for x l5 ..., x r e E we have 

A(/)(*I A • • • A X r ) = f(Xi) A • • • A f(x r ). 

Furthermore, /\(f) is a homomorphism of graded K-algebras. 
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Proposition 1.1. Let E be free of dimension n over R. Ifr>\n then 
/\{E) — 0. Let {v í9 . . . , v n } be a basis of E over R. If 1 ^ r g n 9 then 
/\ r (E) is free over R 9 and the elements 

0«, A A V ir , i i < * * * < i r 

form a basis of /\ r (E ) over k. We have 

dim R /\ r (E) = 

Proof. We shall first prove our assertion when r = n. Every element of E 
can be written in the form £ a^and henceusingtheformulax a y = — y a x 
we conclude that v x a • • • a v n generates /\(E). On the other hand, we know 
from the theory of determinants that given ae R, there exists a unique multi- 
linear alternating form f a on E such that 

fa(vu •••>»„) = a • 

Consequently, there exists a unique linear map 

A n (E) R 

taking the value a on v^ a ••• a v n . From this it follows at once that 
v x a • • • a v n is a basis of / \ n (E ) over R. 

We now prove our statement for 1 ^ r ^ n. Suppose that we have a relation 

0 = Z a (0 t ’i, A ••• A »tr 

with /j < • • • < i r and a (i) e R. Select any r-tuple (j) = (j Ì9 . .., j r ) such that 
ji < • • • < j r and let j r+ u ...J n be those values of i which do not appear among 
O'i, ..., j r ). Take the alternating product with v jr+l a • • • a v jn . Then we shall 
have alternating products in the sum with repeated components in all the terms 
except the ( 7 ’)-term, and thus we obtain 

0 = a U) V h A • • • A Vj r A • • • A v jn . 

Reshufïling v jx a • • • a v jn into v x a • • • a v n simply changes the right-hand 
side by a sign. From what we proved at the beginning of this proof, it follows 
that a U) = 0. Hence we have proved our assertion for 1 ^ r ^ n. 

When r = 0, we deal with the empty product, and 1 is a basis for /\°(E) = R 
over R. We leave the case r > n as a trivial exercise to the reader. 

The assertion concerning the dimension is trivial, considering that there is a 
bijection between the set of basis elements, and the subsets of the set of integers 
(l,...,n). 
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Remark. It is possible to give the first part of the proof, for /\ n (E\ without 
assuming known the existence of determinants. One must then show that a„ 
admits a 1-dimensional complementary submodule in T n (E). This can be done 
by simple means, which we leave as an exercise which the reader can look up 
in the more general situation of §4. When R is a field, this exercise is even more 
trivial, since one can verify at once that v x ® ® v n does not lie in a n . This 

alternative approach to the theorem then proves the existence of determinants. 

Proposition 1.2. Let 

0 0 

be an exact sequence offree R-modules offinite ranks r, n, and s respectively. 

Then there is a natural isomorphism 

<p : A r E' ® A 5 A n E. 

This isomorphism is the unique isomorphism having the folloxving property. For 

elements v x ,..., v r e £' and w x ,..., w s e £", let u { ,..., u s be liftings of 

w x , ..., w s in E. Then 

<p((v { A • • • A V r ) ® (w x A • • * A W s )) = V x A • * * A V r A U x A • * * A U s . 

Proof. The proof proceeds in the usual two steps. First one shows the 
existence of a homomorphism <p having the desired effect. The value on the right 
of the above formula is independent of the choice of u x ,..., u s lifting 
w x ,... ,w s by using the alternating property, so we obtain a homomorphism <p. 
Selecting in particular {v x ,.. ., v r } and {w x ,..., vvj to be bases of £' and £" 
respectively, one then sees that <p is both injective and surjective. We leave the 
details to the reader. 

Given a free module £ of rank n, we define its determinant to be 
det £ = A max £ = A n E. 

Then Proposition 1.2 may be reformulated by the isomorphism formula 
det(£') ® det(£") « det(£). 

If R = k is a field, then we may say that det is an Euler-Poincaré map on the 
category of finite dimensional vector spaces over k. 

Example. Let V be a finite dimensional vector space over R. By a volume 
on V we mean a norm || || on det V. Since V is finite dimensional, such a norm 
is equivalent to assigning a positive number c to a given basis of det(V). Such 
a basis can be expressed in the form e x a • • • a e n , where {e x ,. . . , e n } is a basis 
of V. Then for a e R we have 


I|û«i A • • • A e n \\ = \a\c. 
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In analysis, given a volume as above, one then defines a Haar measure p on V 
by defining the measure of a set S to be 

fi(S) = | \\e x A • • • A e n \\ dx x • • • dx n , 
s 

where x x ,. . . , x n are the coordinates on V with respect to the above basis. As 
an exercise, show that the expression on the right is the independent of the choice 
of basis. 

Proposition 1.2 is a special case of the following more general situation. We 
consider again an exact sequence of free R- modules of finite rank as above. With 
respect to the submodule E ' of E, we define 

/\"E = submodule of /\ n E generated by all elements 
x\ a • • • a x'i a y i+ 1 a • * • a y n 
with x'j,... , x\ e E' viewed as submodule of E. 

Then we have a filtration 


= Aì+iE. 


Proposition 1.3. There is a natural isomorphism 

A'f® A" _i £''-A?£/A?+i£- 

Proof Let x'í, . ..,x"_ f be elements of E", and lift them to elements 
yi,...,y n -iOÎE. We consider the map 

(xi, . . . , x \, x'í, . . . , x"_i) I—► x\ A • • • A Xj A A • • • A y n _ f 

with the right-hand side taken mod /\"+ X E. Then it is immediate that this map 
factors through 


A^' ® A n ~ iE " 


A?£/A" + i £ > 


and picking bases shows that one gets an isomorphism as desired. 


In a similar vein, we have: 


Proposition 1.4. Let E = E’ ® E" be a direct sum of finite free modules. 
Then for every positive integer n , we have a module isomorphism 

A n £ « 0 A p e ' ® A ,£ '- 

p + q = n 
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In terms of the alternating algebras , we have an isomorphism 

A E ~ A£' A, A£". 

w //*£ superproduct of graded algebras. 

Proof Each natural injection of E' and E" into E induces a natural map on 
the alternating algebras, and so gives the homomorphism 

/\E' ® /\E" - /\E, 

which is graded, i.e. for p = 0 ,..., n we have 

f\ p E' ® f\ n ~ p E" - f\ n E. 

To verify that this yields the desired isomorphism, one can argue by picking 
bases, which we leave to the reader. The anti-commutation rule of the alternating 
product immediately shows that the isomorphism is an algebra isomorphism for 
the super product f\E' f\E". 

We end this section with comments on duality. In Exercise 3, you will prove: 

Proposition 1.5. Let E be free of rank n over R. For each positive integer 
r, we have a natural isomorphism 

A r (£ v ) - • 

The isomorphism is explicitly described in that exercise. A more precise property 
than “natural” would be that the isomorphism is functorial with respect to the 
category whose objects are finite free modules over /?, and whose morphisms 
are isomorphisms. 

Examples. Let L be a free module over R of rank 1. We have the dual 
module L v = Hom^(L, /?), which is also free of the same rank. For a positive 
integer m, we define 

L®~ m - (jf }®m - jy (g) ... 0 iy (tensor product taken m times). 

Thus we have defined the tensor product of a line with itself for negative integers. 
We define L®° = R. You can easily verify that the rule 

L®P (g) JjS>q 25= JjSiP+q) 

holds for all integers p, q e Z, with a natural isomorphism. In particular, if 
q = —p then we get R itself on the right-hand side. 

Now let E be an exact sequence of free modules: 

E : 0 — > Eq —■> E\ —> E m — > 


0 . 
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We define the determinant of this exact sequence to be 

det(E) = 0 det^)^' 1 ^. 

As an exercise, prove that det(E) has a natural isomorphism with R , functorial 
with respect to isomorphisms of exact sequences. 

Examples. Determinants of vector spaces or free modules occur in several 
branchesof mathematics, e.g. complexes of partial differential operators, homol- 
ogy theories, the theory of determinant line bundles in algebraic geometry, etc. 
For instance, given a non-singular projective variety V over C, one defines the 

determinant of cohomology of V to be 

det H(V) = (g) det H\\ 0® (_l)í , 

where H l (V) are the cohomology groups. Then det H(V) is a one-dimensional 
vector space over C, but there is no natural identification of this vector space 
with C, because a priori there is no natural choice of a basis. For a notable 
application of the determinant of cohomology, following work of Faltings, see 
Deligne, Le determinant de la cohomologie, in Ribet, K. (ed.), Current Trends 
in Arithmetical Algebraic Geometry , Proc. Arcata 1985. (Contemporary Math. vol 
67, AMS (1985), pp. 93-178.) 


§2. FITTING IDEALS 

Certain ideals generated by determinants are coming more and more into 
use, in several branches of algebra and algebraic geometry. Therefore I include 
this section which summarizes some of their properties. For a more extensive 
account, see Northcott’s book Finite Free Resolutions which I have used, as well 
as the appendix of the paper by Mazur-Wiles: “Class Fields of abelian extensions 
of Q,” which they wrote in a self-contained way. (Invent. Math. 76 (1984), pp. 
179-330.) 

Let R be a commutative ring. Let A be a p x q matrix and B a q x 5 matrix 
with coefficients in R. Let r ^ 0 be an integer. We define the determinant ideal 
I r (A) to be the ideal generated by all determinants of r x r submatrices of A. 
This ideal may also be described as follows. Let S p r be the set of sequences 

J = Ol, • • • Jr) WÌth 1 g j, < j 2 < ■ ■ ■ < )r ú P ■ 

Let A = (aij). Let 1 ^ r ^ min(p, q). Let K = (k u ..., k r ) be another element 
of S p . We define 


l hki 

a j\k2 

' ' a UK 

l j2k 1 

a j2k2 

a jìk r 

ijrk 1 

a jrk 2 ' 

' ' U jrkr 


A( r ) — 
h JK — 
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where the vertical bars denote the determinant. With J, K ranging over S p 
we may view A { J ] K as the JX-component of a matrix A (r) which we call the r-th 
exterior power of A. 

One may also describe the matrix as follows. Let {e x ,. .. , e p } be a basis of 
RP and {w l5 . . . , u q } a basis of R q . Then the elements 

e h A A e jr O'i <7*2 < ••• < Jr) 

form a basis for f\JR p and similarly for a basis of ffR q . We may view A as a 
linear map of R p into R q , and the matrix A (r) is then the matrix representing the 
exterior power f\A viewed as a linear map of f\R p into f\R q . On the whole, 
this interpretation will not be especially useful for certain computations, but it 
does give a slightly more conceptual context for the exterior power. Just at the 
beginning, this interpretation allows for an immediate proof of Proposition 2.1. 

For r = 0 we define A {0) to be the 1 x 1 matrix whose single entry is the 
unit element of R. We also note that A {1) = A. 

Proposition 2.1. Let A be a p x q matrix and B a q X s matrix. Then 
(AB) (r) = A ir) B (r) for r ^ 0. 

If one uses the alternating products as mentioned above, the proof simply 
says that the matrix of the composite of linear maps with respect to fixed bases 
is the product of the matrices. If one does not use the alternating products, then 
one can prove the proposition by a direct computation which will be left to the 
reader. 

We have formed a matrix whose entries are indexed by a finite set S p . For 
any finite set S and doubly indexed family ( c JK ) with J, K e S we may also 
define the determinant as 

det (c JK ) = Yj e ( a )\ n c J,<r(j)] 

<r \J eS / 

where <x ranges over all permutations of the set. 

For r ^ 0 we define the determinant ideal I r (A) to be the ideal generated by 
all the components of A (r) , or equivalently by all r x r subdeterminants of A. 
We have by definition 

A {0) = R and A {1) = ideal generated by the components of A. 


Furthermore 


and the inclusions 


I r (A) = 0 for r > min (p, q) 


R = I 0 (A) => IfA) => I 2 (A) => • • • 
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By Proposition 10.1, we also have 

(1) IÁAB) c= I r (A) n I r (B). 

Therefore, if A = UBU' where U , U' are square matrices of determinant 1, then 

(2) I r (A) = I r (B\ 

Next, let E be an R-module. Let x ì9 ...,x q be generators of E. Then we 
may form the matrix of relations (a u ..., a q ) e R q such that 

Z a i x i = °- 

i — 1 

Suppose first we take only finitely many relations, thus giving rise to a p x q 
matrix A. We form the determinant ideal I r (A). We let the determinant ideals 
of the family of generators be: 

I r (x u ...,x q ) = I r (x) = ideal generated by I r (A) for all A. 

Thus we may in fact take the infinite matrix of relations, and say that I r (x) is 
generated by the determinants of all r x r submatrices. The inclusion relations 
of (1) show that 


R = I 0 (x) => I^x) => I 2 (x) 3 ••• 

I r (x) = 0 if r > q. 

Furthermore, it is easy to see that if we form a submatrix M of the matrix of all 
relations by taking only a family of relations which generate the ideal of all 
relations in R q , then we have 


I r (M) = I r (x). 

We leave the verification to the reader. We can take M to be a finite matrix when 
E is finitely presented, which happens if R is Noetherian. 

In terms of this representation of a module as a quotient of R q , we get the 
following characterization. 

Proposition 2.2. Let R q —> E —> 0 be a representation ofEasa quotient of 
R q , and let x x , ..., x q be the images of the unit vectors in R q . Then I r (x) is the 
ideal generated by all values 


A(wj,..., w r ) 

xvhere vv l5 ..., w r e Ker (R q -> E) and X e L r a (R q , R). 

Proof. This is immediate from the definition of the determinant ideal. 
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The above proposition can be useful to replace a matrix computation by a 
more conceptual argument with fewer indices. The reader can profitably trans- 
late some of the following matrix arguments in these more invariant terms. 

We now change the numbering, and let the Fitting ideals be: 

F k (x) = I q - k (x) for 0^k^q 

F k (x ) = R when k > q. 

Lemma 2.3. The Fitting ideal F k (x) does not depend on the choice of 
generators (*). 

Proof. Let y u ..., y s be elements of E. We shall prove that 

I r (x) = / r+s (x, y). 

The relations of (x, y ) constitute a matrix of the form 



By elementary column operations, we can change this to a matrix 

A C 
0 1 

and such operations do not change the determinant ideals by (2). Then we 
conclude that for all r ^ 0 we have 

I r (A) = I r+s (W)c:I r+s (x 9 y). 

This proves that I r (x) cz I r+S (x , y). 

Conversely, let C be a matrix of relations between the generators (x, y). 
We also have a matrix of relations 



By elementary row operations, we can bring this matrix into the same shape 
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as B above, with some matrix of relations A' for (x), namely 



Then 

IÁA’) = I r+s (Z f ) = I r+s (Z) => I r+s (C\ 

whence I r+s (C) cz I r (x). Taking all possible matrices of relations C shows 
that / r+s (x, y) cz / r (x), which combined with the previous inequality yields 
/f+sC^» y) i r(x)- 

Now given two families of generators (x) and (y), we simply put them side 
by side (x, y) and use the new numbering for the F k to conclude the proof of 
the lemma. 

Now let E be a finitely generated K-module with presentation 

where the sequence is exact and K is defined as the kernel. Then K is generated 
by g-vectors, and can be viewed as an infinite matrix. The images of the unit 
vectors in R q are generators (x í9 ..., x q ). We define the Fitting ideal of the 
module to be 


F k (E) = F k (x). 

Lemma 2.3 shows that the ideal is independent of the choice of presentation. 
The inclusion relations of a determinant ideal I r (A) of a matrix now translate 
into reverse inclusion relations for the Fitting ideals, namely: 

Proposition 2.4. 

(i) We have 

F 0 (F) <= F^F) c= F 2 (F) <= • • • 

(ii) // E can be generated by q elements , then 

F q (E) = R. 

(iii) If E is finitely presented then F k (E) is finitely generated for all k. 

This last statement merely repeats the property that the determinant ideals of a 
matrix can be generated by the determinants associated with a finite submatrix 
if the row space of the matrix is finitely generated. 
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Example. Let E — R q be the free module of dimension q. Then: 


F>(E) = {" 


if 0 S k <q 
if k ^ q. 


This is immediate from the definitions and the fact that the only relation of a 
basis for E is the trivial one. 


The Fitting ideal F 0 (E) is called the zero-th or initial Fitting ideal. In some 
applications it is the only one which comes up, in which case it ìs called 4t the” 
Fitting ideal F(E) of E. It is the ideal generated by all q x q determinants in 
the matrix of relations of q generators of the module. 

For any module E we let ann R (E) be the annihilator of E in R , that is the 
set of elements a e R such that aE = 0. 


Proposition 2.5. Suppose that E can be generated by q elements. Then 
(ann R (E)) q c= F(E) c= ann K (E). 

In particular , if E can be generated by one element , then 

F(E) = ann R (E). 

Proof. Let x l9 ...,x q bt generators of E. Let a ì9 ... 9 a q be elements of R 
annihilating E. Then the diagonal matrix whose diagonal components are 
a l9 ..., a q is a matrix of relations, so the definition of the Fitting ideal shows 
that the determinant of this matrix, which is the product a x • • • a q lies in 
I q (E) c= F 0 (E). This proves the inclusion 

ann R (E) q <= F(E). 

Conversely, let A be a q x q matrix of relations between x l9 ... 9 x q . Then 
det(y4)x, = 0 for all i so det(^l) e ann R (E). Since F(E) is generated by such 
determinants, we get the reverse inclusion which proves the proposition. 

CoroIIary 2.6. Let E = R/a for some ideal a. Then F(E) = û. 

Proof. The module R/a can be generated by one element so the corollary 
is an immediate consequence of the proposition. 

Proposition 2.7. Let 

0 0 

be an exact sequence of finite R-moduies. For integers m,n^0we have 
£„(£')£„(£") <= £ m + „(£). 
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In particular for F = F 0 . 

F(F)F(F') c= F(E). 

Proof. We may assume E' is a submodule of E. We pick generators 
x j,..., x p of E' and elements y l9 .. . , y q in E such that their images ý[ 9 ... , y" q 
in E" generate E". Then (x, y ) is a family of generators for E. Suppose first that 
m ^ p and n rg q. Let A be a matrix of relations among y" with g 

columns. If (a l5 ..., a q ) is such a relation, then 

+ ••• + a q y q e E' 

so there exist elements b ì9 ..., b p e R such that 

Z fl «y« + Z bj x j = o. 

Thus we can find a matrix B with p columns and the same number of rows as 
A such that (£, A) is a matrix of relations of (x, y). Let C be a matrix of relations 
of (x l5 ..., x p ). Then 

B A 
C 0 

is a matrix of relations of (x, y). If D" is a (q — n) x (q — n) subdeterminant of 
A and D' is a (p — m) x (p — m) subdeterminant of C then D"D' is a 

(p + q — m — n) x (p + q — m — n) 
subdeterminant of the matrix 



and D"D' e F m+n (E). Since F m (E') is generated by determinants like D' and 
F n (E") is generated by determinants like D'\ this proves the proposition in the 
present case. 

Ifm > pandn > ^thenF m+w (£) = F m (E') = F n (E") = R so the proposition 
is trivial in this case. 

Say m ^ p and n > q. Then F n (E") = R = F q (E") and hence 

F m (E')F n (E") = F q (E")F m (E) c= F p+n (F) c= F m + rt (£) 

where the inclusion follows from the first case. A similar argument proves 
the remaining case with m > p and n ^ q. This concludes the proof. 

Proposition 2.8. Let E\ E" be finite R-modules. For any integer n ^ 0 we 
have 

F„(E’ © E") = X F r (E')F s (E"). 



XIX, §2 


FITTING IDEALS 745 


Proof. Let x x ,... , x p generate E l and y { ,. . . , y q generate E". Then (x, y) 
generate E' © E". By Proposition 2.6 we know the inclusion 

E F r (E')F s (E") <= F„(E’ 0 E"), 

so we have to prove the converse. If n ^ p -F q then we can take r ^ p and 
s ^ q in which case 

F r (E') = F S (E ") = F n (E) = R 

and we are done. So we assume n < p + q. A relation between (x, y) in the 
direct sum splits into a relation for (x) and a relation for (y). The matrix of 
relations for (x, y) is therefore of the form 


C = 



where A' is the matrix of relations for (x) and A" the matrix of relations for 
(y). Thus 

F n (E’ 0 E") = Y, Ip + q-n(C) 
c 

where the sum is taken over all matrices C as above. Let D be a 


(p + q - n) x (p + q - n) 


subdeterminant. Then D has the form 

D = 


B' 0 
0 B" 


where B' is a k' x (p — r) matrix, and B" is a k" x (q — s) matrix with some 
positive integers k', k", r, s satisfying 

k' + k" = p + q — n and r + s = n. 

Then D = 0 unless k' = p — r and k" = q - s. In that case 

D = det(£')det(£") e F r (E')F s (E"), 

which proves the reverse inclusion and concludes the proof of the proposition. 


CoroIIary 2.9. Let 

£ = 0 R/û, 

i= 1 

where û f is an ideal. Then F(E) = • • • a s . 

Proof. This is really a corollary of Proposition 2.8 and Corollary 2.6. 
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§3. UNIVERSAL DERIVATIONS 
AND THE DE RHAM COMPLEX 

In this section , all rings R , A, etc. are assumed commutative. 

Let A be an K-algebra and M an /1-module. By a derivation D: A -> M 
(over R) we mean an R-linear map satisfying the usual rules 

D(ab) = aDb + bDa. 

Note that D(l) = 2D(l) so D(l) = 0, whence D(R) = 0. Such derivations form 
an,4-moduleDerfl(+, M)inanatural way,whereaDisdefinedby(aD)(b) = aDb. 

By a universal derivation for A over R , we mean an ,4-module Q, and a 
derivation 

d:A^> Q 

such that, given a derivation D : A -> M there exists a unique y4-homomorphism 
/: Q -► M making the following diagram commutative: 



M 


It is immediate from the defìnition that a universal derivation ( d , Q) is uniquely 
determined up to a unique isomorphism. By definition, we have a functorial 
isomorphism 


Dern(i4, M) % Hom^Q, M). 


We shall now prove the existence of a universal derivation. 

The following general remark will be useful. Let 

be two homomorphisms of R-algebras, and let J be an ideal in B such that 
J 2 = 0. Assume that f x = / 2 mod J; this means that f(x) = f 2 (x) mod J for 
all x in A. Then 

is a derivation. This fact is immediately verified as follows: 


f 2 (ab) = f 2 (a)f 2 (b) = Iffa) + D(à)][_ffb) + D(b)] 

= fi(ab) + h(b)D(a) + Ma)D(b). 
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But the ,4-module structure of J is given via f x or/ 2 (which amount to the same 
thing in light of our assumptions on f l9 / 2 ), so the fact is proved. 

Let the tensor product be taken over R. 

Let rn^t: A (x) A -> A be the multiplication homomorphism, such that 
m A (a (x) b) = ab. Let J = Ker m^. We define the module of differentials 

^AIR = J/J\ 

as an ideal in (A (x) Á)/J 2 . The ,4-module structure will always be given via the 
embedding on the first factor: 

A -► A ® A by anû(g) 1. 

Note that we have a direct sum decomposition of X-modules 


and therefore 

Let 


A® A = (A ® 1) 0 J, 

(A ® A)/J 2 = (A ® 1) 0 J/J 2 . 


d: -► J/J 2 be the K-linear map a h-> 1 ® a — a (x) 1 mod J 2 . 


Taking/, h-» a 8) 1 and/ 2 :a i-> 1 ® a, we see that d = f 2 — f\. Hence d is 
a derivation when viewed as a map into J/J 2 . 

We note that J is generated by elements of the form 


'Lx i dy i - 

Indeed, if £ x t (x) y, g J, then by defìnition £ x,y, = 0, and hence 
Z ® yt = Z x,(l ® y t - yi ® 1), 

according to the ,4-module structure we have put on ,4 (x) ,4 (operation of A on 
the left factor.) 


Theorem 3.1. The pair (J/J 2 ,d) is universal for derivations of A. This 
means'. Given a derivation D: A —> M there exists a unique A-linear map 
f: J/J 2 —» M mahing the following diagram commutative. 


A ——» J/J 2 



M 
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Proof. There is a unique R-bilinear map 

f:A®A^>M given by x ® y i—► xDy , 

which is /l-linear by our definition of the /1-module structure on A (x) A. Then 
by definition, the diagram is commutative on elements of A, when we take / 
restricted to J, because 


/(1 ®y - y ® 1) = 

Since J/J 2 is generated by elements of the form x dy, the uniqueness of the map 
in the diagram of the theorem is clear. This proves the desired universal 
property. 

We may write the result expressed in the theorem as a formula 


Der R (A, M) % Hom A (J/J 2 , M). 


The reader will find exercises on derivations which give an alternative way of 
constructing the universal derivation, especially useful when dealing with 
finitely generated algebras, which are factors of polynomial rings. 

I insert here without proofs some further fundamental constructions, im- 
portant in differential and algebraic geometry. The proofs are easy, and provide 
nice exercises. 

Let R -► A be an R- algebra of commutative rings. For i ^ 0 define 

&A/R = A* ^A/Ri 

where Q^* = A. 

Theorem 3.2. There exists a unique sequence of R-homomorphisms 

d/ * &A/R 

such that for co e Q l and rj e Q j we have 

d(co a rj) = dco a // -f (— ìyco a drj. 

Furthermore d ° d = 0. 

The proof will be left as an exercise. 

Recall that a complex of modules is a sequence of homomorphisms 

. . . _^ ^ gì + l _> 

such that d l ° d l ' x = 0. One usually omits the superscript on the maps d. With 
this terminology, we see that the Q^ //? form a complex, called the De Rham 
complex. 
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Theorem 3.3. Let k be afield of characteristic 0, and letA = k[X u ..., X n \ 
be the polynomial ring in n variables. Then the De Rham complex 

0 * k ► A ► íì l A/k -►-> Cl A/k -> 0 

is exact. 

Again the proof will be left as an exercise. Hint : Use induction and 
integrate formally. 

Other results concerning connections will be found in the exercises below. 


§4. THE CLIFFORD ALGEBRA 

Let k be a field. By an algebra throughout this section, we mean a /:-algebra 
given by a ring homomorphism k—> A such that the image of k is in the center 
of A. 

Let E be a finite dimensional vector space over the field k , and let g be a 
symmetric form on E. We would like to find a universal algebra over k , in which 
we can embed E , and such that the square in the algebra corresponds to the value 
of the quadratic form in E. More precisely, by a Clifford algebra for g, we 
shall mean a /:-algebra C(g), also denoted by C g (E), and a linear map 
p: E —> C(g) having the following property: If i/j : E —> L is a linear map of E 
into a /:-algebra L such that 

\fj() c) 2 = *) • 1 (1 = unit element of L) 

for all x e E, then there exists a unique algebra-homomorphism 

= »/'* : C(g) -» L 

such that the following diagram is commutative: 


E " * C(g) 



L 


By abstract nonsense, a Clifford algebra for g is uniquely determined, up to a 
unique isomorphism. Furthermore, it is clear that if (C(g\ p) exists, then C(g) 
is generated by the image of p, i.e. by p(E), as an algebra over k. 

We shall write p = p g if it is necessary to specify the reference to g explicitly. 
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We have trivially 


for all xeE, and 


p(x) 2 = g(x , x) • 1 


p(x)p(y) + p(y)p(x) = 2g(x, y) • 1 
as one sees by replacing x by x + y in the preceding relation. 


Theorem 4.1. Let g be a symmetric bilinear form on a finite dimensional 
vector space E over k. Then the Clifford algebra (C(g), p) exists. The map p 
in injective, and C(g) has dimension 2 n over k, if n = dim E. 

Proof. Let T(E) be the tensor algebra as in Chapter XVI, §7. In that algebra, 
we let I g be the two-sided ideal generated by all elements 

x ® x - g(x , jc) • 1 for x e E. 

We define C g (E) = T(E)/I g . Observe that E is naturally embedded in T(E) since 
T(E) = *®£©(£® £)©•••. 

Then the natural embedding of E in TE followed by the canonical homomorphisms 
of T(E) onto C g (E) defines our k- linear map p : £ —» C g (E). It is immediate from 
the universal property of the tensor product that C g (E) as just defined satisfies 
the universal property of a Clifford algebra, which therefore exists. The only 
problem is to prove that it has the stated dimension over k. 

We first prove that the dimension is ^ 2 n . We give a proof only when 
the characteristic of k is ± 2 and leave characteristic 2 to the reader. Let 
{i?!,. . . , v n } be an orthogonal basis of E as given by Theorem 3.1 of Chapter 
XV. Let e ( = i/^( v t ), where (/>:£—»£ is given as in the beginning of the sec- 
tion. Let c, = g(v h v { ). Then we have the relations 

ej = Cp c,- ej = —ejei for all i =# j. 

This immediately implies that the subalgebra of L generated by ij/(E) over k is 
generated as a vector space over k by all elements 

ep • • • e v n n with = 0 or 1 for i = 1,. . . , n. 

Hence the dimension of this subalgebra is ^ 2 n . In particular, dim C g (E) ^ 2 n 
as desired. 

There remains to show that there exists at least one ij/: E —> L such that L 
is generated by ij/(E) as an algebra over k , and has dimension 2 n \ for in that 
case, the homomorphism ij/* : C g (E) —» L being surjective, it follows that dim 
C g (E) ^ 2 n and the theorem will be proved. We construct L in the following 
way. We first need some general notions. 

Let M be a module over a commutative ring. Let /, j e Z/2Z. Suppose M 
is a direct sum M = M 0 © M x where 0, 1 are viewed as the elements of Z/2Z. 
We then say that M is Z/2Z-graded. If M is an algebra over the ring, we say 
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it is a Z/2Z-graded algebra if M t Mj C M i+ j for all /, j e Z/2Z. We simply 
say graded, omitting the Z/2Z prefix when the reference to Z/2Z is fixed 
throughout a discussion, which will be the case in the rest of this section. 

Let A, B be graded modules as above, with A = A 0 © A x and B = B 0 © B x . 
Then the tensor product A ® B has a direct sum decomposition 

A ® B = 0 A, , ® Bj. 
ij 

We define a grading on A ® B by letting (A ® 5 ) 0 consist of the sum over indices 
/, j such that /+7 = 0 (in Z/2Z), and (A ® 5)j consist of the sum over the 
indices /, j such that / + 7 = 1 . 

Suppose that A,B are graded algebras over the given commutative ring. There 
is a unique bilinear map of A ® B into itself such that 

(a ® 6)(fl' ® b') = (-1)'W 0 tó' 

if a' e A t and b e Bj. Just as in Chapter XVI, § 6 , one verifies associativity and 
the fact that this product gives rise to a graded algebra, whose product is called 
the super tensor product, or super product. As a matter of notation, when we 
take the super tensor product of A and B , we shall denote the resulting algebra 
by 

A® SU B 

to distinguish it from the ordinary algebra A ® B of Chapter XVI, § 6 . 

Next suppose that E has dimension 1 over k. Then the factor polynomial ring 
k[X]/(x 2 — c x ) is immediately verified to be the Clifford algebra in this case. 
We let t x be the image of X in the factor ring, so C g (E ) = k[t{\ with t\ = c x . 
The vector space E is imbedded as kt x in the direct sum k © kt x . 

In general we now take the super tensor product inductively: 

C g (E) = *[/,] ® su k[t 2 ] ® su ■ ■ ■ ® su k[t„], with *[/,] = k[X]/(X 2 - a). 

Its dimension is 2 n . Then E is embedded in C g (E) by the map 

+ • • • + a„v n a { t x © • • • © a n t n . 

The desired commutation rules among t h tj are immediately verified from the 
definition of the super product, thus concluding the proof of the dimension of 
the Clifford algebra. 

Note that the proof gives an explicit representation of the relations of the 
algebra, which also makes it easy to compute in the algebra. Note further that 
the alternating algebra of a free module is a special case, taking c z = 0 for all 
/. Taking the c z to be algebraically independent shows that the alternating algebra 
is a specialization of the generic Clifford algebra, or that Clifford algebras are 
what one calls perturbations of the alternating algebra. Just as for the alternating 
algebra, we have immediately from the construction: 

Theorem 4.2. Let g, g ' by symmetric forms on E , E ' respectively. Then we 
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have an algebra isomorphism 

C(g © g') « C(g) ® M C(g'). 

Examples. Clifford algebras have had increasingly wide applications in 
physics, differential geometry, topology, group representations (finite groups 
and Lie groups), and number theory. First, in topology I refer to Adams [Ad 62] 
and [ABS 64] giving applications of the Clifford algebra to various problems 
in topology, notably a description of the way Clifford algebras over the reals 
are related to the existence of vector fields on spheres. The multiplication in the 
Clifford algebra gives rise to a multiplication on the sphere, whence to vector 
fields. [ABS 64] also gives a number of computations related to the Clifford 
algebra and its applications to topology and physics. For instance, let E = R n 
and let g be the negative of the standard dot product. Or more invariantly, take 
for E an /z-dimensional vector space over R, and let g be a negative definite 
symmetric form on E. Let C n = C(g). 

The operation 

v x ® • • • ® v r i-» v r ® • • • 0 v x = (v x ® • • • ® v r )* for v t e E 

induces an endomorphism of T r (E) for r ^ 0. Siríce v 0 v — g(v , v) • 1 (for 
v e E) is invariant under this operation, there is an induced endomorphism 
* : C n —> C n , which is actually an involution, that is jc** = jc and (jcy)* = y*jc* 
for x e C n . We let Spin(«) be the subgroup of units in C n generated by the unit 
sphere in E (i.e. the set of elements such that g(v, v) = -1), and lying in the 
even part of C n . Equivalently, Spin (n) is the group of elements jc such that 
jcjc* = 1. The name dates back to Dirac who used this group in his study of elec- 
tron spin. Topologists and others view that group as being the universal cover- 
ing group of the special orthogonal group SO(n) = SU n ( R). 

An account of some of the results of [Ad 62] and [ABS 64] will also be 
found in [Hu 75], Chapter 11. Second I refer to two works encompassing two 
decades, concerning the heat kernel, Dirac operator, index theorem, and number 
theory, ranging from Atiyah, Bott and Patodi [ABP 73] to Faltings [Fa 91], see 
especially §4, entitled “The local index theorem for Dirac operators”. The vector 
space to which the general theory is applied is mostly the cotangent space at a 
point on a manifold. I recommend the book [BGV 92], Chapter 3. 

Finally, I refer to Brocker and Tom Dieck for applications of the Clifford 
algebra to representation theory, starting with their Chapter I, §6, [BtD 85]. 
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EXERCISES 

1 . Let E be a finite dimensional vector space over a field k. Let x { ,..., x p be elements of E 
such that a • • • a x p # 0, and similarly a • • • a y p ^ 0. If c e k and 

a * • • a x p = cy, A ' ‘ ‘ a y p 

show that x x , . . . , x p and y l9 .. ., y p generate the same subspace. Thus non-zero 
decomposable vectors in f\ p E up to non-zero scalar multiples correspond to 
p-dimensional subspaces of E. 

2. Let E be a free module of dimension n over the commutative ring R. Let /:£-►£ 
be a linear map. Let a r (/) = tr /\ r (/), where /\ r (f) is the endomorphism of /\ r (E) 
into itself induced by /. We have 

oL 0 (f) = 1, aj(/) = tr(/), a n (/) = det /, 

and a r (/) = 0 if r > n. Show that 

det(l + /) = X <+(/). 

r^O 

[ifmí: As usual, prove the statement when / ìs represented by a matrix with variable 
coefficients over the integers.] Interpret the a r (/) ìn terms of the coefficients of the 
characteristic polynomial of /. 

3. Let E be a finite dimensional free module over the commutative ring R. Let £ v be 
its dual module. For each integer r ^ 1 show that f\ r E and /\ r £ v are dual modules 
to each other, under the bilinear map such that 

(i>i a • • • a v r , v\ A • • • a v' r ) i-> det ((Vi, v'j)) 
where (v n v'j) is the value of v'j on v h as usual, for v { E E and v'j E £ v . 

4. Notation being as in the preceding exercise, let F be another R-module which is free, 
finite dimensional. Let /: E -> F be a linear map. Relative to the bilinear map of the 
preceding exercise, show that the transpose of /\ r f is f\ r (f), i.e. is equal to the r-th 
alternating product of the transpose of /. 

5. Let R be a commutative ring. If E is an /?-module, denote by L r a (E) the module of 
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r-multilinear alternating maps of E into R itself (i.e. the r-multilinear alternating 
forms on E). Let L%{E) = R , and let 


n(£) = © K(E). 

r = 0 

Show that fì(£) is a graded £-algebra, the multiplication being defined as follows. If 
ùj e L r a (E) and i p e L a (E\ and v u ..., v r+s are elements of £, then 

(co a .... t> r+s ) = £ e(<T)w(v„ u ..., V c ,)ìp(v„ r+ o. 

the sum being taken over all permutations o of (1,..., r + s) such that crl < • • • < or 
and o(r + 1) < ♦ • • < <75. 

Derivations 

In thefollowing exercises on derivations, all rings are assumed commutative. Among 
other things, the exercises give another proof of the existence of universal derivations. 

Let R -> A be a £-algebra (of commutative rings, according to our convention). 
We denote the module of universal derivations of A over R by (d A/R , fìi /jR ), but we do not 
assume that it necessarily exists. Sometimes we write d instead of d A/R for simplicity 
if the reference to A/R is clear. 

6. Let A = R[X J be a polynomial ring in variables X aì where a ranges over some 
indexing set, possibly infinite. Let fì be the free ,4-module on the symbols dX a , and let 

d:A - fì 


be the mapping defined by 




Show that the pair (d, fì) is a universal derivation (d A/R , fìi/*). 

7. Let A B be a homomorphism of £-algebras. Assume that the universal derivations 
for A/R, B/R, and B/A exist. Show that one has a natural exact sequence: 

B ®A &A/R -+ ^B/R -* &B/A 0. 

[Hint: Consider the sequence 

0 - Der Á (B, M) - Der*(fí, M) - Der R (A, M) 

which you prove is exact. Use the fact that a sequence of £-modules 

N' - N - N" - 0 


is exact if and only if its Hom into M ìs exact for every fí-module M. Apply this to the 
sequence of derivations.] 

8. Let R A be an fí-algebra, and let / be an ideal of A. Let fí = A/I. Suppose that the 
universal derivation of A over R exists. Show that the universal derivation of fí over R 
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also exists, and that there is a natural exact sequence 

l/I 2 ^ B ® A Sì l A/R -► Cì' BIR -> 0. 

[Hint’. Let M be a fî-module. Show that the sequence 

0 - Der*(£, M) - Der*(,4, M) - Hom B (/// 2 , M) 


is exact.] 

9. Let R —> B be an /?-algebra. Show that the universal derivation of B over R exists 
as follows. Represent B as a quotient of a polynomial ring, possibly in infinitely 
many variables. Apply Exercises 6 and 7. 

10. Let R -> A be an R-algebra. Let S 0 be a multiplicative subset of R, and S a multiplicative 
subset of A such that S 0 maps into S. Show that the universal derivation of S~ X A over 
S 0 l R is (d, S~ 1 Q 1 a/ r)> where 

d(a/s) = (sd A/R (a) - ad AIR (s))/s 2 . 

11. Let B be an R-algebra and M a B-module. On B ® M define a product 

( b , x)(b\ y) = ( bb', by 4- b'x). 

Show that B ® M is a fî-algebra, if we identify an element be B with (b, 0). For any 
R-algebra A , show that the algebra homomorphisms Hom Alg//î (/l, B®M) consist of 
pairs ((p, D), where (p:A->B is an algebra homomorphism, and D.A^M is a 
derivation for the /l-module structure on M induced by cp. 

12. Let /1 be an R-algebra. Let £: A -> R be an algebra homomorphism, which we call an 
augmentation. Let M be an R-module. Define an /1-module structure on M via £, by 

a • x = s(a)x for a g A and xe M. 

Write M £ to denote M with this new module structure. Let: 

Der £ (/1, M) = /1-module of derivations for the £-module structure on M 


/ = Ker £. 


Then Der c (/1, M) is an /l//-module. Note that there is an R-module direct sum de- 
composition A = R ® /. Show that there is a natural /1-module isomorphism 

Qa/r/IQa/r ~ J/J 2 


and an R-module isomorphism 

Der e (/1, M) % Hom K (/// 2 , M). 

In particular, let rj: A -> /// 2 be the projection of A on I/I 2 relative to the direct sum 
decomposition A = R © /. Then rj is the universal £-derivation. 

Derivations and connections 

13. Let R -► A be a homomorphism of commutative rings, so we view A as an R-algebra. 
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Let E be an /t-module. A connection on E is a homomorphism of abelian groups 

V: E —> Qa/r E 
such that for a e A and xeE we have 

V(ax) = aV(x) -I- da (x) x, 

where the tensor product is taken over A unless otherwise specified. The kernel of V, 
denoted by E v , is called the submodule of horizontal elements, or the horizontal submodule 
of (£, V). 

(a) For any integer i ^ 1, define 


^a/r — A tÇÌ A/R- 

Show that V can be extended to a homomorphism of £-modules 
v,: ® £ — QíiJjf ® H 


by 


Vi(cu ® x) = daj (x) x -I- (— l)‘cu a V(x). 


(b) Define the curvature of the connection to be the map 
K = V, o V: £ Í2 2 a/r < 8,4 E. 

Show that K is an /1-homomorphism. Show that 

V i+ 1 ° Vi(cu (X) x) = CO A K(x) 


for co e ÇŶ A / R and x e E. 

(c) Let Der(A/R) denote the ,4-module of derivations of A into itself, over R. 
Let V be a connection on E. Show that V induces a unique A-linear map 

V: Der (A/R) - End R (E) 


such that 


S7(D)(ax) = D(a)x + aV(D)(x). 


(d) Prove the formula 

[VÍDO, V(D 2 )] - V([D lf DJ) = (Dj a D 2 )(K). r 

In this formula, the bracket is defìned by [/, #] = / ° # — g ° / for two endo- 
morphisms/, # of E. Furthermore, the right-hand side is the composed mapping 


P X p. 2 ^ p r) i A D; 

E -*• Íîx/R ® E - 


/1 <8 £ w £. 
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14. (a) For any derivation D of a ring A mto itself, prove Leibniz’s rule: 


D*(xy) = t ^D‘(x)D"~U 


(b) Suppose A has characteristic p. Show that D p is a derivation. 

15. Let A/R be an algebra, and let E be an /l-module with a connection V. Assume that R 
has characteristic p. Define 


by 


ì/f: Dqt(A/R) - Ená R (E) 


w>) = (V(D)y - V(D p ). 

Prove that i/^(D) is /1-linear. [Hmí: Use Leibniz’s formula and the definition of a 
connection.] Thus the image of ip is actually in Ená A (E). 

Some Clifford exercises 

16. Let C g (E) be the Clifford algebra as defined in §4. Define F,(C 9 ) = (k 4- £)', viewing 
E as embedded in C g . Define the similar object F t (/\£') in the alternating algebra. Then 
F i+l D F t in both cases, and we define the i-th graded module gr t = F t /F t _,. Show 
that there is a natural (functorial) isomorphism 

gr,(C 9 (£)) ^ gr,( A E). 

17. Suppose that k = R, so E is a real vector space, which we now assume of even 
dimension 2 m. We also assume that g is non-degenerate. We omit the index g since 
the symmetric form is now fixed, and we write C + , C" for the spaces of degree 0 
and 1 respectively in the Z/2Z-grading. For elements x, y in C + or C", define their 
supercommutator to be 

{x , y} = xy — (—l) (degjr)(degy) yjc. 

Show that F lm ~\ is generated by supercommutators. 

18. Still assuming g non-degenerate, let J be an automorphism of (E, g) (i.e. 
g(Jx, Jy) = g(x, y) for all x, y E E) such that J 2 = -id. Let £ c = C be the 
extension of scalars from R to C. Then E c has a direct sum decomposition 

E c = Ec © E c 

into the eigenspaces of J, with eigenvalues 1 and -1 respectively. (Proof?) There 
is a representation of E c on /\E C , i.e. a homomorphism E c —> EndeíFe) whereby 
an element of E c operates by exterior multiplication, and an element of E c operates 
by inner multiplication, defined as follows. 

For x' E E c there is a unique C-linear map having the effect 

r 

x'(x, A • • • A X r ) = -2 X (- l) i_ ' (x', Xi) X t A ' ' ' A Xi A • • • A X r . 
i= 1 
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Prove that under this operation, you get an isomorphism 

C g {E ) c End c (A £ c)- 

[Hint: Count dimensions.] 

19. Consider the Clifford algebra over R. The standard notation is C n if E = R n with 
the negative definite form, and C' n if E = R" with the positive definite form. Thus 
dim C n = dim C n = 2 n . 

(a) Show that 

Cj ~ C C 2 ~ H (the division ring of quatemions) 

C| ~ R x R C 2 ~ M 2 (R) (2x 2 matrices over R) 

20. Establish isomorphisms: 

C 0 R C - C x C; C 0 R H ~ Af 2 (C); H 0 R H * M 4 (R) 

where M d (F) = d x d matrices over F. For the third one, with H 0 H, define an 
isomorphism 

/: H 0 R H —» Hom R (H, H) » M 4 (R) 
by f(x 0 y)(z) = xzŷ, where if y = y 0 + y { i 4- y 2 j + y 3 k then 
9 = y o - y\‘ ~ yii - y^- 

21. (a) Establish isomorphisms 

Cfi + 2 % C' 0C2 an ^ C'n + 2 ** Cn ® C.2 • 

[//mí: Let {e |s . . ., e w+2 } be the orthonormalized basis with e? = — 1. Then for 
. the first isomorphism map e t h-> e\ 0 <?,e 2 for i = 1,. . . , rc and map e n+] , e w+2 
on 1 0 e, and 1 0 e 2 respectively.] 

(b) Prove that C n+S « C„ 0 M, 6 (R) (which is called the periodicity property). 

(c) Conclude that C n is a semi-simple algebra over R for all n. 

From (c) one can tabulate the simple modules over C n . See [ABS 64], reproduced 
in Husemoller [Hu 75], Chapter 11, §6. 
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In the forties and fifties (mostly in the works of Cartan, Eilenberg, MacLane, 
and Steenrod, see [CaE 57]), it was realized that there was a systematic way of 
developing certain relations of linear algebra, depending only on fairly general 
constructions which were mostly arrow-theoretic, and were affectionately called 
abstract nonsenseby Steenrod. (For a more recent text, see [Ro 79].) The results 
formed a body of algebra, some of it involving homological algebra, which had 
arisen in topology, algebra, partial differential equations, and algebraic geometry. 
In topology, some of these constructions had been used in part to get homology 
and cohomology groups of topological spaces as in Eilenberg-Steenrod [ES 52]. 
In algebra, factor sets and 1 -cocycles had arisen in the theory of group extensions, 
and, for instance, Hilbert’s Theorem 90. More recently, homological algebra 
has entered in the cohomology of groups and the representation theory of groups. 
See for example Curtis-Reiner [CuR 81], and any book on the cohomology of 
groups, e.g. [La 96], [Se 64], and [Sh 72]. Note that [La 96] was written to pro- 
vide background for class field theory in [ArT 68]. 

From an entirely different direction, Leray developed a theory of sheaves 
and spectral sequences motivated by partial differential equations. The basic 
theory of sheaves was treated in Godement’s book on the subject [Go 58]. 
Fundamental insights were also given by Grothendieck in homological algebra 
[Gro 57], to be applied by Grothendieck in the theory of sheaves over schemes 
in the fifties and sixties. In Chapter XX, I have included whatever is necessary 
of homological algebra for Hartshorne’s use in [Ha 77]. Both Chapters XX and 
XXI gi ve an appropriate background for the homological algebra used in Griffiths- 
Harris [GrH 78], Chapter 5 (especially §3 and §4), andGunning [Gu90]. Chapter 

XX carries out the general theory of derived functors. The exercises and Chapter 

XXI may be viewed as providing examples and computations in specific concrete 
instances of more specialized interest. 
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The commutative algebra of Chapter X and the two chapters on homological 
algebra in this fourth part also provide an appropriate background for certain 
topics in algebraic geometry such as Serre’s study of intersection theory [Se 65], 
Grothendieck duality, and Grothendieck’s Riemann-Roch theorem in algebraic 
geometry. See for instance [SGA 6]. 

Finally I want to draw attention to the use of homological algebra in certain 
areas of partial differential equations, as in the papers of Atiyah-Bott-Patodi and 
Atiyah-Singer on complexes of elliptic operators. Readers can trace some of the 
literature from the bibliography given in [ABP 73]. 

The choice of material in this part was to a large extent motivated by all the 
above applications. 

For this chapter, considering the number of references and cross-references 
given, the bibliography for the entire chapter is placed at the end of the chapter. 



CHAPTER XX _ 

General Homology Theory 


To a large extent the present chapter is arrow-theoretic. There is a substantial 
body of linear algebra which can be formalized very systematically, and con- 
stitutes what Steenrod called abstract nonsense, but which provides a well-oiled 
machinery applicable to many domains. References will be given along the way. 

Most of what we shall do applies to abelian categories, which were mentioned 
in Chapter III, end of §3. However, in first reading, I recommend that readers 
disregard any allusions to general abelian categories and assume that we are 
dealing with an abelian category of modules over a ring, or other specific abelian 
categories such as complexes of modules over a ring. 


§1. COMPLEXES 


Let A be a ring. By an open complex of >l-modules, one means a sequence 
of modules and homomorphisms {(E\ d')} 9 

I 1 Í _ \ ^ £i + * _► 

where i ranges over all integers and d, maps E l into E l+1 , and such that 

d l o d l - 1 = 0 


for all L 

One frequently considers a finite sequence of homomorphisms, say 


E - >E r 


761 






762 GENERAL HOMOLOGY THEORY 


XX, §1 


such that the composite of two successive ones is 0, and one can make this 
sequence into a complex by inserting 0 at each end: 

-► 0 -► 0 -► E 1 -►-> E r -► 0 -► 0 -► 


Such a complex is called a finite or bounded complex. 

Remark. Complexes can be indexed with a descending sequence of integers, 
namely, 


E h 


EAE:. 


When that notation is used systematically, then one uses upper indices for 
complexes which are indexed with an ascending sequence of integers: 


£i~ 1 £‘ 4 E i+ 1 


In this book, I shall deal mostly with ascending indices. 

As stated in the introduction of this chapter, instead of modules over a ring, 
we could have taken objects in an arbitrary abelian category. 

The homomorphisms d l are often called differentials, because some of the 
fìrst complexes which arose in practice were in analy sis, with differential operators 
and differential forms. C/. the examples below. 

We denote a complex as above by (£, d ). If the complex is exact, it is often 
useful to insert the kernels and cokernels of the differentials in a diagram as 
follows, letting M, = Ker d l = Im í' 1 . 

-> £ 1-2 -*£«- 1 -).£» -►£»+ 1 -► 


M i_ 1 M 1 M i+l 



Thus by definition, we obtain a family of short exact sequences 

0 -► M 1 -► E l -► M i+1 -►0. 

If the complex is not exact, then of course we have to insert both the image of 
d l ~ 1 and the kernel of d'. The factor 

(Ker d‘)/(Im d l_1 ) 


will be studied in the next section. It is called the homology of the complex, 
and measures the deviation from exactness. 
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Let M be a module. By a resolution of M we mean an exact sequence 

£ 0 —► M —► 0. 

Thus a resolution is an exact complex whose furthest term on the right before 
0 is M. The resolution is indexed as shown. We usually write E M for the part of 
complex formed only of the E- s, thus: 

E m is. —► E n —► E n _ j —► • • • —► Eq 9 

stopping at E 0 . We then write E for the complex obtained by sticking 0 on 
the right: 

E is: -►£„-► j -►•••—► £ 0 —► 0. 

If the objects £, of the resolution are taken in some family, then the resolution is 
qualified in the same way as the family. For instance, if £, is free for all i ^ 0 
then we say that the resolution is a free resolution. If £ f is projective for all 
i ^ 0 then we say that the resolution is projective. And so forth. The same 
terminology is applied to the right, with a resolution 

0 - M -► £° -► £' -►-► £"" 1 -► E n -►, 


also written 

0 -► M -► £ m . 

We then write £ for the complex 

0 - £° - E 1 - £ 2 -► • • •. 

See §5 for injective resolutions. 

A resolution is said to be finite if £ t (or £') = 0 for all but a finite number of 
indices i. 

Example. Every module admits a free resolution (on the left). This is a 
simple application of the notion of free module. Indeed, letM be a module, and 
let {xj} be a family of generators, with j in some indexing set J. For each j let 
Rej be a free module over R with a basis consisting of one element ej. Let 

F = ®Rej 

jeJ 

be their direct sum. There is a unique epimorphism 

£ -► M -> 0 

sending e } on x j9 Now we let M x be the kernel, and again represent M x as the 
quotient of a free module. Inductively, we can construct the desired free 
resolution. 



764 GENERAL HOMOLOGY THEORY 


XX, §1 


Example. The Standard Complex. Let S be a set. For / = 0, 1,2,... 
let E ( be the free module over Z generated by (/ + l)-tuples (* 0 , • • • , +•) with 
x 0 ,. . . , jc z e S. Thus such (/ + l)-tuples form a basis of E t over Z. There is a 
unique homomorphism 

di+ 1 : ^i+i 

such that 

i+1 

£? (+ 1 (Ar 0 , • • • , •ïi+l) = 2 (-l) y W • • • . Xj, ..., x i+ì ), 

y = 0 

where the symbol x y means that this term is to be omitted. For / = 0, we define 
d 0 :E 0 ^>Zto be the unique homomorphism such that d 0 (x 0 ) = 1. The map d 0 
is sometimes called the augmentation, and is also denoted by e. Then we obtain 
a resolution of Z by the complex 

£/+i • • * —> E 0 Z —► 0. 

The formalism of the above maps d ( is pervasive in mathematics. See Exercise 
2 for the use of the standard complex in the cohomology theory of groups. For 
still another example of this same formalism, compare with the Koszul complex 
in Chapter XXI, §4. 

Given a module M, one may form Hom^, M) for each /, in which case one 
gets coboundary maps 

S': Hom (E h M) Hom(£ /+1 , M), S(f) = /° d í + 1 , 

obtained by composition of mappings. This procedure will be used to obtain 
derived functors in §6. In Exercises 2 through 6, you will see how this procedure 
is used to develop the cohomology theory of groups. 

Instead of using homomorphisms, one may use a topological version with 
simplices, and continuous maps, in which case the standard complex gives rise to 
the singular homology theory of topological spaces. See [GreH 81], Chapter 9. 

Examples. Finite free resolutions. In Chapter XXI, you will find other 
examples of complexes, especially finite free, constructed in various ways with 
different tools. This subsequent entire chapter may be viewed as providing 
examples for the current chapter. 

Examples with differential forms. In Chapter XIX, §3, we gave the exam- 
ple of the de Rham complex in an algebraic setting. In the theory of differential 
manifolds, the de Rham complex has differential maps 

d l : fì /+1 , 

sending differential forms of degree / to those of degree / + 1, and allows for 
the computation of the homology of the manifold. 

A similar situation occurs in complex differential geometry, when the maps 
d l are given by the Dolbeault 5-operators 

d l : íÏPd -> fì^ / + 1 
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operating on forms of type ( p , /). Interested readers can look up for instance 
Gunning’s book [Gu 90] mentioned in the introduction to Part IV,Volume I, E. 
The associated homology of this complex is called the Dolbeault or d-cohom- 
ology of the complex manifold. 

Let us return to the general algebraic aspects of complexes and resolutions. 

It is an interesting problem to discuss which modules admit finite resoutions, 
and variations on this theme. Some conditions are discussed later in this chapter 
and in Chapter XXI. If a resolution 

0 —► E n —► E n _! £ 0 —► M —► 0 

is such that E m = 0 for m > n, then we say that the resolution has length ^ n 
(sometimes we say it has length n by abuse of language). 

A closed complex of /1-modules is a sequence of modules and homomorph- 
isms {(F, d 1 )} where i ranges over the set of integers mod n for some n ^ 2 
and otherwise satisfying the same properties as above. Thus a closed complex 
looks like this: 

E 1 -► E 2 -►-► E n 

We call n the Jength of the closed complex. 

Without fear of confusion, one can omit the index i on d l and write just d. 
We also write (E , d) for the complex {(F, d 1 )}, or even more briefly, we write 
simply E. 

Let (E , d) and (E', d' ) be complexes (both open or both closed). Let r be an 
integer. A morphism or homomorphism (of complexes) 

/:(£', d')-(£,d) 

of degree r is a sequence 

/: E ri -► E i + r 

of homomorphisms such that for all i the following diagram is commutative: 


£'(i- 1) f j - 1 y 1+r 



Just as we write d instead of d\ we shall also write j instead of f. If the com- 
plexes are closed, we define a morphism from one into the other only if they 
have the same length. 

It is clear that complexes form a category. In fact they form an abelian 
category. Indeed, say we deal with complexes indexed by Z for simplicity, and 
morphisms of degree 0. Say we have a morphism of complexes /: C -► C" or 
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putting the indices: 


C„ 

c: 


C„-l 

C'l-i 


We let C n = Ker(C„ -► C"). Then the family (C n ) forms a complex, which we 
define to be the kernel of /. We let the reader check the details that this and a 
similar definition for cokernel and finite direct sums make complexes of 
modules into an abelian category. At this point, readers should refer to Chapter 
III, §9, where kernels and cokernels are discussed in this context. The snake 
lemma of that chapter will now become central to the next section. 

It will be useful to have another notion to deal with objects indexed by a 
monoid. Let G be a monoid, which we assume commutative and additive to 
fit the applications we have in mind here. Let {Mi} ieG be a family of modules 
indexed by G. The direct sum 

M = © M ; 

ieG 


will be called the G-graded module associated with the family {Mi} ieG . Let 
{Mi} ieG and {M-} ieG be families indexed by G, and let M , M' be their asso- 
ciated G-graded modules. Let r e G. By a G-graded morphism f:M'-+M of 
degree r we shall mean a homomorphism such that /maps M\ into M i + r for 
each i e G (identifying M, with the corresponding submodule of the direct 
sum on the i-th component). Thus / is nothing else than a family of homo- 
morphisms f :M\ -► M i+r . 

If (£, d) is a complex we may view E as a G-graded module (taking the direct 
sum of the components of the complex), and we may view d as a G-graded 
morphism of degree 1, letting G be Z or Z/nZ. The most common case we en- 
counter is when G = Z. Then we write the complex as 

E = @E i9 and d:E^E 

maps E into itself. The differential d is defined as d t on each direct summand 
E i9 and has degree 1. 

Conversely, if G is Z or Z/nZ, one may view a G-graded module as a com- 
plex, by defining d to be the zero map. 

For simplicity, we shall often omit the prefix “ G-graded ” in front of the word 
“morphism”, when dealing with G-graded morphisms. 
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§2. HOMOLOGY SEQUENCE 

Let (£, d) be a complex. We let 

Z\E) = Ker d l 

and call Z\E) the module of i-cycles. We let 

B\E) = Im d 1 ' 1 

and call B\E) the module of i-boundaries. We frequently write Z* and B l 
instead of Z\E) and B\E\ respectively. We let 

H\E) = Z'/B' = Ker dyim d l ~ \ 

and call H\E) the i-th homology group of the complex. The graded module 
associated with the family {H'} will be denoted by //(£), and will be called the 
homology of E . One sometimes writes H*(E) instead of H(E). 

If /:£'-►£ is a morphism of complexes, say of degree 0, then we get an 

induced canonical homomorphism 

H\f) : H\E f ) H\E) 

on each homology group. Indeed, from the commutative diagram defining a 
morphism of complexes, one sees at once that/maps Z\E f ) into Z\E) and B\E f ) 
into B\E), whence the induced homomorphism H\f). Compare with the begin- 
ning remarks of Chapter III, §9. One often writes this induced homomorphism 
as f* rather than ////), and if H(E) denotes the graded module of homology as 
above, then we write 

//(/) = /*://(£') ^//(£). 

We call H(f) the map induced by /on homology. If H\f) is an isomorphism 
for all /, then we say that/is a homology isomorphism. 

Note that if/: E f —> E and g : E —> E ,f are morphisms of complexes, then it 
is immediately verified that 

H(g) o H(f) = H(g o/) and H( id) = id. 

Thus H is a functor from the category of complexes to the category of graded 
modules. 

We shall consider short exact sequences of complexes with morphisms of 
degree 0: 

0 ->• £' 4 E 4 E" -► 0, 
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which written out in full look like this: 


0-> E ,(i ~ l) -> E'~ 1 -> l) -> 0 



One can define a morphism 

ô : H(E") - H(E) 

of degree 1, in other words, a family of homomorphisms 

ô l : H ni -► H ,{i+1) 

by the snahe lemma. 

Theorem2.1. Let 

0 E' 4 E 4 E" -► 0 

be an exact sequence of complexes withf g of degree 0. Then the sequence 

———► H(E') 



H(E") 


is exact. 

This theorem is merely a special application of the snake lemma. 

If one writes out in full the homology sequence in the theorem, then it looks 
like this: 
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It is clear that our map ò is functorial (in an obvious sense), and hence that 
our whole structure ( H, ò ) is a functor from the category of short exact sequences 
of complexes into the category of complexes. 


§3. EULER CHARACTERISTIC AND THE 
GROTHENDIECK GROUP 

This section may be viewed as a continuation of Chapter III, §8, on Euler- 
Poincaré maps. Consider complexes of A-modules, for simplicity. 

Let E be a complex such that almost all homology groups H l are equal to 0. 
Assume that E is an open complex. As in Chapter III, §8, let cp be an Euler- 
Poincaré mapping on the category of modules (i.e. A-modules). We define the 
Euler-Poincaré characteristic Xq>(E) ( or more briefly the Euler characteristic) 
with respect to <p, to be 

*«>(£) = I(- 1)W) 

provided cp(H l ) is defined for all H\ in which case we say that Xq> is defìned for the 
complex E. 

If E is a closed complex, we select a definite order (E 1 ,..., E n ) for the integers 
mod n and define the Euler characteristic by the formula 

Xv(E) = Ì(-iycp(H‘) 

i= 1 

provided again all (p(H') are defìned. 

For an example, the reader may refer to Exercise 28 of Chapter I. 

One may view H as a complex, defining d to be the zero map. In that case, 
we see that xJiH) is the alternating sum given above. More generally: 

Theorem 3.1. Let F be a complex, which is of even length if it is closed . 
Assume that (p(F l ) is defined for all i, (p(F l ) = 0 for almost all i, and H l (F) = 0 
for almost all i. Then x^JF) is defined, and 

i 

Proof. Let Z l and B 1 be the groups of /-cycles and /-boundaries in F l 
respectively. We have an exact sequence 


0 -► Z l -► F l -► B i+l -► 0. 


Hence x<p(F) ÌS defined, and 

(p(F‘) = (p(Z‘) + (p(B ‘ +1 ). 
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Taking the alternating sum, our conclusion follows at once. 

A complex whose homology is trivial is called acyclic. 

Corollary 3.2. Let F be an acyclic complex , such that cp(F l ) is defined for 
all /, and equal to 0 for almost all i. ìf F is closed , we assume that F has even 
length. Then 


Xv(F) = 0. 

In many applications, an open complex F is such that F l = 0 for almost 
all i, and one can then treat this complex as a closed complex by defining an 
additional map going from a zero on the far right to a zero on the far left. Thus 
in this case, the study of such an open complex is reduced to the study of a 
closed complex. 

Theorem 3.3. Let 

0 0 

be an exact sequence of complexes , with morphisms of degree 0. // the com- 
plexes are closed , assume that their length is even. Let cp be an Euler-Poincaré 
mapping on the category of modules. If x<p 15 defined for two of the above 
three complexes , then it is defined for the third , and we have 

X(P (E) = xJLE) + XJE"). 

Proof We have an exact homology sequence 


_> h hi- d _ H i _> H i _> H "i H 'd+ 1) _ 


This homology sequence is nothing but a complex whose homology is trivial. 
Furthermore, each homology group belonging say to E is between homology 
groups of E' and E". Hence if Xq> is defìned for E' and E" it is defined for E. 
Similarly for the other two possibilities. If our complexes are closed of even 
length n, then this homology sequence has even length 3n. We can therefore 
apply the corollary of Theorem 3.1 to get what we want. 

For certain applications, it is convenient to construct a universal Euler 
mapping. Let (î be the set of isomorphism classes of certain modules. If E is a 
module, let [£] denote its isomorphism class. We require that (ï satisfy the 
Euler-Poincaré condition, i.e. if we have an exact sequence 

0 E' -> E -> E" -> 0, 

then [£] is in (î if and only if [£'] and [£"] are in (2. Furthermore, the zero 
module is in (î. 
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Theorem 3.4. Assume that (i satisfies the Euler-Poincaré condition. Then 
there is a map 

y:a-K(G) 

of (3 into an abelian group K(G) having the universal property mth respect to 
Euler-Poincaré maps defined on (3. 

To construct this, let F ab ((3) be the free abelian group generated by the set of 
such [F]. Let B be the subgroup generated by all elements of type 

[£] - [L'] - [£"], 

where 

0 - E' - E - E" - 0 


is an exact sequence whose members are in (3. We let K((3) be the factor group 
F ab (a)/B, and let y : a — K((ì) be the natural map. It is clear that y has the 
universal property. 

We observe the similarity of construction with the Grothendieck group of a 
monoid. In fact, the present group is known as the Euler-Grothendieck group 
of (3, with Euler usually left out. 

The reader should observe that the above arguments are valid in abelian 
categories, although we still used the word module. Just as with the elementary 
isomorphism theorems for groups, we have the analogue of the Jordan-Hôlder 
theorem for modules. Of course in the case of modules, we don’t have to worry 
about the normality of submodules. 

We now go a little deeper into K-theory. Let (3 be an abelian category. In 
first reading, one may wish to limit attention to an abelian category of modules 
over a ring. Let C bea family of objects in (3. We shall say that (B is a K-family 
if it satisfies the following conditions. 


K 1. G is closed under taking finite direct sums, and 0 is in G. 
K 2. Given an object E in (3 there exists an epimorphism 


L-E-0 


with L in G. 

K 3. Let E be an object admitting a finite resolution of length n 


0 — L n — ■ • • — Lq — E — 0 
with L t e G for all i. If 

0-JV-F..J - ...-F o -L-0 


is a resolution with N in (3 and F 0 ,..., F„_ x in C, then N is also in G. 
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We note that it follows from these axioms that if £ is in Q and F' is iso- 
morphic to E, then F' is also in 6, as one sees by looking at the resolution 

0-F->F-0->0 

and applying K 3. Furthermore, given an exact sequence 

0 - F' - F - F" - 0 

with F and F" in C, then F' is in C, again by applying K 3. 

Example. One may take for Q the category of modules over a commutative 
ring, and for G the family of projective modules. Later we shall also consider 
Noetherian rings, in which case one may take finite modules, and finite pro- 
jective modules instead. Condition K 2 will be discussed in §8. 

From now on we assume that G is a K-family. For each object E in û, we 
let [£] denote its isomorphism class. An object E of Q will be said to have 
finite C-dimension if it admits a finite resolution with elements of G. We let 
(î(C) be the family of objects in Q which are of finite C-dimension. We may 
then form the 


K(<3(e)) = z iQ(e)]/R(Q(e)) 

where R(Q(e)) is the group generated by all elements [F] — [£'] - [£"] 
arising from an exact sequence 

0 0 

in Q(G). Similarly we defìne 

K(C) = Z[(C)]/R(e), 

where R(G) is the group of relations generated as above, but taking £', £, £" 
in e itseìf. 

There are natural maps 

y«(e):û(e)-K(a(e)) and y e :e-K(C), 

which to each object associate its class in the corresponding Grothendieck 
group. There is also a natural homomorphism 

£:K(C)-K (Q(G)) 

since an exact sequence of objects of e can also be viewed as an exact sequence 
of objects of (î(e). 
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Theorem 3.5. Let M e d((3) and suppose we have two resolutions 
Lfcf ► M —► 0 and L' M -► M -► 0, 
by finite complexes L M and L M in C. Then 

Proofi Take first the special case when there is an epimorphism L' M -► L M , 
with kernel E illustrated on the following commutative and exact diagram. 

0 + E ► L m ► L m * 0 


M ——► M 

id 

0 0 


The kernel is a complex 

0 —► E n —► t —►•••—► E 0 —► 0 

which is exact because we have the homology sequence 

H p (E) - H p (L') - H p (L) - H p . X (E) 

For p ^ 1 we have H p (L) = H p (L') = 0 by defìnition, so H p (E) = 0 for p ^ 1. 
And for p = 0 we consider the exact sequence 

Hi(L) - // 0 (£) - H 0 (L') -+ H 0 (L) 

Now we have HfiL) = 0, and H 0 (L') -► H 0 (L) corresponds to the identity 
morphisms on M so is an isomorphism. It follows that H 0 (E) = 0 also. 

By definition of K-family, the objects E p are in G. Then taking the Euler 
characteristic in K(C) we find 

X(L') - x(L) = x(E) = 0 

which proves our assertion in the special case. 

The general case follows by showing that given two resolutions of M in Q 
we can always find a third one which tops both of them. The pattern of our 
construction will be given by a lemma. 
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Lemma 3.6. Given txvo epimorphisms u : M -► N and v:M' -► N in <î, 
there exist epimorphisms F -> M and F -> M' xvith F in C mahing thefolloxving 
diagram commutative. 



Proof Let E — M x N M\ that is E is the kernel of the morphism 

M x M' ->N 

given by (x, y) h -> ux — vy. (Elements are not really used here, and we could 
write formally u — v instead.) There is some F in G and an epimorphism 
F -► E -► 0. The composition of this epimorphism with the natural projections 
of E on each factor gives us what we want. 

We construct a complex giving a resolution of M with a commutative 
and exact diagram: 

0 


L m ->AÍ-►O 

id 

L" m - >M -♦ 0 

id 

L' m - *M - 


0 

The construction is done inductively, so we put indices: 

Li - *Lì-x -♦ 

L" -♦ L ';_,-♦ 

Li ♦ Lj_ j ♦ 
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Suppose that we have constructed up to L"_ j with the desired epimorphisms on 
L,_! and Lí-j. We want to construct L". Let B { = Ker(L,_^ -► L,_ 2 ) and 
similarly for B\ and B'[. We obtain the commutative diagram: 

L t * B, ► L,_ j ► L x -2 

b [ -► l ;'_ 1 -> l ?'_ 2 

Lì * B( ► L,_ ^ ► L, _ 2 

If B” -► B ( or BJ' -► B\ are not epimorphisms, then we replace L?'_ ^ by 

L?'_ i © L,- © L\. 

We let the boundary map to L|'_ 2 be 0 on the new summands, and similarly 
define the maps to L,_ ^ and L;_ ^ to be 0 on L\ and L f _ j respectively. 

Without loss of generality we may now assume that 

B\' - Bi and B\' - B\ 

are epimorphisms. We then use the construction of the preceding lemma. 
We let 

£, = L, @ B , B" and E\ = L\. 

Then both £, and £J have natural epimorphisms on B". Then we let 

/V, = £,@ flr E\ 

and we find an object LJ' in G with an epimorphism LJ' -► N t . This gives us the 
inductive construction of L" up to the very end. To stop the process, we use 
K 3 and take the kernel of the last constructed L" to conclude the proof. 


Theorem 3.7. The natural map 

e : K(C) — K(a(G)) 

is an isomorphism. 

Proof. The map is surjective because given a resolution 
0 —► F n —► • • • —► F q — ► M —► 0 
with F, e C for all i, the element 


K-1 yvAFt) 
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maps on y Qie) (M) under e. Conversely, Theorem 3.5 shows that the association 

is a well-defined mapping. Since for any LeG we have a short exact sequence 
0 — > L —► T —► 0, it follows that this mapping following e is the identity on K(C), 
so e is a monomorphism. Hence e is an isomorphism, as was to be shown. 

It may be helpful to the reader actually to see the next lemma which makes 
the additivity of the inverse more explicit. 

Lemma 3.8. Given an exact sequence in Û(C) 

0 - M' -> M -► M" -► 0 

there exists a commutative and exact diagram 

0 * L m > * L m * L m >> ► 0 


0-► M' -► M -► M" -► 0 


0 0 0 


with finite resolutions L M , L M , L M » in G. 

Proof We first show that we can find L', L, L" in G to fit an exact and 
commutative diagram 

0-> L'-> L -► L"-> 0 


0->M'-> M ->M"- >0 


0 0 0 

We first select an epimorphism L" -> M" with L" in G. By Lemma 3.6 there 
exists Lj e G and epimorphisms Lj -> M, L x -> L" making the diagram com- 
mutative. Then let L 2 -> M' be an epimorphism with L 2 e6, and finally define 
L = Lj ® L 2 . Then we get morphisms L -> M and L -> L" in the obvious 
way. Let L' be the kernel of L -> L". Then L 2 c= L' so we get an epimorphism 
L' - M'. 
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This now allows us to construct resolutions inductively until we hit the 
n-th step, where n is some integer such that M, M" admit resolutions of length 
n in G. The last horizontal exact sequence that we obtain is 

0 -+ L'„-> L n -> L”0 

and L" can be chosen to be the kernel of L"_, -► L"_ 2 . By K 3 we know that 
L" lies in G, and the sequence 


0 


L'n “► L"_ J 


is exact. This implies that in the next inductive step, we can take L" + j = 0. 
Then 


0 L' n+ ^ -► L n+ 1 ► 0 ► 0 

is exact, and at the next step we just take the kernels of the vertical arrows to 
complete the desired finite resolutions in G. This concludes the proof of the 
lemma. 

Remark. The argument in the proof of Lemma 3.8 in fact shows: 

If 

0 - M' - M - M" - 0 

is an exact sequence in d , and if M, M" have finite G-dimension , then so does 
M'. 

In the category of modules, one has a more precise statement: 

Theorem 3.9. Let (i he the category of modules over a ring. Let (P be the 
family of projective modules. Given an exact sequence of modules 

0 -► E — E — L" 0 

if any two of E', E , E" admit finite resolutions in (P then the third does also. 

Proofs in a more subtle case will be given in Chapter XXI, Theorem 2.7. 

Next we shall use the tensor product to investigate a ring structure on the 
Grothendieck group. We suppose for simplicity that we deal with an abelian 
category of modules over a commutative ring, denoted by d, together with a K- 
family G as above, but we now assume that (i is closed under the tensor product. 
The only properties we shall actually use for the next results are the following 
ones, denoted by TG (for “tensor” and “Grothendieck” respectively): 

TG 1. There is a bifunctorial isomorphism giving commutativity 

M (x) jV « N (x) M 


for all M, N in d ; and similarly for distributivity over direct sums, 
and associativity. 
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TG 2. For all L in Q the functor M i—► L (g) M is exact. 

TG 3. If L, L' are in C then L (x) L' is in 0. 

Then we may give K(C) the structure of an algebra by defining 
cl e (L) cl e (L f ) = cl e (L (g) L'). 

Condition TG 1 implies that this algebra is commutative, and we call it the 
Grothendieck algebra In practice, there is a unit element, but if we want one in 
the present axiomatization, we have to make it an explicit assumption: 

TG 4. There is an object R in Q such that R (g) M % M for all M in <î. 

Then cl e (R) is the unit element. 

Similarly, condition TG 2 shows that we can define a module structure on 
K((î) over K(C) by the same formula 

cl e (L) cle(Af) = cl tt (L ® M), 

and similarly K (d(G)) is a module over K(C), where we recall that Û(C) is the 
family of objects in (2 which admit finite resolutions by objects in C. 

Since we know from Theorem 3.7 that K(C) ~ K(Û(C)), we also have a 
ring structure on K(G(<2)) via this isomorphism. We then can make the product 
more explicit as follows. 

Proposition 3.10. Let M e Q.(Q) and let N e Q. Let 

0 —► L n Lq —► M — ► 0 

be a finite resolution of M by objects in Q. Then 

cle(M) cl a (N) = X (- 1 y cle(L| <g) N). 

= Z(-l ) 1 cl t(HKK)) 


xvhere K is the complex 

0^>L n ®N-+-'-^>L o (g)N^>M®N^>0 
and Hi(K) is the i-th homology of this complex. 

Proof. The formulas are immediate consequences of the definitions, and of 
Theorem 3.1. 

Example. Let <3 be the abelian category of modules over a commutative 
ring. Let Q be the family of projective modules. From §6 on derived functors 
the reader will know that the homology of the complex K in Proposition 3.10 
is just Tor(M, N). Therefore the formula in that proposition can also be written 

cle(M) cl a (N) = X ( - !)• cl a (Tor,(M, JV)). 
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Example. Let k be a field. Let G be a group. By a (G, A:)-moduIe, we shall 
mean a pair (£, p), consisting of a /:-space E and a homomorphism 

p: G -> Aut k (E). 

Such a homomorphism is also called a representation of G in E. By abuse of 
language, we also say that the k -space E is a G-module. The group G operates 
on E , and we write ax instead of p(cr)x. The field k will be kept fixed in what 
follows. 

Let Mod*(G) denote the category whose objects are (G, /:)-modules. A mor- 
phism in Mod^G) is what we call a G-homomorphism, that is a k -linear map 
f:E—>F such that f(ax) = cr/(x) for all cr e G. The group of morphisms in 
Mod*(G) is denoted by Hom G . 

If E is a G-module, and ijgG, then we have by definition a /c-automorphism 
(7: E -► E. Since T r is a functor, we have an induced automorphism 

T r (cj): T r (T) - T r (E) 

for each r, and thus T r (T) is also a G-module. Taking the direct sum, we see 
that T(E) is a G-module, and hence that T is a functor from the category of 
G-modules to the category of graded G-modules. Similarly for /\ r , 5 r , and /\, S. 

It is clear that the kernel of a G-homomorphism is a G-submodule, and that 
the factor module of a G-module by a G-submodule is again a G-module so the 
category of G-modules is an abelian category. 

We can now apply the general considerations on the Grothendieck group 
which we write 


K(G) = K(Mod*(G)) 

for simplicity in the present case. We have the canonical map 

cl: Mod*(G) K(G). 

which to each G-module associates its class in K(G). 

If £, F are G-modules, then their tensor product over /c, E (x) T, is also a 
G-module. Here again, the operation of G on E ® F is given functorially. If 
c e G, there exists a unique /c-linear map E (g) F -> E (x) F such that for x e £, 
y e F we have x® y i—► (crx) ® (cy). The tensor product induces a law of 
composition on Mod*(G) because the tensor products of G-isomorphic modules 
are G-isomorphic. 

Furthermore all the conditions TG 1 through TG 4 are satisfied. Since k is a 
field, we find also that tensoring an exact sequence of G-modules over k with any 
G-module over k preserves the exactness, so TG 2 is satisfied for all (G, k)- 
modules. Thus the Grothendieck group K(G) is in fact the Grothendieck ring, 
or the Grothendieck algebra over k. 
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By Proposition 2.1 and Theorem 2.3 of Chapter XVIII, we also see: 

The Grothendiech ring of a finite group G consisting of isomorphism classes of 
finite dimensional (G, k)-spaces over a field k of characteristic 0 is naturally 
isomorphic to the character ring X Z (G ). 

We can axiomatize this a little more. We consider an abelian category of 
modules over a commutative ring R , which we denote by (ì for simplicity. For 
two modules M, N in <î we let Mor(M, N) as usual be the morphisms in <î, but 
Mor(M, N) is an abelian subgroup of Hom^M, N). For example, we could take 
<î to be the category of (G, /c)-modules as in the example we have just discussed, 
in which case Mor(M, N) - Hom G (M, N). 

We let C be the family of finite free modules in d. We assume that C satisfies 
TG 1, TG 2, TG 3, TG 4, and also that G is closed under taking alternating pro - 
ducts , tensor products and symmetric products. We let K = K(C). As we have 
seen, K is itself a commutative ring. We abbreviate cl e = cl. 

We shall define non-linear maps 

A 1 : K - K 

using the alternating product. If E is finite free, we let 

k\E) = cl(A %. 

Proposition 1.1 of Chapter XIX can now be formulated for the K-ring as follows. 


Proposition 3.11. Let 


0 - E' - E - E" - 0 

be an exact sequence of finite free modules in d. Then for every integer n ^ 0 
we have 


A"(£)= 

i = 0 

As a result of the proposition, we can define a map 

A,: K -► 1 + íK[[í]] 

of K into the multiplicative group of formal power series with coefficients in K, 
and with constant term 1, by letting 


00 


A,(x) = X AXx)t‘. 

i = 0 
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Proposition 1.4 of Chapter XIX can be formulated by saying that: 

The map 

A t : K -> 1 + íK[[í]] 

is a homomorphism. 

We note that if L is free of rank 1, then 

A°(L) = ground ring; 

A\L) = cl(L); 

A\L) = 0 for i > 1. 

This can be summarized by writing 

A t (L) = 1 + cl(L)í. 

Next we can do a similar construction with the symmetric product instead of 
the alternating product. If E is a finite free module in C we let as usual: 

S(E) = symmetric algebra of L; 

S l (E) = homogeneous component of degree i in S(E). 

We define 

o\E) = cl (S\E)) 

and the corresponding power series 

a t (E) = £ a\E)t\ 

Theorem 3.12. Let Ebe a finite free module in (£, of rank r. Then for all 
integers n ^ 1 we have 

X(-iym)a ni (E) = o, 

i = o 

where by definition o j (E) = 0 for j < 0. Furthermore 

o t (E)A- t (E) = 1, 

so the poxver series o t (E) and A_ t (E) are inverse to each other. 

Proof. The first formula depends on the analogue for the symmetric product 
and the alternating product of the formula given in Proposition 1.1 of Chapter 
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XIX. It could be proved directly now, but the reader will find a proof as a special 
case of the theory of Koszul complexes in Chapter XXI, Corollary 4.14. The 
power series relation is essentially a reformulation of the first formula. 

From the above formalism, it is possible to define other maps besides À z and 
< t 1 . 


Example. Assume that the group G is trivial, and just write K for the 
Grothendieck ring instead of K(l). For x e K define 

d 

ý- t (x) = -t — \og \'(x) = -t \,(x)/\ t (x). 

Show that ifj-t is an additive and multiplicative homomorphism. Show that 

= 1 4- cl(£)/ 4 cl(£) 2 / 2 + • • • . 

This kind of construction with the logarithmic derivative leads to the Adams 
operations ijj' in topology and algebraic geometry. See Exercise 22 of Chapter 
XVIII. 

Remark. If it happens in Theorem 3.12 that E admits a decomposition into 
1-dimensional free modules in the K-group, then the proof trivializes by using 
the fact that \ t (L) = 1 + cl(L)í if L is 1-dimensional. But in the example of 
(G, /:)-spaces when k is a field, this is in general not possible, and it is also not 
possible in other examples arising naturally in topology and algebraic geometry. 
However, by “changing the base,” one can sometimes achieve this simpler 
situation, but Theorem 3.12 is then used in establishing the basic properties. C/. 
Grothendieck [SGA 6], mentioned in the introduction to Part IV, and other works 
mentioned in the bibliography at the end, namely [Ma 69], [At 61], [At 67], 
[Ba 68], [Bo 62]. The lectures by Atiyah and Bott emphasize the topological 
aspects as distinguished from the algebraic-geometric aspects. Grothendieck 
[Gr 68] actually shows how the formalism of Chern classes from algebraic 
geometry and topology also enters the theory of representations of linear groups. 
See also the exposition in [FuL 85], especially the formalism of Chapter I, §6. 
For special emphasis on applications to representation theory, see Brocker-tom 
Dieck [BtD 85], especially Chapter II, §7, concerning compact Lie groups. 


§4. INJECTIVE MODULES 

In Chapter III, §4, we defined projective modules, which have a natural 
relation to free modules. By reversing the arrows, we can define a module Q to 
be injective if it satisfies any one of the following conditions which are equivalent: 

11. Given any module M and a submodule M', and a homomorphism 
/:M' -► 0, there exists an extension of this homomorphism to M, 
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that is there exists h: M -> Q making the following diagram commuta- 
tive: 


0-► M'- >M 

f 

Q 

12. The functor M i—► Hom^M, Q) is exact. 

13. Every exact sequence 0 -► Q -► M -► Aí" -► 0 splits. 

We prove the equivalence. General considerations on homomorphisms as in 
Proposition 2.1, show that exactness of the homed sequence may fail only at 
one point, namely given 



0 -► M' -► M -► M" -► 0, 

the question is whether 

Hom Á (M, Q) - Hom Á (M\ 0 - 0 

is exact. But this is precisely the hypothesis as formulated in 11, so 11 implies 
12 is essentially a matter of linguistic reformulation, and in fact 11 is equivalent 
to 12. 

Assume I 2 or 11, which we know are equivalent. To get 13 is immediate, by 
applying 11 to the diagram: 


0- >Q -► M 



To prove the converse, we need the notion of push-out (cf. Exercise 52 of 
Chapter I). Given an exact diagram 

0-► M' -► M 


we form the push-out: 


M' 


Q 

M 


Q 


N = Q®M. 
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Since M' -> M is a monomorphism, it is immediately verified from the construc- 
tion of the push-out that Q -> N is also a monomorphism. By I 3, the sequence 

0 

splits, and we can now compose the splitting map N -► Q with the push-out map 
M -► N to get the desired h: M -► Q, thus proving 1 1. 

We saw easily that every module is a homomorphic image of a free module. 
There is no equally direct construction for the dual fact: 

Theorem 4.1. Every module is a submodule of an injective module. 

The proof will be given by dualizing the situation, with some lemmas. We 
first look at the situation in the category of abelian groups. If M is an abelian 
group, let its dual group be M A = Hom(M, Q/Z). If F is a free abelian group, 
it is reasonable to expect, and in fact it is easily proved that its dual F A is an 
injective module, since injectivity is the dual notion of projectivity. Furthermore, 
M has a natural map into the double dual M AA , which is shown to be a mono- 
morphism. Now represent M A as a quotient of a free abelian group, 

M A ^ 0. 


Dualizing this sequence yields a monomorphism 

0 —> M AA F\ 

and since M is embedded naturally as a subgroup of M AA , we get the desired 
embedding of M as a subgroup of F\ 

This proof also works in general, but there are details to be filled in. First 
we have to prove that the dual of a free module is injective, and second we have 
to be careful when passing from the category of abelian groups to the category 
of modules over an arbitrary ring. We now carry out the details. 

We say that an abelian group T is divisible if for every integer m, the homo- 
morphism 

m T : x i—► mx 

is surjective. 

Lemma 4.2. // T is divisible , then T is injective in the category of abelian 
groups. 

Proof Let M' c M be a subgroup of an abelian group, and let f:M'->T 
be a homomorphism. Let xsM. We want first to extend / to the module 
(M', x) generated by M' and x. If x is free over M', then we select any value 
t e 7\ and it is immediately verified that/extends to (M', x) by giving the value 
/(x) = t. Suppose that x is torsion with respect to M', that is there is a 
positive integer m such that mx e M'. Let d be the period of x mod M', so 
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dx e M', and d is the least positive integer such that dx e M'. By hypothesis, 
there exists an element ueT such that du = / ( dx). For any integer n, and zeM' 
define 

/(z -h nx) =f(z) + 

By the definition of d , and the fact that Z is principal, one sees that this value 
for /is independent of the representation of an element of (M', x) in the form 
z + nx, and then it follows at once that this extended definition of / is a 
homomorphism. Thus we have extended / to (M', x). 

The rest of the proof is merely an application of Zorn’s lemma. We consider 
pairs (N , g) consisting of submodules of M containing M', and an extension g 
of / to N. We say that (N, g) ^ (N u g t ) if N cz iVj and the restriction of g { 
to N is g. Then such pairs are inductively ordered. Let ( N , g) be a maximal 
element. If N ^ M then there is some xeM, x$ N and we can apply the first 
part of the proof to extend the homomorphism to ( N , x), which contradicts 
the maximality, and concludes the proof of the lemma. 

Example. The abelian groups Q/Z and R/Z are divisible, and hence are 
injective in the category of abelian groups. 

We can prove Theorem 4.1 in the category of abelian groups following the 
pattern described above. If F is a free abelian group, then the dual F A is a direct 
product of groups isomorphic to Q/Z, and is therefore injective in the category 
of abelian groups by Lemma 4.2. This concludes the proof. 

Next we must make the necessary remarks to extend the system to modules. 
Let A be a ring and let T be an abelian group. We make Hom z (^, T) into an 
4-module as follows. Let f:A -► T be an abelian group homomorphism. For 
ae A we define the operation 


(af)(b) =f(bá). 

The rules for an operation are then immediately verified. Then for any ,4-module 
X we have a natural isomorphism of abelian groups: 


Hom z (X, T) —» Hom A (X, Hom z (A, T)). 


Indeed, let ^ : X -> T be a Z-homomorphism. We associate with ijj the homo- 
morphism 


f:X^Hom z (A, T) 


such that 


/(x)(fl) = ( Kax). 
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The definition of the ,4-module structure on Hom z (>l, T) shows that / is an 
/1-homomorphism, so we get an arrow from Hom z (X, T ) to 

Hom^X, Hom z (/l, T)). 

Conversely, let/: X -> Hom z (/l, T) be an ,4-homomorphism. We define the 
corresponding \j/ by 


M*)=f(x)( 1). 

It is then immediately verified that these maps are inverse to each other. 

We shall apply this when T is any divisible group, although we think of T 
as being Q/Z, and we think of the homomorphisms into T as representing the 
dual group according to the pattern described previously. 

Lemma 4.3. IfT is a divisible abelian group , then Hom z (/l, T) is injective in 
the category of A-modules. 

Proof It suffices to prove that if 0 X -► Y is exact in the category of 
/1-modules, then the dual sequence obtained by taking /1-homomorphisms into 
Hom z (/l, T) is exact, that is the top map in the following diagram is surjective. 

Hom^Y, Hom z (/l, T))-, Uom A (X, Hom Z (A, T)) —1—0 


Hom z (Y, T) -► Hom z (X, T) ->0 

But we have the isomorphisms described before the lemma, given by the vertical 
arrows of the diagram, which is commutative. The bottom map is surjective 
because T is an injective module in the category of abelian groups. Therefore 
the top map is surjective, thus proving the lemma. 

Now we prove Theorem 4.1 for A-modules. Let M be an A-module. We can 
embed M in a divisible abelian group T, 

/ 

0 -> M i T. 

Then we get an /1-homomorphism 

M - Hom z (/l, T) 

by x i—► f x , where f x (a) = f(ax). One verifies at once that x i— >f x gives an em- 
bedding of M in Hom z (A, T), which is an injective module by Lemma 4.3. This 
concludes the proof of Theorem 4.1. 
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§5. HOMOTOPIES OF MORPHISMS OF 
COMPLEXES 

The purpose of this section is to describe a condition under which homo- 
morphisms of complexes induce the same map on the homology and to show 
that this condition is satisfied in an important case, from which we derive 
applications in the next section. 

The arguments are applicable to any abelian category. The reader may pre- 
fer to think of modules, but we use a language which applies to both, and is no 
more complicated than if we insisted on dealing only with modules. 

Let E = {( E n , d n )} and E' = {( E ' n , d' n )} be two complexes. Let 

fg\E->E' 

be two morphisms of complexes (of degree 0). We say that /is homotopic to g 
if there exists a sequence of homomorphisms 

h n \E n ^E' (n ~ l) 

such that 

f n -g n = d'«- l % + h n+l d n . 

Lemma 5.1. Iff g are homotopic, then f g induce the same homomorphism 

on the homology H(E\ that is 

H(f n ) = H(g n ) \H\E) ^H n (E'). 

Proof The lemma is immediate, because /„ — g n vanishes on the cycles, 
which are the kernel of d n , and the homotopy condition shows that the image of 
/„ — g n is contained in the boundaries, that is, in the image of d ' {n ~ 1} . 

Remark. The terminology of homotopy is used because the notion and 
formalism first arose in the context of topology. Cf. [ES 52] and [GreH 81]. 

We apply Lemma 5.1 to injective objects. Note that as usual the definition 
of an injective module applies without change to define an injective object in 
any abelian category. Instead of a submodule in I 1, we use a subobject, or 
equivalently a monomorphism. The proofs of the equivalence of the three con- 
ditions defining an injective module depended only on arrow-theoretic juggling, 
and apply in the general case of abelian categories. 

We say that an abelian category has enough injectives if given any object M 
there exists a monomorphism 


0-M-/ 
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into an injective object. We proved in §4 that the category of modules over a 
ring has enough injectives. We now assume that the abelian category we work 
with has enough injectives. 

By an injective resolution of an object M one means an exact sequence 
0 -► M -► 1° -> 7 1 -► I 2 -► 

such that each /„ (n ^ 0) is injective. Given M, such a resolution exists. ïndeed, 
the monomorphism 

0 ^ M - /° 

exists by hypothesis. Let M° be its image. Again by assumption, there exists a 
monomorphism 


0 - I°/M° -► I\ 

and the corresponding homomorphism /° -> 7 1 has kernel M°. So we have 
constructed the first step of the resolution, and the next steps proceed in the 
same fashion. 

An injective resolution is of course not unique, but it has some uniqueness 
which we now formulate. 

Lemma 5.2. Consider two complexes: 

0-► M-►£°- > E 1 - >E 2 -> 


0-►M'-►Z 0 -► / ->/ 2 - ►... 

Suppose that the top row is exact , and that each T (n ^ 0) is injective. Let 
cp:M -+ M' be a given homomorphism. Then there exists a morphism f of 
complexes such that /_ j = cp; and any two such are homotopic. 

Proof. By definition of an injective, the homomorphism M -► /° via M' 
extends to a homomorphism 


fo--E° 


i°. 


which makes the first square commute: 


M 

<p 


E 0 

fo 


M' --/° 
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Next we must construct f. We write the second square in the form 

0-* E°/M -> E 1 

/o 

7° -> I 1 

with the exact top row as shown. Again because / 1 is injective, we can apply the 
same argument and find f x to make the second square commute. And so on, 
thus constructing the morphism of complexes f 

Suppose f g are two such morphisms. We define h 0 : E° -► M' to be 0. 
Then the condition for a homotopy is satisfied in the first instance, when 

f-i = 9 -1 =<P- 

Next let d ~ 1 : M -» £° be the embedding of M in E°. Since 7° is injective, 
we can extend 

d° : £°/Im d ~ 1 - E x 

to a homomorphism : E l -► /°. Then the homotopy condition is verified for 
/o — g 0 . Since h 0 = 0 we actually have in this case 

fo~9o = M°> 

but this simplification is misleading for the inductive step which follows. We 
assume constructed the map h n + ì , and we wish to show the existence of h n + 2 
satisfying 

/»+1 ““ 0«+i = d' n h n+ 1 + h n+2 d n + í . 

Since Im d" = Ker d n+ \ we have a monomorphism £ n+ Vlm d n -► £ n + 2 . By 
the definition of an injective object, which in this case is /” + / it suffices to prove 
that 

f n+ 1 — g n+ 1 - d' n h n+ 1 vanishes on the image of d n , 
and to use the exact diagram: 

0-> £” +1 /Im d n -> E n + 2 

fn + 1 ~9n+ 1 

/ n+1 

to get the existence of h„ + 2 :E" +2 -> I n+Í extending / n+1 - g„ + i. But we 
have: 

(/. + i -g„ +l -d'"h„ +l )d" 

= (/ n+ i -g„ +l )d"-d'"h„ +1 d" 
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= (/„ + i - g n+ 1 }d? - d'\f n -g n - d %) by induction 
= (f n+í -g n+ òd n - d'\f n - g n ) because d'd' = 0 

= 0 because f g are 

homomorphisms of 
complexes. 

This concludes the proof of Lemma 5.2. 

Remark. Dually, let P M > -> M' -+ 0 be a complex with P 1 projective for 
i ^ 0,andlet£ M -> M -> Obearesolution. Let<p: M' -> M beahomomorphism. 
Then <p extends to a homomorphism of complex P -> E. The proof is obtained 
by reversing arrows in Lemma 5.2. The books on homological algebra that I 
know of in fact carry out the projective case, and leave the injectivc case to the 
reader. However, one of my motivations is to do here what is needed, for 
instance in [Ha 77], Chapter III, on derived functors, as a preliminary to the 
cohomology of sheaves. For an example of projective resolutions using free 
modules, see Exercises 2-7, concerning the cohomology of groups. 


§6. DERIVED FUNCTORS 

We continue to work in an abelian category. A covariant additive functor 

F : d -► (B 

is said to be left exact if it transfòrms an exact sequence 

0 - M' - M - M" 

into an exact sequence 0 -► F(M') -► F(M) -► F(M"). We remind the reader 
that F is called additive if the map 

Hom04', A) - Hom(ivl', FA) 

is additive. 

We assume throughout that F is left exact unless othermse specified , and 
additive. We continue to assume that our abelian category has enough in - 
jectives. 

Given an object M, let 

0 — M — /° — 7 1 — / 2 — 
be an injective resolution, which we abbreviate by 

0 - M - / M , 

where I M is the complex /° -► Z 1 -► / 2 -►. We let / be the complex 

0 _►/° _►/!_►/ 2 _► 
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We define the right-derived functor R n F by 

R n F(M ) = // n (F(/)), 
in other words, the n- th homology of the complex 

0 -► F(I°) -> F(I l ) -► F(I 2 ) -> 

Directly from the definitions and the monomorphism M -► / 0 , we see that there 
is an isomorphism 

R°F(M) = F(M ). 

This isomorphism seems at first to depend on the injective resolution, and so 
do the functors R n F(M) for other n. However, from Lemmas 5.1 and 5.2 we 
see that given two injective resolutions of M, there is a homomorphism between 
them, and that any two homomorphisms are homotopic. If we apply the functor 
F to these homomorphisms and to the homotopy, then we see that the homology 
of the complex F(I) is in fact determined up to a unique isomorphism. One 
therefore omits the resolution from the notation and from the language. 

Example 1. Let R be a ring and let d = Mod (R) be the category of R - 
modules. Fix a module A. The functorA/ i-> Hom (A, M) is leftexact, i.e. given 
an exact sequence 0 —> M' —> M —> M'\ the sequence 

0 —> Hom(A, M') —> Hom(A, M) —> Hom (A , M") 

is exact. Its right derived functors are denoted by Ext n (A, M) for M variable. 
Similarly, for a fixed module B, the functor X Hom (X, B) is right exact, 
and it gives rise to its left derived functors. For the explicit mirror image of 
the terminology, see the end of this section. In any case, we may consider A as 
variable. In §8 we shall go more deeply into this aspect of the formalism, by 
dealing with bifunctors. It will turn out that Ext n (A , B) has a dual interpretation 
as a left derived functor of the first variable and right derived functor of the 
second variable. See Corollary 8.5. 

In the exercises, you will prove that Ext ! (A, M) is in bijection with iso- 
morphism classes of extensions, ofM by A, that is, isomorphism classes of exact 
sequences 

0^> A^> E^> M^>0. 

The name Ext comes from this interpretation in dimension 1. 

For the computation of Ext' in certain important cases, see Chapter XXI, 
Theorems 4.6 and 4.11, which serve as examples for the general theory. 

Example 2. Let R be commutative. The functor M i-> A ® M is right exact, 
in other words, the sequence 

j4®M'—>A®M—>4®M"—>0 

is exact. Its left derived functors are denoted by Tor n (A, M) for M variable. 
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Example 3. Let G be a group and let R = Z[G ] be the group ring. Let Ct 
be the category of G-modules, i.e. G = Mod(/?), also denoted by Mod(G). For 
a G-module A, let A G be the submodule (abelian group) consisting of those 
elements v such that xv = v for all * e G. Then A i-> A G is a left exact functor 
from Mod (R) into the category of abelian groups. Its left derived functors give 
rise to the cohomology of groups. Some results from this special cohomology 
will be carried out in the exercises, as further examples of the general theory. 

Example 4. Let X be a topological space (we assume the reader knows 
what this is). By a sheaf 5 of abelian groups on X, we mean the data: 

(a) For every open set U of X there is given an abelian group ÍF(Í/). 

(b) For every inclusion V C U of open sets there is given a homomorphism 

rtsy : £F(£/) —> ÍF(V), 

called the restriction from U to V, subject to the following conditions: 

SH 1. íF(empty set) = 0. 

SH 2. res£/ is the identity íF(G) —> íF(G). 

SH 3. If W C V C U are open sets, then res{^ ° res^ = res(£. 

SH 4. Let U be an open set and {V"} be an open covering of U. Let 
s e 5(U). If the restriction of s to each V- is 0, then s = 0. 

SH 5. Let U be an open set and let {VJ} be an open covering of U. Suppose 
given Si e íF(VJ) for each /, such that given /, j the restrictions of s { 
and Sj to fl Vj are equal. Then there exists a unique s e5(U) whose 
restriction to V t is s ( for all /. 

Elements of 5(U) are called sections of íF over U. Elements of ÍF(X) are called 
global sections. Just as for abelian groups, it is possible to define the notion of 
homomorphisms of sheaves, kernels, cokernels, and exact sequences. The asso- 
ciation ÍF i-> ÍF (X) (global sections functor) is a functor from the category of 
sheaves of abelian groups to abelian groups, and this functor is left exact. Its 
right derived functors are the basis of cohomology theory in topology and algebraic 
geometry (among other fields of mathematics). The reader will find a self- 
contained brief definition of the basic properties in [Ha 77], Chapter II, §1, as 
well as a proof that these form an abelian category. For a more extensive treatment 
I recommend Gunning’s [Gu 91], mentioned in the introduction to Part IV, 
notably Volume III, dealing with the cohomology of sheaves. 

We now return to the general theory of derived functors. The general theory 
tells us that these derived functors do not depend on the resolution by projectives 
or injectives according to the variance. As we shall also see in §8, one can even 
use other special types of objects such as acyclic or exact (to be defined), which 
gives even more flexibility in the ways one has to compute homology. Through 
certain explicit resolutions, we obtain means of computing the derived functors 
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explicitly. For example, in Exercise 16, you will see that the cohomology of 
finite cyclic groups can be computed immediately by exhibiting a specific free 
resolution of Z adapted to such groups. Chapter XXI will contain several other 
examples which show how to construct explicit finite free resolutions, which 
allow the determination of derived functors in various contexts. 

The next theorem summarizes the basic properties of derived functors. 

Theorem6.1. Let Q. be an abelian category mth enough injectives , and let 

F: (i ->(B be a covariant additive left exact functor to another abelian cate - 

gory(R. Then: 

(i) For each n ^ 0, R n F as defined above is an additive functor from (ì 
to CB. Furthermore , it is independent , up to a unique isomorphism of 
functors , of the choices of resolutions made. 

(ii) There is a natural isomorphism F « R°F. 

(iii) For each short exact sequence 

0 -► M' -► M -► M" -► 0 

and for each n ^ 0 there is a natural homomorphism 
ô n :R n F(M")^ R n+Ì F(M) 
such that we obtain a long exact sequence: 

- R n F(M') - R n F(M) - R"F(M") - K” + l F(M') 

(iv) Given a morphism of short exact sequences 

0- >M' - * M - *M" --0 


0- *N' - *N - >N” - > 0 

the ò's give a commutative diagram: 

R n F(M") à " —> R n+ l F(M') 


R n F(N") ———♦ R n+ l F(N') 

(v) For each injective object I of A andfor each n > 0 we have R n F(I) = 0. 

Properties (i), (ii), (iii), and (iv) essentially say that R n F is a delta-functor in a 
sense which will be expanded in the next section. The last property (v) will be 
discussed after we deal with the delta-functor part of the theorem. 
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We now describe how to construct the á-homomorphisms. Given a short 
exact sequence, we can find an injective resolution of M', M, M" separately, but 
they don’t necessarily fit in an exact sequence of complexes. So we must achieve 
this to apply the considerations of §1. Consider the diagram: 

0 0 0 


0-► M'-> M-► M"-» 0 

o-> r° -> x -> /"°-> o. 

We give monomorphisms M' -► /'° and M" -► /"° into injectives, and we want to 
find X injective with a monomorphism M -► X such that the diagram is exact. 
We take X to be the direct sum 

X = /'° 0 /"°. 

Since í° is injective, the monomorphism M' -► I'° can be extended to a homo- 
morphism M -► /'°. We take the homomorphism of M into l'° © I"° which 
comes from this extension on the first factor í'°, and is the composite map 

M - M" - I"° 

on the second factor. Then M -► X is a monomorphism. Furthermore /'° -► X 
is the monomorphism on the first factor, and X -► I"° is the projection on the 
second factor. So we have constructed the diagram we wanted, giving the 
beginning of the compatible resolutions. 

Now we take the quotient homomorphism, defìning the third row, to get an 
exact diagram: 

0 0 0 


0-► M'-► M-► M"-► 0 


o-► r° -► /°-► r o -► o 


0-► JV'-> N -> N" -► 0 


0 0 0 
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where we let /° = X 9 and JV', N 9 N" are the cokernels of the vertical maps by 
definition. The exactness of the N-sequence is left as an exercise to the reader. 
We then repeat the construction with the N-sequence, and by induction construct 
injective resolutions 


0 0 0 


0-> M' -► M -> M" -> 0 


0 ► I M ' ► I M ► / M » ► 0 

of the Àí-sequence such that the diagram of the resolutions is exact. 

We now apply the functor F to this diagram. We obtain a short sequence of 
complexes: 


0 - F(F) - F(I) - F(F') - 0, 

which is exact because / = /' © /" is a direct sum and F is left exact, so F com- 
mutes with direct sums. We are now in a position to apply the construction of 
§1 to get the coboundary operator in the homology sequence: 

R n F(M') R n F(M) -► R n F(M") ^ R n+ l F(M'). 

This is legitimate because the right derived functor is independent of the chosen 
resolutions. 

So far, we have proved (i), (ii), and (iii). To prove (iv), that is the naturality of 
the dçlta homomorphisms, it is necessary to go through a three-dimensional 
commutative diagram. At this point, I feel it is best to leave this to the reader, 
since it is just more of the same routine. 

Finally, the last property (v) is obvious, for if / is injective, then we can 
use the resolution 


0-/-/-0 

to compute the derived functors, from which it is clear that R n F = 0 forn > 0. 
This concludes the proof of Theorem 6.1. 

In applications, it is useful to determine the derived functors by means of 
other resolutions besides injective ones (which are useful for theoretical 
purposes, but not for computational ones). Let again F be a left exact additive 
functor. An object X is called F-acyclic if R n F(X) = 0 for all n > 0. 
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Theorem 6.2. Let 


0 -► M - X° - X 1 - X 2 - • • • 
be a resolution of M by F-acyclics. Let 

O-M-/ 0 -/ 1 -/ 2 - • 

be an injective resolution. Then there exists a morphism of complexes X M — / M 
extending the identity on M, and this morphism induces an isomorphism 

H n F(X) x H n F(I) = R n F(M) for all n^ 0. 

Proof. The existence of the morphism of complexes extending the identity 
on M is merely Lemma 5.2. The usual proof of the theorem via spectral se- 
quences can be formulated independently in the following manner, shown to 
me by David Benson. We need a lemma. 

Lemma 6.3. Let Y l (i ^ 0) be F-acyclic , and suppose the sequence 

o- y°- y 1 - y 2 -..- 


is exact. Then 


0 - F(Y°) - F(Y^) — F(Y 2 ) — • 

is exact. 

Proof. Since F is left exact, we have an exact sequence 
0 — F(Y°) — F(Y X ) — F(Y 2 ). 

We want to show exactness at the next joint. We draw the cokernels: 
0- >Y° -►T 1 - >Y 2 -►Y 3 



So Zj = Coker(y° — Y 1 ); Z 2 = Cofcer^y 1 — Y 2 ); etc. Applying F we have 
an exact sequence 


0 - F(Y°) - F(Y l ) - F(Z l ) - R l F(Y°) = 0. 



XX, §6 


DERIVED FUNCTORS 797 


So FCZj) = Coker (F(Y°) -> FiY 1 )). We now consider the exact sequence 

0 ^ Zí ^Y 2 ^Y 3 


giving the exact sequence 

0 ^ F(Z l ) ^ F(Y 2 ) ^ F(Y 3 ) 

by the left-exactness of F, and proving what we wanted. But we can now 
continue by induction because Z x is F-acyclic, by the exact sequence 

0 -► R n F(Y l ) - R n F(Z l ) - R n+ l F(Y°) = 0. 

This concludes the proof of Lemma 6.3. 

We return to the proof of Theorem 6.2. The injective resolution 

0 - M - / M 

can be chosen such that the homomorphisms X n -> I n are monomorphisms for 
n ^ 0, because the derived functor is independent of the choice of injective 
resolution. Thus we may assume without loss of generality that we have an 
exact diagram: 


0 0 0 


0- >M - >X° -‘X 1 - >X 2 


+ + + + 
0- >M -♦ 1° -♦ /‘ -♦ I 2 


o-♦ y°-♦ Y l -♦ Y 2 


0 0 0 

defining Y" as the appropriate cokernel of the vertical map. 
Since X" and I" are acyclic, so is Y n from the exact sequence 

R k F(I n ) - R k F( Y n ) -+R k + l F(X"). 

Applying F we obtain a short exact sequence of complexes 


0 ^ F(*) - F(I) - F(Y) - 0. 
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whence the corresponding homology sequence 

H n ~ 'F( Y) - H n F(X) - H n F(I ) - H n F(Y). 

Both extremes are 0 by Lemma 6.3, so we get an isomorphism in the middle, 
which by definition is the isomorphism 

H n F(X) « R n F(M\ 

thus proving the theorem. 

Left derived functors 

We conclude this section by a summary of the properties of left derived 
functors. 

We consider complexes going the other way, 

- X n --► X 2 - X, - X 0 - M - o 

which we abbreviate by 

X m —► M —► 0. 

We call such a complex a resolution of M if the sequence is exact. We call it a 
projective resolution if X n is projective for all n ^ 0. 

Given projecîive resolutions X M , Y M > and a homomorphism 


(p: M -+ M' 

there always exists a homomorphism X M -► Y M > extending <p , a/zd any two 
such are homotopic. 

In fact, one need only assume that X M is a projective resolution, and that 
y M is a resolution, not necessarily projective, for the proof to go through. 

Let T be a covariant additive functor. Fix a projective resolution of an ob- 
ject M, 

P m ^M^ 0 . 

We define the left derived functor L n T by 

L n T(M) = H n (T(P)\ 


where T(P) is the complex 

- T(P n ) - • - T(P 2 ) - T(P x ) - T(P 0 ) - 0. 

The existence of homotopies shows that L n T(M) is uniquely determined up 
to a unique isomorphism if one changes the projective resolution. 

We define T to be right exact if an exact sequence 


M' -► M -► M" -► 0 
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yields an exact sequence 


T(M') - T(M) - T(M") 0. 

IfT is right exact, then we have immediately from the definitions 

L 0 T(M) « M. 

Theorems 6.1 and 6.2 then go over to this case with similar proofs. One 
has to replace “injectives” by “projectives” throughout, and in Theorem 6.1, 
the last condition states that for n > 0, 

L n T(F) = 0 ifP is projective. 

Otherwise, it is just a question of reversing certain arrows in the proofs. For 
an example of such left derived functors, see Exercises 2-7 concerning the 
cohomology of groups. 


§7. DELTA-FUNCTORS 

In this section, we axiomatize the properties stated in Theorem 6.1 following 
Grothendieck. 

Let Ct, CB be abelian categories. A (covariant) á-functor from G to CB is a 
family of additive functors F = {F n ] n ^ 0 , and to each short exact sequence 

0 -> M' -► M -► M" -► 0 

an associated family of morphisms 

ô n :F n (M")-+ F n+Í (M') 

with n ^ 0, satisfying the following conditions: 

DELl. For each short exact sequence as above, there is a long exact 
sequence 

0 - F°(M') - F°(M) - F°(M") - F l (M') - • • • 

- F n (M') - F n (M) - F n (M") - F n+ l (M') - 

DEL 2. For each morphism of one short exact sequence as above into 
another 0 -► IV' -► N -► IV" -► 0, the á’s give a commutative 
diagram: 

F\M") — â —*F n+1 (M’) 


F"(N") — r -*F n+1 (N'). 
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Before going any further, it is useful to give another definition. Many proofs 
in homology theory are given by induction from one index to the next. It turns 
out that the only relevant data for going up by one index is given in two succes- 
sive dimensions, and that the other indices are irrelevant. Therefore we general- 
ize the notion of á-functor as follows. 

A ô-functor defined in degrees 0, 1 is a pair of functors (F°, F l ) and to 
each short exact sequence 


0^A'->A -+A"-+0 


an associated morphism 


ô: F°(A") -► F\A") 


satisfying the two conditions as before, but putting n = 0, n -f 1 = 1, and for- 
getting about all other integers n. We could also use any two consecutive posi- 
tive integers to index the á-functor, or any sequence of consecutive integers 
^ 0. In practice, only the case of all integers ^ 0 occurs, but for proofs, it is 
useful to have the fíexibility provided by using only two indices, say 0, 1. 

The <5-functor F is said to be universal, if given any other á-functor G of d 
into CB, and given any morphism of functors 

fo : F° G°, 

there exists a unique sequence of morphisms 

fn * F * 1 ► G n 

for all n ^ 0, which commute with the ò n for each short exact sequence. 

By the definition of universality, a á-functor G such that G° = F° is uniquely 
determined up to a unique isomorphism of functors. We shall give a condition 
for a functor to be universal. 

An additive functor F of d into (B is called erasable if to each object A there 
exists a monomorphism u : A -> M for some M such that F(u) = 0. In practice, 
it even happens that F(M) = 0, but we don’t need it in the axiomatization. 

Linguistic note. Grothendieck originally called the notion “effaceable” in 
French. The dictionary translation is “erasable,” as I have used above. Ap- 
parently people who did not know French have used the French word in English, 
but there is no need for this, since the English word is equally meaningful and 
convenient. 

We say the functor is erasable by injectives if in addition M can be taken to 
be injective. 
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Example. Of course, a right derived functor is erasable by injectives, and 
a left derived functor by projectives. However, there are many cases when one 
wants erasability by other types of objects. In Exercises 9 and 14, dealing with 
the cohomology of groups, you will see how one erases the cohomology functor 
with induced modules, or regular modules when G is finite. In the category of 
coherent sheaves in algebraic geometry, one erases the cohomology with locally 
free sheaves of finite rank. 


Theorem 7.1. Let F = {F n } be a covariant ô-functor from Gfc into CB. IfF n is 
erasable for each n > 0, then F is universal. 

Proof Given an object A , we erase it with a monomorphism u, and get a 
short exact sequence: 

O^A^M^X^O. 

Let G be another á-functor with given / 0 : F° -► G°. We have an exact com- 
mutative diagram 


F°(M) -♦ F°(X )- > F\A) -»0 


/ 0 

G°(M) 


f 0 


/V 


G°l 


X) 


ÒG 


G\A) 


We get the 0 on the top right because of the erasability assumption that 

F\cp) = 0. 

We want to construct 


fM)'F\A)->G\A) 

which makes the diagram commutative, is functorial in A , and also commutes 
with the ò. Commutativity in the left square shows that Ker ò F is contained in 
the kernel of ô G °f 0 . Hence there exists a unique homomorphism 

ffA)'F\A)->G\A) 

which makes the right square commutative. We are going to show that ffA) 
satisfies the desired conditions. The rest of the proof then proceeds by induction 
following the same pattern. 

We first prove the functoriality in A. 

Let u : A -► B be a morphism. We form the push-out P in the diagram 

A —-—> M 


B 


P 
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Since (p is a monomorphism, it follows that B P is a monomorphism also. 
Then we let P -> N be a monomorphism which erases F v This yields a com- 
mutative diagram 


0- >A - >M --►O 

U V w 

0- >B - *N - >Y -»0 


where B -> N is the composite B -* P -> N, and Y is defined to be the cokernel 
of B->N. 

Functoriality in A means that the following diagram is commutative. 


F l (Á) F 1 


fliA) \ 

G\A) 


FHu) 


>G l 


B ) 

fi(B) 

B) 


This square is the right-hand side of the following cube: 



All the faces of the cube are commutative except possibly the right-hand face. 
It is then a general fact that if the top maps here denoted by ô F are epimorphisms, 
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then the right-hand side is commutative also. This can be seen as follows. We 
start with ffB^F^^Ôp. We then use commutativity on the top of the cube, 
then the front face, then the left face, then the bottom, and finally the back face. 
This yields 

fi(B)F l (u)ô F = G\u)ffA)ô F . 

Since ô F is an epimorphism, we can cancel ô F to get what we want. 

Second, we have to show that f x commutes with <5. Let 
0 -+ A' -► A -► A" -► 0 

be a short exact sequence. The same push-out argument as before shows that 
there exists an erasing monomorphism 0 -► A' -► M and morphisms v , vv 
making the following diagram commutative: 



Here X is defined as the appropriate cokernel of the bottom row. We now 
consider the following diagram: 

F°(A") 



Our purpose is to prove that the right-hand face is commutative. The triangles 
on top and bottom are commutative by the definition of a á-functor. The 
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left-hand square is commutative by the hypothesis that / 0 is a morphism 
of functors. The front square is commutative by the definition of fi(A'). 
Therefore we find: 


A(A')ô f =MA')ô f F°(w) 
= S F f 0 F°(w) 

= ô F G°(w)f 0 
= &F fo 


(top triangle) 
(front square) 

(left square) 
(bottom triangle). 


This concludes the proof of Theorem 7.1, since instead of the pair of indices 
(0, 1) we could have used (n, n + 1). 

Remark. The morphism f x constructed in Theorem 7.1 depends functori- 
ally on / 0 in the following sense. Suppose we have three delta functors F, G, tì 
defined in degrees 0, 1. Suppose given morphisms 

/ 0 :F°-G° and g 0 :G 0 -^H 0 . 

Suppose that the erasing monomorphisms erase both F and G. Then we can 
construct/j and g x by applying the theorem. On the other hand, the composite 

g 0 f 0 = h 0 :F° H° 


is also a morphism of functors, and the theorem yields the existence of a morph- 
ism 


h x :F x -Z/ 1 

such that (h 0 , h x ) is a á-morphism. By uniqueness, we therefore have 

hi = gJv 

This is what we mean by the functorial dependence as mentioned above. 

Corollary 7.2. Assume that d has enough injectives. Then for any left exact 
Junctor F: (î -► (B, the derived functors R n F with n ^ 0 form a universal 
ô-functor with F « F°F, which is erasable by injectives. Conversely , if 
G = {G"}„^o ÍS a universal ò-functor , then G° is left exact , and the G" are 
isomorphic to R n G° for each n ^ 0. 

Proof. If F is a left exact functor, then the {R n F} n ^ 0 form a á-functor 
by Theorem 6.1. Furthermore, for any object A, let u : A -► I be a monomor- 
phism of A into an injective. Then R n F(I) = 0 for n > 0 by Theorem 
6.1(iv), so R n F(u) = 0. Hence R n F is erasable for all n > 0, and we can apply 
Theorem 7.1. 

Remark. As usual, Theorem 7.1 applies to functors with different variance. 
Suppose {F"} is a family of contravariant additive functors, with n ranging over 
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a sequence of consecutive integers, say for simplicity n ^ 0. We say that F is a 
contravariant ó-functor if given an exact sequence 

0 -4 M' -► M -► M" -► 0 

then there is an associated family of morphisms 

ò n : F n (M') -► F n+Í (M') 

satisfying DEL 1 and DEL 2 with M' interchanged with M" and N' inter- 
changed with N". We say that F is coerasable if to each object A there exists an 
epimorphism u : M -► A such that F(u) = 0. We say that F is universal if 
given any other á-functor G of Q. into (B and given a morphism of functors 

fo'-F°-G 0 

there exists a unique sequence of morphisms 

for all n ^ 0 which commute with ô for each short exact sequence. 

Theorem 7.1'. Let F = {F n } (n ranging over a consecutive sequence of 
integers ^ 0) be a contravariant ô-functor from <î into (B, and assume that 
F n is coerasable for n ^ 1. Then F is universal. 

Examples of á-functors with the variances as in Theorems 7.1 and 7.1' will 
be given in the next section in connection with bifunctors. 

Dimension shifting 

Let F = {F n } be a contravariant delta functor with n ^ 0. Let 8 be a 
family of objects which erases F n for all n ^ 1, that is F n (E) = 0 for n ^ 1 and 
E 6 8. Then such a family allows us to do what is called dimension shifting as 
follows. Given an exact sequence 

O-g-E-M-O 

with E 6 8, we get for n ^ 1 an exact sequence 

0 = F n (E) - F n (Q) - F n+ l (M) - F n+ \E) = 0, 
and therefore an isomorphism 

F n (Q)^F n+í (M\ 

which exhibits a shift of dimensions by one. More generally: 

Proposition 7.3. Let 

0 —► Q — > E n _ j —► • • —► Eq — ► M —► 0 
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be an exact sequence , such that e 8. Then we have an isomorphism 
F P (Q ) % F p+n (M) forp^ 1. 

Proo/. Let (2 = (?„• Also without loss of generality, take p — 1. We may 
insert kernels and cokernels at each step as follows: 



Qn Qn-Ì Qn-2 Q\ ^ 



Then shifting dimension with respect to each short exact sequence, we find 
isomorphisms 

This concludes the proof. 

One says that M has F-dimension ^ d if F n (M) = 0 for n ^ d + 1. By 
dimension shifting, we see that if M has F-dimension ^ d, then Q has F- 
dimension ^ d - n in Proposition 7.3. In particular, if M has F-dimension n, 
then Q has F-dimension 0. 

The reader should rewrite all this formalism by changing notation, using for 
F the standard functors arising from Hom in the first variable, on the category 
of modules over a ring, which has enough projectives to erase the left derived 
functors of 


A !—► Hom(/l, B\ 

for B fixed. We shall study this situation, suitably axiomatized, in the next sec- 
tion. 


§8. BIFUNCTORS 

In an abelian category one often deals with Hom, which can be viewed as a 
functor in two variables; and also the tensor product, which is a functor in two 
variables, but their variance is different. In any case, these examples lead to the 
notion of bifunctor. This is an association 


(A, B) T(A 9 B) 
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where A, B are objects of abelian categories (ï and ® respectively, with values 
in some abelian category. This means that T is functorial in each variable, with 
the appropriate variance (there are four possibilities, with covariance and con- 
travariance in all possible combinations); and if, say, T is covariant in all 
variables, we also require that for homomorphisms A' -> A and B' -> B there 
is a commutative diagram 


T(A\ B ) -► T(A\ B) 


T(A , B') -> T(A 9 B). 

If the variances are shuffled, then the arrows in the diagram are to be reversed in 
the appropriate manner. Finally, we require that as a functor in each variable, 
T is additive. 

Note that Hom is a bifunctor, contravariant in the first variable and covari- 
ant in the second. The tensor product is covariant in each variable. 

The Hom functor is a bifunctor T satisfying the following properties: 

HOM1. T is contravariant and left exact in the first variable. 

HOM 2. T is covariant and left exact in the second variahle. 

HOM 3. For any injective ohject J the functor 

A i—► T(/4, J) 

is exact. 

They are the only properties which will enter into consideration in this 
section. There is a possible fourth one which might come in other times: 

HOM 4. For any projective object Q the functor 

B^T(Q, B) 


is exact. 

But we shall deal non-symmetrically, and view T as a functor of the second 
variable, heeping the first one fixed , in order to get derived functors of the second 
variable. On the other hand, we shall also obtain a á-functor of the first variable 
by using the bifunctor, even though this <5-functor is not a derived functor. 

If ® has enough injectives, then we may form the right derived functors with 
respect to the second variable 


B i—► R n T(A, B), also denoted by R n T A (B), 
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fixing A, and viewing B as variable. If T = Hom, then this right derived functor 
is called Ext, so we have by definition 

Ext n (A, X) = R n Hom(/l, X). 

We shall now give a criterion to compute the right derived functors in terms 
of the other (first) variable. We say that an object A is 7’-exact if the functor 
B i—► T(A, B) is exact. By a 7’-exact resolution of an object A, we mean a resolu- 
tion 


—► M j —► M 0 -► A -* 0 


where M n is T-exact for all n ^ 0. 

Examples. Let d and CB be the categories of modules over a commutative 
ring. Let T = Hom. Then a T -exact object is by definition a projective module. 
Now let the transpose of T be given by 

'T(A 9 B) = T(B, A ). 

Then a 'T-exact object is by definition an injective module. 

If T is the tensor product, such that T(A, B) = A ® B, then a T-exact object 
is called flat. 

Remark. In the category of modules over a ring, there are enough pro- 
jectives and injectives. But there are other situations when this is not the case. 
Readers who want to see all this abstract nonsense in action may consult 
[GriH 78], [Ha 77], not to speak of [SGA 6] and Grothendieck’s collected works. 
It may genuinely happen in practice that (B has enough injectives but (î does not 
have enough projectives, so the situation is not all symmetric. Thus the functor 
A i-> R n T(A, B) for fixed B is not a derived functor in the variable A. In the 
above references, we may take for 0 1 the category of coherent sheaves on a 
variety, and for (B the category of all sheaves. We let T = Hom. The locally 
free sheaves of finite rank are T-exact, and there are enough of them in 0t. There 
are enough injectives in (B. And so it goes. The balancing act between T-exacts 
on one side, and injectives on the other is inherent to the situation. 

Lemma 8.1. Let T be a bifunctor satisfying HOM 1, HOM 2. Let A e d, 
and let M A -> A -> 0, that is 

► M j —► M q —► A —► 0 

be a T-exact resolution of A. Let F n (B) = H n (T(M, B))for Be( B. Then F 
is a ò-functor and F°(B) = T(A,B). If in addition T satisfies HOM3, 
then F n (J) = 0 for J injective and n ^ 1. 
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Proof. Given an exact sequence 

0 -► B' -► B -► B" -► 0 

we get an exact sequence of complexes 

0 - T(M, B') - T(M, £) - T(M, B") - 0, 

whence a cohomology sequence which makes F into a á-functor. For n = 0 
we get F°(B) = T(A , B) because X i—► T(X, B) is contravariant and left exact 
for X g Q.. If B is injective, then F n (B) = 0 for n ^ 1 by HOM 3, because 
X i—► T(X, B) is exact. This proves the lemma. 

Proposition 8.2. Let T be a bifunctor satisfying HOM 1, HOM 2, HOM 3. 
Assume that (B has enough injectives. Let Ae Q. Let 

M A — > A —► 0 

be a T-exact resolution of A. Then the ívvo ô-functors 

B h-► R n T(A, B) and B\-+ H n (T(M, B)) 

are isomorphic as universal ô-functors vanishing on injectives, for n ^ 1, and 
such that 


R°T(A , B) = H°(T(M\ B) = T(A , £). 

Froo/. This comes merely from the universality of a á-functor erasable 
by injectives. 

We now look at the functoriality in A. 

Lemma 8.3. Let T satisfy HOM 1, HOM 2, and HOM 3. Assume that 
(B has enough injectives. Let 

0 -► A' -> A - A" -► 0 

be a short exact sequence. Then for fixed B , we have a long exact sequence 
0 - T(A", B) - T(A, B) - T/4', B) - 
- R l T(A", B) - F^T^, £) - /^TU', B) - 
such that the association 


A i—► R n T(A, B) 


is a ô-functor. 
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Proof. Let 0 -► B -> I B be an injective resolution of B. From the exactness 
of the functor A i—► T(A , J), for J injective we get a short exact sequence of 
complexes 


0 - T(A\ I B ) - T(A , I B ) T(A\ I B ) - 0. 

Taking the associated long exact sequence of homology groups of these com- 
plexes yields the sequence of the proposition. (The functorality is left to 
the readers.) 

If T = Hom, then the exact sequence looks like 

0 -> Hom(/l", B) -► Hom(/l, B) -> Hom(/l', B) -> 

- Ext 1 ^", B) - Ext 1 ^, B) - Ext 1 ^', B) - 


and so forth. 

We shall say that Ct has enough T -exacts if given an object A in Gfc there is a 
T -exact M and an epimorphism 


M - A - 0. 

Proposition 8.4. Let T satisfy HOM 1, HOM 2, HOM 3. Assume that CB 
has enough injectives. Fix BeC B. Then the association 

A y->R n T(A, B) 

is a contravariant ò-functor on d which vanishes on T-exacts, for n ^ 1. // 
Ot has enough T-exacts , then this functor is universal , coerasable by T-exacts , 
with value 


R°T(/1, B) = T(A , £). 

Pr<?o/ By Lemma 8.3 we know that the association is a <5-functor, and it 
vanishes on T-exacts by Lemma 8.1. The last statement is then merely an 
application of the universality of erasable á-functors. 

Corollary 8.5. Let Ct = (B be the category of modules over a ring. Forfixed 
£, let ext n (A, B) be the left derived functor of A\-+ Hom(/l, B), obtained by 
means of projective resolutions of A. Then 

ext"(/l, B) = Ext n (/1, B). 

Proof. Immediate from Proposition 8.4. 

The following proposition characterizes T -exacts cohomologically. 
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Proposition 8.6. Let T be a bifunctor satisfying HOM 1, HOM 2, HOM 3. 
Assume that © has enough injectives. Then the follomng conditions are 
equivalent: 

TEl. AisT-exact. 

TE 2. For every B and every integer n ^ 1, we have R n T(A , B) = 0. 

TE 3. For every B we have R 1 T(/4, £) = 0. 

Proof Let 

0 - B -/ 0 -/ 1 - 

be an injective resolution of B. By definition, R n T(A , B) is the n- th homology of 
the sequence 

0 - T(A, /°) - T(A, 7 1 ) - T(A, I 2 ) - 

If A is T-exact, then this sequence is exact for n ^ 1, so the homology is 0 and 
TE 1 implies TE 2. Trivially, TE 2 implies TE 3. Finally assume TE 3. Given 
an exact sequence 


0 - B' - B - B" - 0, 

we have the homology sequence 

0 - T(A, B') - T(A, B) - T(A, B") - R 1 T(>1, B') 

If R'T(A, B') = 0, then by defìnition A is T-exact, thus proving the proposition. 
We shall say that an object A has T -dimension ^ d if 

R n T(A , B) = 0 for n > d and all B. 

Then the proposition states in particular that A is T-exact if and only if A has 
T-dimension 0. 

Proposition 8.7. Let T satisfy HOM 1, HOM 2, HOM 3. Assume that © 
has enough injectives. Suppose that an object A admits a resolution 

0 —► Erf —► Erf _ i Eq —► A —► 0 

where E 0 , ... 9 E d are T-exact. Then A has T-dimension ^ d. Assume this 
is the case. Let 

0 —► Q —► L d _ j —► • • • —► L 0 —► A —► 0 

be a resolution where L 0 ,..., L d _ t are T-exact. Then Q is T-exact also. 

Proof By dimension shifting we conclude that Q has T-dimension 0, 
whence Q is T-exact by Proposition 8.6. 
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Proposition 8.7, like others, is used in the context of modules over a ring. 
In that case, we can take T = Hom, and 

R n T(A , B) = Ext n (y4, B). 

For A to have 7-dimension ^ d means that 

Ext n (A, B) = 0 for n > d and all B. 

Instead of 7-exact, one can then read projective in the proposition. 

Let us formulate the analogous result for a bifunctor that will apply to the 
tensor product. Consider the following properties. 

TEN1. T is covariant and right exact in the first variable. 

TEN 2. T is covariant and right exact in the second variable. 

TEN 3. For any projective object P the functor 

A T(A, P) 

is exact. 

As for Hom, there is a possible fourth property which will play no role in this 
section: 

TEN 4. For any projective object Q the functor 

T(Q, B) 

is exact. 

Proposition 8.2'. Let T be a bifunctor satisfying TEN 1, TEN 2, TEN 3. 
Assume that (B has enough projectives. Let AeQ. Let 

M A -► A — ► 0 

be a T-exact resolution ofA. Then the txvo ò-functors 

B\-^L n T(A, B) and B^H n (T(M,B)) 

are isomorphic as universal ô-functors vanishing on projectives , and such that 

L 0 T(A, B) = H 0 (T(M\ B) = T(A, B). 

Lemma 8.3'. Assume that T satisfies TEN 1, TEN 2, TEN 3. Assume that 
(B has enough projectives. Let 


0 - A' - A - A" - 0 



XX, §8 


BIFUNCTORS 813 


be a short exact sequence. Then for fíxed B, we have a long exact sequence: 

- Lj T(A\ B) -> Lj T(A, B) - L^ T(A\ B) - 

- T(/4', B) - T(/4, B) - T(A", B) -> 0 
which makes the association A\—>L n T(A, £) a ô-functor. 

Proposition 8.4'. Leí T satisfy TEN 1, TEN 2, TEN 3. Assume that (B has 
enough projectives. Fix Be(R. Then the association 

A^L n T(A,B) 

is a contravariant ô-functor on Q. which vanishes on T-exactsfor n ^ 1. Ifd 
has enough T-exacts, then this functor is universal , coerasable by T-exacts, 
vvií/i í/te i;a/we 


L 0 T(A,B)= T(A, B). 

Corollary 8.8. // there is a bifunctorial isomorphism T(A, B) % T(£, A\ 

and if B is T-exact , í/te/î for all A, L n T(A , Z?) =0 for n ^ 1. In short, 
T-exact implies acyclic. 

Proof Let = P A be a projective resolution in Proposition 8.2'. By 
hypotheses, X v-+T(X,B) is exact so H n (T(P,B)) - 0 for n^ 1; so the 
corollary is a consequence of the proposition. 

The above corollary is formulated so as to apply to the tensor product. 

Proposition 8.6 . Let T be a bifunctor satisfying TEN 1, TEN 2, TEN 3. 
Assume that (B has enough projectives. Then the following conditions are 
equivalent: 

TEl. AisT-exact. 

TE 2. For every B and every integer n ^ 1 we have L n T(A, B) = 0. 

TE3. For every B, we have L X T(A, B) = 0. 

Proof We repeat the proof of 8.6 so the reader can see the arrows pointing 
in different ways. 

Let 


-► Qi —► Qo —*► B —>► 0 

be a projective resolution of B. By definition, L n T(A, B) is the n-th homology 
of the sequence 


T(A, Q x ) - T(A, Q 0 ) - 0. 
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If A is T-exact, then this sequence is exact for n ^ 1, so the homology is 0, and 
TE 1 implies TE 2. Trivially, TE 2 implies TE 3. Finally, assume TE 3. Given 
an exact sequence 

0 -► B - B -> B" - 0 
we have the homology sequence 

-> L, T(A , B") T(A , B') T(i4, B) - T(/4, B") 0. 

If L^T^A, B") is 0, then by definition, >4 is T-exact, thus proving the proposition. 


§9. SPECTRAL SEQUENCES 

This section is included for convenience of reference, and has two purposes: 
first, to draw attention to an algebraic gadget which has wide applications in 
topology, differential geometry, and algebraic geometry, see Griffiths-Harris, 
[GrH 78]; second, to show that the basic description of this gadget in the context 
in which it occurs most frequently can be done in just a few pages. 

In the applications mentioned above, one deals with a filtered complex 
(which we shall define later), and a complex may be viewed as a graded object, 
with a differential d of degree 1. To simplify the notation at first, we shall deal 
with filtered objects and omit the grading index from the notation. This index 
is irrelevant for the construction of the spectral sequence, for which we follow 
Godement. 

So let F be an object with a diíferential (i.e. endomorphism) d such that 
d 2 = 0. We assume that F isfiltered, that is that we have a sequence 

F = F°dF 1 dF 2 d..oF"dF' ,+ 1 = {0}, 

and that dF p cz F p . This data is called afiltered differential object. (Weassume 
that the filtration ends with 0 after a finite number of steps for convenience.) 

One defines the associated graded object 

Gr F = 0 Gr* F where Gv p F = F p /F p+ K 

pì o 

In fact, Gr F is a complex, with a differential of degree 0 induced by d itself, and 
we have the homology H(Gv p F). 

The filtration {F p } also induces a filtration on the homology H(F 9 d) = H(F ); 
namely we let 


H(F) P = image of H(F P ) in H(F). 
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Since d maps F p into itself, H(F P ) is the homology of F p with respect to the 
restriction of d to F p , and it has a natural image in H(F ) which yields this filtra- 
tion. In particular, we then obtain a graded object associated with the filtered 
homology, namely 


Gr H(F) = @ Gv p H(F). 

A spectral sequence is a sequence {E rì d r } (r ^ 0) of graded objects 

Er = © E’ 

p'ào 

together with homomorphisms (also calleddifferentials) of degree r, 

d r :E p ^E p+r 

satisfying d r = 0, and such that the homology of E r is E r + l9 that is 

H(E r ) = E r+Í . 

In practice, one usually has E r = E r+1 = • • • for r ^ r 0 . This limit object is 
called E^, and one says that the spectral sequence abuts to £*>. Actually, to be 
perfectly strict, instead of equalities one should really be given isomorphisms, 
but for simplicity, we use equalities. 

Proposition 9.1. Let F be a filtered differential object. Then there exists a 
spectral sequence {E r } with: 

E p = F p /F p+ 1 ; £? = H(Gr p F); E' = Gr p H(F). 

Proof. Define 


Z p = {x g F p such that dx e F p+r } 
E p = Z p /ldZ p :[ r ~ l) + Z p+ H 


The definition of E p makes sense, since Z p is immediately verified to contain 
dz p :[ r ~ l) 4- Z p+ f Furthermore, d maps Z p into Z p+r , and hence includes a 
homomorphism 


d r :E p ^E p+r . 

We shall now compute the homology and show that it is what we want. 

First, for the cycles : An element xe Z p represents a cycle of degree p in E r 
if and only if dx e dZ p r +\ + Z p f[ +X , in other words 

with y e Z p+ / and z £ Z p+ [ +1 . 


dx = dy + z, 
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Write x = y + u, so du = z. Then usF p and du e F p+r+ \ that is u e P It 
follows that 


p-cycles of E r = (Z p +1 + Z p+ ì)/(dZ p I[ + 1 + Z?_ + /). 


On the other hand, the p-boundaries in E r are represented by elements of 
dZ p ~ r , which contains dZí , r 1 r+ l . Hence 


p-boundaries of E r = (dZ p ~ r + Z p+ })/(dZ p I[ +ì + Z p l\). 

Therefore 

H p (E r ) = (zf +1 + z^Dndzr' + zri) 

= Z p r+ J{Z p r+x n (ázr + Z? +1 )). 

Since 

Z? + ! z> áZf' r and Z? + j n Z r p+ / = Z p+X , 

it follows that 


H p (E r ) = Z p r+í /(dZ p ~ r + Zr 1 ) = £? + i, 
thus proving the property of a spectral sequence. 

Remarks. It is sometimes useful in applications to note the relation 

dZ p l[ r ~ 1] + Z p+ \ = Z p n (dF p ~ r+ 1 + F p+1 ). 

The verification is immediate, but Griffiths-Harris use the expression on the 
right in defining the spectral sequence, whereas Godement uses the expression 
on the left as we have done above. Thus the spectral sequence may also be 
defined by 


E p = Z p mod (dF p ~ r+1 + F p+ x ). 


This is to be interpreted in the sense that Z mod S means 

(Z + S)/S or Z/(Z n S). 

The term is F p /F p+l immediately from the definitions, and by the 
general property already proved, we get E{ = H(F P /F P+1 ). As to ££,, for 
r large we have Z p = Z p = cycles in F p , and 


£ p = Z p /(Z p+1 + (dF° n F p )) 
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which is independent of r, and is precisely Gr p H(F), namely the p-graded 
component of H(F), thus proving the theorem. 

The differential d x can be specified as follows. 

Proposition 9.2. The homomorphism 

d x : E p x -► F? +1 

is the coboundary operator arising from the exact sequence 
0 -► fp +1 /fp +1 -► fp/fp +2 _► fp/fp + 1 _► o 

viexving each term as a complex mth differential induced by d. 

Proof Indeed, the coboundary 

Ò :E\ = H(F p /F p+l )-> H(F p+1 /F p+2 ) = F? +1 

is defined on a representative cycle z by dz, which is the same way that we de- 
fined d v 

In most applications, the filtered differential object is itself graded, because 
it arises from the following situation. Let K be a complex, K = (K p 9 d) with 
p ^ 0 and d of degree 1. By a fìltration FK , also called a filtered complex, we 
mean a decreasing sequence of subcomplexes 


K = F°K => F X K =) F 2 K =) • •=) F n K => F n+ l K = {0}. 


Observe that a short exact sequence of complexes 

0 - K' - K - K" - 0 

gives rise to a filtration K => K' => {0}, viewing K' as a subcomplex. 
To each filtered complex FK we associated the complex 

Gr FK = Gr K = 0 Gr p X, 

p^O 


where 


Gr p K = F p K/F p+í K , 

and the differential is the obvious one. The filtration F P K on K also induces a 
filtration F P H(K) on the cohomology, by 


F p H q (K) = F p Z q /F p B q . 
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The associated graded homology is 

Gr H(K) = 0 Gr' H\K\ 

where 

Gr p H q (K) = F p H q (K)/F p+ l H q (K). 


A spectral sequence is a sequence {E r , d r } (r ^ 0) of bigraded objects 

Er = © Er 

together with homomorphisms (called differentials) 

d r : E p ' q - E p+r ' q ~ r+ 1 satisfying d 2 r = 0, 
and such that the homology of E r is E r+l , that is 

H(E r ) = E r+Í . 

A spectral sequence is usually represented by the following picture: 



-r + 1) 


In practice, one usually has E r = E r+l = •• • for r ^ r 0 . This limit object 
is called and one says that the spectral sequence abuts to E^. 


Proposition 9.3. Let FK be a filtered complex. Then there exists a spectral 
sequence {E r } xvith: 

J7P,q _ ppfcP + qjFP+ lf£P + q. 

E p ' q = H p+q (Gv p K); 

E p ^ q = Gv p (H p+q (K)). 

The last relation is usually xvritten 

E r =>H(K\ 

and we say that the spectral sequence abuts to H(K). 
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The statement of Proposition 9.3 is merely a special case of Proposition 9.1, 
taking into account the extra graduation. 

One of the main examples is the spectral sequence associated with a double 
complex 

K = © K p ' q 

p,q^O 

which is a bigraded object, together with differentials 

d' :K p - q ^ K p+Uq and d": K p - q -> K p - q+1 


satisfying 

d’ 1 = d" 2 = 0 and d'd" + d"d' = 0. 

We denote the double complex by (K, d', d"). The associated single complex 
(Tot(íC), D) (Tot for total complex), abbreviated K*, is defined by 

K" = 0 K p - q and D = d' + d". 

p + q = n 

There are two filtrations on (/£*, D ) given by 

'F p K n = © K p, ' q 

p' + q = n 
P'^P 

"F q K" = 0 K p q ". 

p + q'=n 

q'^q 

There are two spectral sequences {'£,,} and {" E r }, both abutting to H(Tot(K)). 
For applications, see [GrH 78], Chapter 3, §5; and also, for instance, [FuL 85], 
Chapter V. There are many situations when dealing with a double complex directly 
is a useful substitute for using spectral sequences, which are derived from double 
complexes anyhow. 

We shall now derive the existence of a spectral sequence in one of the most 
important cases, the Grothendieck spectral sequence associated with the com- 
posite of two functors. We assume that our abelian category has enough injectives. 

Let C = © C p be a complex, and suppose C p = 0 if p < 0 for simplicity. 
We define injective resolution of C to be a resolution 

0-► C7 1 -► / 2 -► • • • 

written briefly 

0 - C - / c 

such that each I j is a complex, I j = © I j * p , with differentials 

p : jj< p jj> p +1 
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and such that I j ' p is an injective object. Then in particular, for each p we get 
an injective resolution of C p , namely: 

0^> C p ^> I°' p ^> I Up ^> ••• 

We let: 

Z j ' p = Ker d j,p = cycles in degree p 
B j p = Im d j,p ~ 1 = boundaries in degree p 
H j p = Z i,p /B j,p = homology in degree p. 

We then get complexes 

0 -► Z P (C) -> Z°' p -> z ì,p -> 

0 B P (C)^> B 0 p ^> B l p ^> 

0 -// p (C)-//°’ p -/í lp - 

We say that the resolution 0 -► C -► / c is fully injective if these three com- 
plexes are injective resolutions of Z P (C), B P (C) and H P (C) respectively. 

Lemma 9.4. Let 

0 -► M’ - M - M" - 0 
be a short exact sequence. Let 

0 -► M' l M and 0 -> M" I M > 

be injective resolutions of M’ and M”. Then there exists an injective resolution 

0 —> M —> I M 

of M and morphisms which make the following diagram exact and commutative: 
0 > l M > > I M > I M » > 0 

0- > M' - >M - >M" - 

0 0 0 

Proof. The proof is the same as at the beginning of the proof of Theorem 

6 . 1 . 
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Lemma 9.5. Given a complex C there exists a jully injective resolution of C. 

Proof We insert the kernels and cokernels in C, giving rise to the short 
exact sequences with boundaries B p and cycles Z p \ 

0 -> B p - Z p - H p - 0 

0 -► Z p_ 1 -► C p ~ 1 -►£*-► 0. 

We proceed inductively. We start with an injective resolution of 
0 -► Z p ~ 1 -► C p ~ 1 -► -► 0 

using Lemma 9.4. Next let 

0 - H p - I HP 

be an injective resolution of H p . By Lemma 9.4 there exists an injective resolu- 
tion 

0 -> Z' - I ZP 

which fits in the middle of the injective resolutions we already have for B p and 
H p . This establishes the inductive step, and concludes the proof. 

Given a left exact functor G on an abelian category with enough injectives, 
we say that an object X is G-acyclic if R P G(X) = 0 for p ^ 1. Of course, 

R°G(X) = G(X). 

Theorem 9.6. (Grothendieck spectral sequence). Let 

T.G-^CB and G\C B-► e 

be covariant left exact functors such that if I is injective in â, then T(I) is 
G-acyclic. Then for each A in (ì there is a spectral sequence {E r (A)}> such that 

E p 2 ' q (A) = R p G(R q T(A)) 

and E p q abuts (with respect to p) to R p+q (GT)(A\ xvhere q is the grading 
index. 

Proof. Let A be an object of G, and let 0 -► A -► C A be an injective resolu- 
tion. We apply T to get a complex 

TC: 0 -► TC° -► TC 1 -► TC 2 -► 

By Lemma 9.5 there exists a fully injective resolution 

0 —► TC —► / 7(7 

which has the 2 -dimensional representation: 
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o - >/ 0 ’ 1 -► z 1 ' 1 -* / 2 ' 1 

q _ ^ jO , 0 _^ j 1,0 _ j2 ,0 

0-» TC° -► TC l -> TC 2 

0 0 0 


Then GI is a double complex. Let Tot(G/) be the associated single complex. 
We now consider each of the two possible spectral sequences in succession, 
which we denote by l E*' q and 2 E? ,q . 

The first one is the easiest. For fixed p , we have an injective resolution 

0 - TC P - /ç. c 

where we write /f c instead of I TCP . This is the p -th column in the diagram. By 
definition of derived functors, GI P is a complex whose homology is R q G , in 
other words, taking homology with respect to d" we have 

"H p,q (GI) = H q (GI p ) = (R«G)(TC p ). 

By hypothesis, C p injective implies that (R q G)(TC p ) = 0 for q > 0. Since G 
is left exact, we have R°G(TC P ) = TC P . Hence we get 


"H p ' q (Gl) 


GT(C P ) if <7 = 0 
0 if 47 > 0 / 


Hence the non-zero terms are on the p- axis, which looks like 
0 - GT(C°) - GT(C l ) - GT(C 2 ) - 


Taking 'H p we get 

1 E P*(A) = F (G7 * A ) if « = 0 
2 lo if <7 > 0 


This yields 


H"(Tot(G/)) % R n (GT)(A). 
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The second one will use the full strength of Lemma 9.5, which had not been 
used in the first part of the proof, so it is now important that the resolution 
I TC is fully injective. We therefore have injective resolutions 

0 -► Z P (TC ) -> l Z°' p -► l Z Up -► l Z 2 ' p -► 

0 -> B P (TC) -► l B 0,p -► l B Up -► l B 2 ' p -► 

0 - H P (TC) - l H°' p - l H Up - l H 2 p - 

and the exact sequences 

0 -► x Z q ' p -► I q ' p -► l B q+ í,p -► 0 
0 _► 1 B q p -► l Z q ' p -► l H q p -► 0 

split because of the injectivity of the terms. We denote by / íp) the p-th row of the 
double complex / = {I q,p }. Then we find: 

f H q p (GI) = H q (GI ip) ) = G l Z q ' p /G l B q p by the first split sequence 

= G’ H q ' p (I) by the second split sequence 

because applying the functor G to a split exact sequence yields a split exact 
sequence. 

Then 


2 E p « = "H p ('H qp (GI) = H p (G x H q ' p (l)). 

By the full injectivity of the resolutions, the complex ’H q p (I) with p ^ 0 is an 
injective resolution of 

H q (TC) = (R q T)(A). 


Furthermore, we have 


H p (G'H qp ) = R p G(R q T(A)), 

since a derived functor is the homology of an injective resolution. This proves 
that (R p G)R q T(A)) abuts to R n (GT)(A\ and concludes the proof of the theorem. 

Just to see the spectral sequence at work, we give one application relating 
it to the Euler characteristic discussed in §3. 

Let (î have enough injectives, and let 

T : a - ffi 

be a covariant left exact functor. Let 5a be a family of objects in (î giving rise 
to a K-group. More precisely, in a short exact sequence in Ct, if two of the objects 
lie in then so does the third. We also assume that the objects of have 
finite /^T-dimension, which means by definition that if A e 5a then R l T(A) = 0 
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for all i sufficiently large. We could take in fact to be the family of all objects 
in (î which have finite RT-dimension. 

We define the Euler characteristic associated with T on K(g a ) to be 

Xt(A)= £ (-iyd(R‘T(A)). 

i = 0 

The cl denotes the class in the K-group K(5 3 ) associated with some family 
5« of objects in (B, and such that R'T(A) e g® for all A e g a . This is the mini- 
mum required for the formula to make sense. 

Lemma 9.7. The map % T extends to a homomorphism 

K(ga)-K(gJ. 


Proof. Let 


0 - A' - A - A" - 0 

be an exact sequence in g. Then we have the cohomology sequence 

- R i T(A') - R l T(A) - R i T(A' f ) - R i+1 T(A') - 

in which all but a finite number of terms are 0. Taking the alternating sum in the 
K-group shows that % T is an Euler-Poincaré map, and concludes the proof. 

Note that we have merely repeated something from §3, in a jazzed up context. 
In the next theorem, we have another functor 

G:(B - C, 

and we also have a family g e giving rise to a K-group K(5 C )- We suppose that 
we can perform the above procedure at each step, and also need some condition 
so that we can apply the spectral sequence. So, precisely, we assume: 

CHARl. For all í, RT maps into g*, R'G maps into 5 e , and 
R l (GT) maps into g e . 

CHAR 2. Each subobject of an element of li es i n 3a an d has finite 
RT- and R(GT)-dimension; each subobject of an element of 
5« lies in g* and has finite RG-dimension. 

Theorem 9.8. Assume that T : d -> (B and G : (B -► C satisfy the conditions 
CHAR 1 and CHAR 2. Also assume that T maps injectives to G-acyclics. 
Then 


Xg° Xt — Xgt • 
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Proof. By Theorem 9.6, the Grothendieck spectral sequence of the com- 
posite functor implies the existence of a filtration 

•.. c F p R\GT\Á) C= F p+1 R n (GT)(A) c • • • 

of R n (GT)(A\ such that 


F p+1 /F p » 


p 


Then 


Zor(^)= f(-l)"cl(R"(GT)U)) 

n = 0 

= l (-irZdiE^-’) 

n = 0 p = 0 

= f (-l)"cl(0 

n = 0 


On the other hand, 


Xt(A)= £(-1)«c1(K«T(,4)) 

q=0 

and so 


Xg°Xt(A)= Ì(-ÍYxc(R q T(A)) 

9 = 0 

00 00 

= I(-l) î I(-l) p cl(R'’G(R’TM)) 

q = 0 p=0 

= f (-l)"îcl(R p G(R"- p T(/l)) 

n=0 p=0 

= f (-l)"cl(ES). 

n = 0 

Since E r+l is the homology of E r , we get 


X(-l)"cl(£" 2 )= S(-l)"cl (£5) 


n = 0 


n = 0 


X(-l)" Cl(£"j. 


n = 0 


This concludes the proof of the theorem. 
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EXERCISES 

1. Prove that the example of the standard complex given in §1 is actually a complex, 
and is exact, so it gives a resolution of Z. [Hint: To show that the sequence of the 
standard complex is exact, choose an element z E S and define h : E l —> E i+ 1 by letting 

h(x 0 , ..., x t ) = (z, x 0 ,..., Xi ). 

Prove that dh 4- hd = id, and that dd = 0. Exactness follows at once.] 

Cohomology of groups 

2. Let G be a group. Use G as the set S in the standard complex. Define an action of 
G on the standard complex E by letting 

x(x 0 , ... 9 X/) = (xx 0 ,. . . , XXi). 

Prove that each E ( is a free module over the group ring Z[G]. Thus if we let 
R = Z[G] be the group ring, and consider the category Mod(G) of G-modules, then 
the standard complex gives a free resolution of Z in this category. 

3. The standard complex E was written in homogeneous form, so the boundary maps 
have a certain symmetry. There is another complex which exhibits useful features 
as follows. Let F l be the free Z[G]-module having for basis /-tuples (rather than 
(i -I- l)-tuples) (xj,. .., Xi). For i = 0 we take F 0 = Z[G] itself. Define the boundary 
operator by the formula 


i-i 

d(x u ...,Xi) =xi(x 2 ,...,x,) + l) y (jC!,..., jc y Ar y+1 ,..., AT/) 

7=1 

+ (— 1) , " hl (ATi, . . . ,Xi). 

Show that E ^ F (as complexes of G-modules) via the association 

(*!, . . . , Xi) l-> (1, x x , x x x 2t . . . , x x x 2 • * • Xi), 

and that the operator d given for F corresponds to the operator d given for E under 
this isomorphism. 

4. If A is a G-module, let A G be the submodule consisting of all elements v E A such 
that xv = v for all x E G. Thus A G has trivial G-action. (This notation is convenient, 
but is not the same as for the induced module of Chapter XVIII.) 

(a) Show that if H q (G, A) denotes the g-th homology of the complex 
HomçíE, A), then H°(G, A) = A G . Thus the left derived functors of A h-> A G 
are the homology groups of the complex HomcíE, A), or for that matter, 
of the complex Hom(F, A), where F is as in Exercise 3. 

(b) Show that the group of 1-cycles Z l (G , A) consists of those functions 
/: G —> A satisfying 

f(x) + xf(y) = f(xy) for all x,y E G. 

Show that the subgroup of coboundaries B X (G,A) consists of those functions 
/for which there exists an element a E A such that f(x) = xa - a. The factor 
group is then H l (G,A). See Chapter VI, §10 for the determination of a special 
case. 
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(c) Shovv that the group of 2-cocycles Z 2 (G, A) consists of those functions 
/: G —> A satisfying 

xf(y, z) ~ f(xy, z) 4- f(x , yz) - f(x , y) = 0. 

Such 2-cocycles are also called factor sets, and they can be used to describe 
isomorphism classes of group extensions, as follows. 

5. Group extensions. Let W be a group and A a normal subgroup, written multipli- 
catively. Let G = W/A be the factor group. Let F: G —> W be a choice of coset 
representatives. Define 

/(*, y) = F(x)F(y)F(xy)-'. 

(a) Prove that/is A-valued, and that/: G x G —> A is a. 2-cocycle. 

(b) Given a group G and an abelian group A, we view an extension W as an 
exact sequence 

l —> A—> W -> G —> 1. 

Show that if two such extensions are isomorphic then the 2-cocycles associated 
to these extensions as in (a) define the same class in H X (G, A). 

(c) Prove that the map which we obtained above from isomorphism classes of 
group extensions to H 2 (G , A) is a bijection. 

6. Morphisms of the cohomology functor. Let A : G' —> G be a group homomorphism. 
Then À gives rise to an exact functor 

<Í> A : Mod(G) Mod(G'), 

because every G-module can be viewed as a G'-module by defining the operation of 
cr' E G' to be cr'a = \(cr')a. Thus we obtain a cohomology functor H G ' ° <Í> A . 

Let G' be a subgroup of G. In dimension 0, we have a morphism of functors 

À* : H% —> H%- ° <I> A given by the inclusion A Gci > A G ' = Í> a (A) g '. 

(a) Show that there is a unique morphism of 5-functors 

À* : H g —> H G ' o <D a 

which has the above effect on H%. We have the following important special 
cases. 

Restriction Let H be a subgroup of G. Let A be a G-module. A function 
from G into A restricts to a function from H into A. In this way, we get a 
natural homomorphism called the restriction 

res: H q (G, A) W(H, A). 

Inflation Suppose that H is normal in G. Let A H be the subgroup of A 
consisting of those elements fixed by H. Then it is immediately verified that 
A H is stable under G, and so is a G/H- module. The inclusion A H< ^> A induces 
a homomorphism 

H%(u) = u q : H q (G, A H ) H q (A). 

Defìne the inflation 


inf g/// • H q (G/H, A H ) H q (G, A) 
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as the composite of the functorial morphism H q {G/H, A H ) —> H q {G, A H ) 
followed by the induced homomorphism u q — H^{u) as above. 

In dimension 0, the inflation gives the identity {A H ) G/H = A G . 

(b) Show that the inflation can be expressed on the standard cochain complex 
by the natural map which to a function of G/H in A H associates a function 
of G into A H C A. 

(c) Prove that the following sequence is exact. 

0 -> H'(G/H, A H ) “ H'(G, A) ™ H'(H, A). 

(d) Describe how one gets an operation of G on the cohomology functor H G “by 
conjugation” and functoriality. 

(e) In (c), show that the image of restriction on the right actually lies in 
H l {H, A) G (the fixed subgroup under G). 

Remark. There is an analogous result for higher cohomology groups, 
whose proof needs a spectral sequence of Hochschild-Serre. See [La 96], 
Chapter VI, §2, Theorem 2. It is actually this version for H 2 which is applied 
to H 2 {G, K*), when K is a Galois extension, and is used in class field theory 
[ArT 67]. 

7. Let G be a group, B an abelian group and M G {B) = M{G, B) the set of mappings 

from G into B. For x E G and / C M{G, B) define (M/)(y) = f{yx). 

(a) Show that B h-> M G {B) is a covariant, additive, exact functor from Mod(Z) 
(category of abelian groups) into Mod(G). 

(b) Let G' be a subgroup of G and G = UjtyG' a coset decomposition. For 

/ E M(G, B) let f be the function in M{G', B) such that f{y) = f{x^y). 

Show that the map 

J 

is a G'-isomorphism from M{G, B) to Yl M(G', B). 

j 

8. For each G-module A E Mod(G), define e A : A —> M(G, A) by the condition 
e A {a) = the function f a such that / z (c r) = aa for cr E G. Show that a i-> f a is a 
G-module embedding, and that the exact sequence 

0 -> A -3« M(G, A) -> X A = coker e A 0 

splits over Z. (In fact, the map/1-> f{e) splits the left side arrow.) 

9. Let B E Mod(Z). Let H q be the left derived functor of A h-> A G . 

(a) Show that H q {G, M G {B)) = 0 for all q > 0. [ Hint : use a contracting homotopy 

s:C'(G,M G (B))^C'-'(G,M c m by (sf) X2 _ ,Jx) = >Jtr (l). 

Show that / = sdf + dsf.] Thus M G erases the cohomology functor. 

(b) Also show that for all subgroups G' of G one has H q {G', M G {B)) = 0 for 

q > 0. 

10. Let G be a group and S a subgroup. Show that the bifunctors 

{A, B) h-> Hom G (A, M^{B)) and (A, B) h-> Hom 5 (A, B) 

on Mod(G) x Mod(5) with value in Mod(Z) are isomorphic. The isomorphism is 
given by the maps 

<P i-> (a l-> g a ), for <p e Hom s (A, B), where g a (a) = <p(aa), g a e M s a (B). 
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The inverse mapping is given by 

/1^/(1) with/E Hom G (i4, M S G (B)). 

Recall that M^(B) was defined in Chapter XVIII, §7 for the induced representation. 
Basically you should already know the above isomorphism. 

11. Let G be a group and S a subgroup. Show that the map 

H q (G, M$j(B)) -* H q (S, B) for B E Mod (S), 

obtained by composing the restriction res£ with the 5-homomorphism /1-> /(1), is 
an isomorphism for q > 0. [Hint : Use the uniqueness theorem for cohomology 
functors.] 

12. Let G be a group. Let e : Z[G] —> Z be the homomorphism such that n(x)x) = 
S n(x ). Let I G be its kernel. Prove that I G is an ideal of Z[G] and that there is an 
isomorphism of functors (on the category of groups) 

G/G c - l G Hl, by xG c l-> (x - 1) -f /*. 

13. Leti4 E Mod(G) and a E H l (G, A). Let beastandard 1-cocycle representing 

a. Show that there exists a G-homomorphism f :I G -=>A such that f(x - 1) = a(x), 
so/E (Hom(/ G , A)) G . Show that the sequence 

0-^A = Hom(Z, A) -* Hom(Z[G], A) -> Hom(/ G , A) 0 

is exact, and that if 8 is the coboundary for the cohomology sequence, then 
8(f) = -a. 

Finite groups 

We now turn to the case of finite groups G. For such groups and a G-module A we 
have the trace 

T g : A —> A defined by T G (a) = X cra. 

ffeC 

We define a module A to be G-regular if there exists a Z-endomorphism u : A —» A such 
that id^ = T g (u). Recall that the operation of G on End(A) is given by 

W\f(a) = a-f(a--'a) for a 6 G. 

14. (a) Show that a projective object in Mod(G) is G-regular. 

(b) Let R be a commutative ring and let A be in Mod^(G) (the category of (G, R)- 
modules). Show that A is /?[G]-projective if and only if A is /?-projective and 
/?[G]-regular, meaning that id^ = T G (u) for some /?-homomorphism u : A —> A. 

15. Consider the exact sequences: 

(1) 0-> / G -> Z[G] -4 Z-> 0 

(2) 0 -* Z 4 Z[G] -> / G 0 

where the first one defines I G , and the second is defined by the embedding 
e' : Z —> Z[G] such that e'(n) = w(2 cr), 
i.e. on the “diagonal”. The cokernel of e' is J G by definition. 

(a) Prove that both sequences (1) and (2) split in Mod(G). 
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(b) Define M' G {A) = Z [G] (tensor product over Z) for A E Mod(G). Shovv 

that M' g {A) is G-regular, and that one gets exact sequences (l^) and (2^) by 
tensoring (1) and (2) vvith A. As a result one gets an embedding 

e' A = e' 0 id : A = Z 0 A —> Z[G] ® A. 

16. Cyclic groups. Let G be a finite cyclic group of order n. Let <r be a generator of G. 
Let K‘ = Z[G] for / > 0. Let e : K° —» Z be the augmentation as before. For i odd 
^ 1, let d‘ : K‘ —> /T' -1 be multiplication by 1 - cr. For i even ^ 2, let d‘ be 
multiplication by 1 + cr + • • ■ + cr” -1 . Prove that K is a resolution of Z. Conclude 
that: 

For i odd: H‘{G, A) = A G /T G A vvhere 7’ G :tfl-»(l + <x+-** + a n ~ l )a ; 

For i even ^ 2: //'(G, A) = A T /{ 1 - <j)A, vvhere A T is the kernel of T G in A. 

17. Let G be a finite group. Shovv that there exists a ô-functor H from Mod(G) to 
Mod (Z) such that: 

(1) H° is (isomorphic to) the functor A h-> A G /T G A. 

(2) H^(A) = 0 if A is injective and q > 0, and H^(A) = 0 if A is projective and q 
is arbitrary. 

(3) H is erased by G-regular modules. In particular, H is erased by M G . 

The ô-functor of Exercise 17 is called the special cohomology functor. It differs 
from the other one only in dimension 0. 

18. Let H = H g be the special cohomology functor for a finite group G. Shovv that: 

H°(/ g ) = 0; H°(Z) ~ H‘(/) ~ Tj/nL vvhere n = #(G); 

H°(<2/Z) = H \Z) = H 2 (/) = 0 

H 1 {Q/Z) - H 2 (Z) « H 3 (/) ~ G A = Hom(G, Q/Z) by definition. 


Injectives 

19. (a) Shovv that ìf an abelian group T is injective in the category of abelian groups, then 

ìt is divisible. 

(b) Let A bea principal entire ring. Defìne the notionof divisibility by elements of A for 
modules in a manner analogous to that for abelian groups. Shovv that an A- 
module is injective ìf and only if ìt is A-divisible. [The proof for Z should vvork 
in exactly the same vvay.] 

20. Let S be a multiplicative subset of the commutative Noetherian ring A. If / is an 
injective A-module, shovv that S~ l I is an injective S~ 1 A-module. 

21. (a) Shovv that a direct sum of projective modules ìs projective. 

(b) Shovv that a direct product of ìnjective modules ìs mjective. 

22. Shovv that a factor module, direct summand, direct product, and direct sum of divisible 
modules are divisible. 

23. Let Q be a module over a commutative ring A. Assume that for every left ideal J of 
A, every homomorphism <p : J —> Q can be extended to a homomorphism of A into 
Q. Shovv that Q is injective. [Hint : Given M' C M and / : M' — > Q, let x 0 E M 
and x 0 £ M'. Let J be the left ideal of elements a E A such that ax 0 E M'. Let 
<p(à) = f{ax 0 ) and extend <p to A, as can be done by hypothesis. Then shovv that 
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one can extend / to M by the formula 

/(■*' + bx 0 ) =f(x') -f <p(b ), 

forx' E M and b E A. Then use Zom’s lemma. This is the same pattem of proof as 
the proof of Lemma 4.2.] 

24. Let 

0 —► / ^ —► 1 2 —► / 3 —► 0 

be an exact sequence of modules. Assume that / l5 / 2 are injective. 

(a) Show that the sequence splits. 

(b) Show that / 3 is injective. 

(c) If / is injective and / = M © N, show that M is injective. 

25. (Do this exercise after you have read about Noetherian rings.) Let A be a Noetherian 
commutative ring, and let Q be an injective 4-module. Let a be an ìdeal of A, and let 
Q (a) be the subset of elements xsQ such that a”x = 0 for some n , depending on x. 
Show that Q (a) is injective. [Hint: Use Exercise 23.] 

26. Let A be a commutative ring. Let E be an A-module, and let E A = Hom z (£, Q/Z) 
be the dual module. Prove the following statements. 

(a) A sequence 

O-N-M-E-O 
is exact if and only if the dual sequence 

0 —> E A —> M A —> N A —> 0 

is exact. 

(b) Let F be flat and I injective in the category of A-modules. Show that 
Hom a (F, I) is injective. 

(c) E is flat if and only if E A is injective. 

27. Extensions of modules. Let M, N be modules over a ring. By an extension of M 
by N we mean an exact sequence 

(*) 0-*N-*E—>M—* 0. 

We shall now define a map from such extensions to Ext‘(M, N). Let P be projective, 
with a surjective homomorphism onto M, so we get an exact sequence 

(**) 

where K is defined to be the kemel. Since P is projective, there exists a homomorphism 
u : P —> £, and depending on u a unique homomorphism v: K —> N making the 
diagram commutative: 

0-> K -> P —> M -►O 

v u id 

0-* N -» E -*• M -*0 
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On the other hand, we have the exact sequence 

(***) o -* Hom(M, N) -> Hom(P, N) -> Hom (K, N) -> Ext \M, N)-*0, 

with the last term on the right being equal to 0 because Ext‘(P, N) = 0. To the 
extension (*) we associate the image of v in Ext l (M, N). 

Prove that this association is a bijection between isomorphism classes of extensions 
(i.e. isomorphism classes of exact sequences as in (*)), and Ext*(M, N). [Hint: 
Construct an inverse as follows. Given an element e of Ext*(M, N), using an exact 
sequence (**), there is some element v E Hom(Â\ N) which maps on e in (***). Let 
E be the push-out of v and vv. In other words, let J be the submodule of N ® P 
consisting of all elements (z;(jc), ~w(x)) with x E K, and let E = (N ® P)/J. Show 
that the map y \-> (y, 0) mod J gives an injection of N into E. Show that the map 
N ® P M vanishes on J, and so gives a surjective homomorphism E —> M —> 0. 
Thus we obtain an exact sequence (*); that is, an extension of M by N. Thus to each 
element of Ext^M, N) we have associated an isomorphism class of extensions of M 
by N. Show that the maps we have defined are inverse to each other between iso- 
morphism classes of extensions and elements of Ext l (M, N).] 

28. Let R be a principal entire ring. Let a E R. For every /?-module N, prove: 

(a) Ext \R/aR, N) = N/aN. 

(b) For b E R we have Ext l (R/aR, R/bR) = R/(a, b), where (a, b) is the g.c.d 
of a and b, assuming ab ^ 0. 

Tensor product of complexes. 

29. Let K = 0X p and L = 0L fl be two complexes indexed by the integers, and with 
boundary maps lower indices by 1. Define K ® L to be the direct sum of the modules 
(K ® L) n , where 

(K®L\= ®K„®L q . 

p + q = n 

Show that there exist unique homomorphisms 

d = d n :(K®L) n -+(K®L) n _ í 


such that 


d(x ® y) = d(x) ® y + (- ì) p x ® d(y). 

Show that K® L with these homomorphisms ìs a complex, that is d ° d = 0. 

30. Let K, L be double complexes. We write K t and L, for the ordinary column complexes 
of K and L respectively. Let <p: K —> L be a homomorphism of double complexes. 
Assume that each homomorphism 

9Ï- -» U 

is a homology isomorphism. 

(a) Prove that Tot ((p): Tot(tf) —> Tot (L) is a homology isomorphism. (If you 
want to see this worked out, cf. [FuL 85], Chapter V, Lemma 5.4.) 

(b) Prove Theorem 9.8 using (a) instead of spectral sequences. 
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CHAPTER XXI _ 

Finite Free Resolutions 


This chapter puts together specific computations of complexes and homology. 
Partly these provide examples for the general theory of Chapter XX, and partly 
they provide concrete results which have occupied algebraists for a century. 
They have one aspect in common: the computation of homology is done by means 
of a finite free resolution, i.e. a finite complex whose modules are finite free. 

The first section shows a general technique (the mapping cylinder) whereby 
the homology arising from some complex can be computed by using another 
complex which is finite free. One application of such complexes has already 
been given in Chapter X, putting together Proposition 4.5 followed by Exercises 
10-15 of that chapter. 

Then we go to major theorems, going from Hilbert’s Syzygy theorem, from 
a century ago, to Serre’s theorem about finite free resolutions of modules over 
polynomial rings, and the Quillen-Suslin theorem. We also include a discussion 
of certain finite free resolutions obtained from the Koszul complex. These apply, 
among other things, to the Grothendieck Riemann-Roch theorem of algebraic 
geometry. 

Bibliographical references refer to the list given at the end of Chapter XX. 


§1. SPECIAL COMPLEXES 

As in the preceding chapter, we work with the category of modules over a 
ring, but the reader will notice that the arguments hold quite generally in an 
abelian category. 

In some applications one determines homology from a complex which is 
not suitable for other types of construction, like changing the base ring. In this 
section, we give a general procedure which constructs another complex with 
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better properties than the first one, while giving the same homology. For an 
application to Noetherian modules, see Exercises 12-15 of Chapter X. 

Let f: K — C be a morphism of complexes. We say that / is a homology 
isomorphism if the natural map 

H(f): H(K) — H(C) 

is an isomorphism. The definition is valid in an abelian category, but the reader 
may thinlc of modules over a ring, or abelian groups even. A family 3 of objects 
will be called suífìcient if given an object E there exists an element F in 5 and 
an epimorphism 

F - E - 0, 

and if 3 is closed under taking finite direct sums. For instance, we may use for 
3 the family of free modules. However, in important applications, we shall deal 
with finitely generated modules, in which case g might be taken as the family of 
finite free modules. These are in fact the applications I have in mind, which 
resulted in having axiomatized the situation. 

Proposition 1.1. Let C be a complex such that H P (C ) / 0 only for 
0 ^ p S n. Let 3 be a sufficient family of projectives. There exists a 
complex 

O-X 0 -^ 1 --►£"- 0 

such that: 

K p / 0 only for 0 ^ p ^ n; 

K p is in 3 for all p ^ 1; 

and there exists a homomorphism of complexes 

/: X — C 

xvhich is a homology isomorphism. 

Proof. We define f m by descending induction on m: 



We suppose that we have defined a morphism of complexes with p ^ m H- 1 
such that H p (f) is an isomorphism for p ^ m + 2, and 


fm+ 1 : Z m+ *(K) -*• H m+1 (C) 
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is an epimorphism, where Z denotes the cycles, that is Ker ô. We wish to con- 
struct K m and / m , thus propagating to the left. First let m ^ 0. Let B m+ 1 be 
the kernel of 


Ker Ô% + 1 -► H m+í (C). 

Let K' be in g with an epimorphism 

Ô' :K' -+B m+1 . 

Let K" -► H m (C) be an epimorphism with K" in g, and let 

f" : K" - Z m (C) 

be any lifting, which exists since K" is projective. Let 

K m = K' © K" 

and define ô m : K m - K m+ 1 to be ô' on K' and 0 on K". Then 

fm+\° à'(K') <= S C (C m ), 
and hence there exists f':K'-> C m such that 

°f' — fm+ i °&- 

We now define f m : K m -> C m to be f' on K' and f" on K". Then we have 
defined a morphism of complexes truncated down to m as desired. 

Finally, if m = — 1, we have constructed down to K°, ô° , and / 0 with 

K° h H°(C) -» 0 

exact. The last square looks like this, defìning K _1 = 0. 

0- >K’ 0 K" òo=ò > Ô’K’ <= A: 1 



0-»c°-* c 1 

We replace K° by K°/(Ker ô° n Ker/ 0 ). Then H°(f) becomes an isomorphism, 
thus proving the proposition. 

We want to say something more about K°. For this purpose, we define a 
new concept. Let 5 be a family of objects in the given abelian category (think 
of modules in first reading). We shall say that 5 is complete if it is sufficient, and 
for any exact sequence 

0 - F' - F - F" - 0 
with F" and F in 5 then F' is also in 5- 
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Example. In Chapter XVI, Theorem 3.4 we proved that the family of finite 
flat modules in the category of finite modules over a Noetherian ring is complete. 
Similarly, the family of flat modules in the category of modules over a ring is 
complete. We cannot get away with just projectives or free modules, because 
in the statement of the proposition, K° is not necessarily free but we want to 
include it in the family as having especially nice properties. In practice, the 
family consists of the flat modules, or finite flat modules. Cf. Chaper X, Theorem 
4.4, and Chapter XVI, Theorem 3.8. 

Proposition 1.2. Let f\K->Cbea morphism of complexes , such that K p , 
H P {C) are #0 onlyfor p = 1,..., n. Let 5 be a complete family , and assume 
that K p , C p are in g for all p , except possibly for K°. If f is a homology 
isomorphism , then K° is also in g. 

Before giving the proof, we defìne a new complex called the mapping cylinder 
of an arbitrary morphism of complexes / by letting 

M p = K p ® C p ~ l 

and defining Ò M : M p -► M p+1 by 

y) = (Sxjx - ôy). 

It is trivially verified that M is then a complex, i.e. ò ° ò = 0. If C is the com- 
plex obtained from C by shifting degrees by one (and making a sign change 
in ò c \ so C ,p = C p ~ l , then we get an exact sequence of complexes 

O-C'-M-X-O 

and hence the mapping cylinder exact cohomology sequence 


H P (K) - >H p+1 (C) 

II 


H p+Í (M) -> H p+Í (K) -> H p+2 (C) 

II 


H p (C) 


H P+1 (C) 


and one sees from the definitions that the cohomology maps 
H p (K)^ H p+1 (C')k H p (C) 
are the ones induced by f:K->C. 

We now return to the assumptions of Proposition 1.2, so that these maps are 
isomorphisms. We conclude that H(M) = 0. This implies that the sequence 

()-► K° -+ M 1 -► M 2 -►- >M n+1 -> 0 

is exact. Now each M p is in g by assumption. Inserting the kernels and 
cokernels at each step and using induction together with the definition of a 
complete family, we conclude that K° is in g, as was to be shown. 
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In the next proposition, we have axiomatized the situation so that it is 
applicable to the tensor product, discussed later, and to the case when the family 
5 consists of flat modules, as defined in Chapter XVI. No knowledge of this 
chapter is needed here, however, since the axiomatization uses just the general 
language of functors and exactness. 

Let 5 be a complete family again, and let T be a covariant additive functor 
on the given category. We say that 5 is exact for T if given an exact sequence 

0 - F - F - F* - 0 


in 5, then 

0 - T(F') - T(F) - T(F") - 0 

is exact. 


Proposition 1.3. Let 5 be a complete family which is exact for T. Let 
f :K -> C be a morphism of complexes , such that K p and C p are in g for all 
p, and K p , H P (C) are zero for all but a finite number of p. Assume that f is a 
homology isomorphism. Then 

T(f ): T(K) -► T(C) 

is a homology isomorphism. 

Proof. Construct the mapping cylinder M for /. As in the proof of Propo- 
sition 1.2, we get H(M) = 0 so M is exact. We then start inductively from the 
right with zeros. We let Z p be the cycles in M p and use the short exact sequences 

0 -► Z p -► M p -► Z pJr 1 -► 0 

together with the definition of a complete family to conclude that Z p is in g for 
all p. Hence the short sequences obtained by applying T are exact. But T(M) 
is the mapping cylinder of the morphism 

T(f):T(K)^T(C\ 

which is therefore an isomorphism, as one sees from the homology sequence of 
the mapping cylinder. This concludes the proof. 


§2. FINITE FREE RESOLUTIONS 

The first part of this section develops the notion of resolutions for a case 
somewhat more subtle than projective resolutions, and gives a good example for 
the considerations of Chapter XX. Northcott in [No 76] pointed out that minor 
adjustments of standard proofs also applied to the non-Noetherian rings, only 
occasionally slightly less tractable than the Noetherian ones. 
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Let Abe a ring. A module E is called stably free if there exists a finite free 
module F such that E © F is finite free, and thus isomorphic to A (n) for some 
positive integer n. In particular, E is projective and finitely generated. 

We say that a module M has a finite free resolution if there exists a resolution 

0 —► E n Eq —► M —► 0 

such that each E { is finite free. 

Theorem 2.1. Let M be a projective module. Then M is stably free if and 
only ifM admits afinite free resolution. 

Proof. If M is stably free then it is trivial that M has a finite free resolution. 
Conversely assume the existence of the resolution with the above notation. 
We prove that M is stably free by induction on n. The assertion is obvious if 
n = 0. Assume n ^ 1. Insert the kernels and cokernels at each step, in the 
manner of dimension shifting. Say 

Mj = Ker(£ 0 -> P), 

giving rise to the exact sequence 

0 — M t — £ 0 — M — 0. 

Since M is projective, this sequence splits, and E 0 « M ® M v But M t has a 
finite free resolution of length smaller than the resolution of M, so there exists 
a finite free module F such that Mj © F is free. Since P 0 © P is also free, this 
concludes the proof of the theorem. 

A resolution 

0 - P„ --► P 0 - M — 0 

is called stably free if all the modules P, (i = 0,..., n) are stably free. 

Proposition 2.2. Let M be an A-module. Then M has afinitefree resolution 
of length n ^ 1 ifand only if M has a stably free resolution of length n. 

Proof. One direction is trivial, so we suppose given a stably free resolution 
with the above notation. Let 0 ^ i < n be some integer, and let F h F i+Í be 
finite free such that £* © F t and E i+Í © F i+Í are free. Let F = F t ® F i+Ì . 
Then we can form an exact sequence 

0 - E n --> E i+ ^ © F - Ei © F --► E 0 — M — 0 

in the obvious manner. In this way, we have changed two consecutive modules 
in the resolution to make them free. Proceeding by induction, we can then 
make E 0 , E^ free, then E x , E 2 free, and so on to conclude the proof of the 
proposition. 
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The next lemma is designed to facilitate dimension shifting. 

We say that two modules M u M 2 are stably isomorphìc if there exist finite 
free modules F u F 2 such that M x ® F t « M 2 © F 2 . 

Lemma 2.3. Let M x be stably isomorphic to M 2 . Let 

0 Nj -► -► Mj -► 0 

0 -► N 2 - E 2 - M 2 - 0 

be exact sequences, xvhere M x is stably isomorphic to M 2 , and E x , E 2 are 
stablyfree. Then N x is stably isomorphic to N 2 . 

Proof. By definition, there is an isomorphism M x ® F^ & M 2 ® F 2 . 
We have exact sequences 

0 - Nj - E x ® F x - M, ffi F, - 0 

0 - N 2 - E 2 ffi F 2 - M 2 ffi f 2 - 0 

By Schanuel’s lemma (see below) we conclude that 


N t ffi E 2 ffi F 2 « N 2 ffi E x ffiFj. 


Since F 1? £ 2 , F l5 F 2 are stably free, we can add finite free modules to each side 
so that the summands of N x and N 2 become free, and by adding 1-dimensional 
free modules if necessary, we can preserve the isomorphism, which proves that 
A/\ is stably isomorphic to iV 2 . 

We still have to take care of Schanuel’s lemma: 

Lemma 2.4. Let 


0-K->P-M-0 

O-X'-P'-M-O 

be exact sequences where P, P' are projective. Then there is an isomorphism 

K®P'kK'®P. 


Proof. Since P is projective, there exists a homomorphism P — P' making 
the right square in the following diagram commute. 


0- >K —i— P -►M-►O 


u 


w 


id 


0- >K' — J—>P' - >M --0 
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Then one can find a homomorphism K -► K' which makes the left square 
commute. Then we get an exact sequence 

0 - K - P ® - P' - 0 

by x i—► (ix, ux) for x e K and (y, z) i—► vvy — jz. We leave the verification of 
exactness to the reader. Since P' is projective, the sequence splits thus proving 
Schanuel’s lemma. This also concludes the proof of Lemma 2.3. 

The minimal length of a stably free resolution of a module is called its 
stably free dimension. To construct a stably free resolution of a finite module, 
we proceed inductively. The preceding lemmas allow us to carry out the induc- 
tion, and also to stop the construction if a module is of finite stably free dimen- 
sion. 


Theorem 2.5. Let M be a module which admits a stably free resolution of 
length n 

0 —► E n Eq —► M —► 0. 


Let 


F m ► * * • ► F o ► M ► 0 


be an exact sequence with F { stably free for i = 0,..., m. 

(i) Ifm < n — 1 then there exists a stably free F m+Í such that the exact 
sequence can be continued exactly to 

F m + 1 ► ‘ * * ► * M ► 0. 


(ii) If m = n - 1, let F n = Ker(F„_! -► F„_ 2 ). Then F n is stably free 
and thus 


0 - F„ - F„_ j --> F 0 - M - 0 


is a stably free resolution. 


Remark. If A is Noetherian then of course (i) is trivial, and we can even 
pick F m+Í to be finite free. 

Proof. Insert the kernels and cokernels in each sequence, say 

K m = Ker(£ m - E m _ j) if m # 0 
K 0 = Ker(£ 0 -► M), 

and define K' m similarly. By Lemma 2.3, K m is stably isomorphic to K m , say 

K m 0 F « K' m © F' 


with F, F' finite free. 
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If m < n — 1, then K m is a homomorphic image of E m + x ; so both K m ® F 
and K m © F' are homomorphic images of E m+Í © F. Therefore K m is a homo- 
morphic image of E m+l © F which is stably free. We let F m+Í = E m+Í (& F to 
conclude the proof in this case. 

If m = n — 1, then we can take K n = E n . Hence K m © F is stably free, and 
so is K' m © F' by the isomorphism in the first part of the proof. It follows trivially 
that K' m is stably free, and by definition, K' m = F m+Í in this case. This concludes 
the proof of the theorem. 

Corollary 2.6. /f 0 —» —» £ —» M —» 0 exact, M has stablyfree dimen- 

sion ^ n, and E is stably free, then M x has stably free dimension ^ n — 1. 

Theorem 2.7. Let 


0 —» A/' —» A/ —» A/" —» 0 

be an exact sequence. Ifany two of these modules have afinite free resolution, 
then so does the third. 

Proof. Assume M' and M have finite free resolutions. Since M is finite, it 
follows that M" is also finite. By essentially the same construction as Chapter 
XX, Lemma 3.8, we can construct an exact and commutative diagram where 
E', E, E" are stably free: 


0 0 0 


0-► A/ \ -► M j -♦ M ï-> 0 


0-> E' -> E -> E" -► 0 


0-> M' -► M-► M" -► 0 


0 0 0 

We then argue by induction on the stably free dimension of M. We see 
that Mj has stably free dimension ^ n — 1 (actually n — 1 , but we don’t care), 
and M\ has finite stably free dimension. By induction we are reduced to the 
case when M has stably free dimension 0, which means that M is stably free. 
Since by assumption there is a finite free resolution of M', it follows that M" 
also has a finite free resolution, thus concluding the proof of the first assertion. 
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Next assume that M', M" have finite free resolutions. Then M is finite. 
If both M' and M" have stably free dimension 0, then M', M" are projective 
and M « M' © M" is also stably free and we are done. We now argue by 
induction on the maximum of their stably free dimension n, and we assume 
n ^ 1. We can construct an exact and commutative diagram as in the previous 
case with E\ E , E" finite free (we leave the details to the reader). But the maxi- 
mum of the stably free dimensions of M\ and M" x is at most n — 1, and so by 
induction it follows that M x has finite stably free dimension. This concludes the 
proof of the second case. 

Observe that the third statement has been proved in Chapter XX, Lemma 3.8 
when A is Noetherian, taking for & the abelian category of finite modules, and 
for <€ the family of stably free modules. Mitchell Stokes pointed out to me that 
the statement is valid in general without Noetherian assumption, and can be 
proved as follows. We assume that M, M" have finite free resolutions. We first 
show that M' is finitely generated. Indeed, suppose first thatM is finite free. We 
have two exact sequences 

0 M' M -► M" 0 
0 -► K" F" —» M" —> 0 

where F" is finite free, and K" is finitely generated because of the assumption 
that M" has a finite free resolution. That M' is finitely generated follows from 
Schanuel’s lemma. If M is not free, one can reduce the finite generation of M' 
to the case when M is free by a pull-back, which we leave to the reader. 

Now suppose that the stably free dimension of M" is positive. We use the 
same exact commutative diagram as in the previous cases, with E', E, E" finite 
free. The stably free dimension of M'[ is one less than that of M", and we are 
done by induction. This concludes the proof of Theorem 2.7. 

This also concludes our general discussion of fìnite free resolutions. For 
more information cf. Northcott’s book on the subject. 

We now come to the second part of this section, which provides an applica- 
tion to polynomial rings. 

Theorem 2.8. Let R be a commutative Noetherian ring. Let x be a variable. 
If every finite R-module has a finitefree resolution, then everyfinite R[x]-module 
has afinite free resolution. 

In other words, in the category of finite R-modules, if every object is of 
finite stably free dimension, then the same property applies to the category of 
finite R[x]-modules. Before proving the theorem, we state the application we 
have in mind. 

Theorem 2.9. (Serre). // k is a field and x lt ..., x r independent vari- 
ables, then every finite projective module over k[x j,. . . , x r ] is stablyfree, or 
equivalently admits a finite free resolution. 
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Proof. By induction and Theorem 2.8 we conclude that every finite module 
over k[x } ,. . . , x r ] is of finite stably free dimension. (We are using Theorem 
2.1.) This concludes the proof. 

The rest of this section is devoted to the proof of Theorem 2.8. 

Let M be a finite /?[*]-module. By Chapter X, Corollary 2.8, M has a finite 
filtration 


M = M 0 d Mj =)•••=) M n = 0 

such that each factor M f /M í+1 is isomorphic to P[x]/P, for some prime P { . 
In light of Theorem 2.7, it suffices to prove the theorem in case M = R[x]/P 
where P is prime, which we now assume. In light of the exact sequence 

0 - P - P[x] - RM/P - 0. 

and Theorem 2.7, we note that M has a finite free resolution if and only if P 
does. 

Let p = P n R. Then p is prime in R. Suppose there is some M = /?[x]/P 
which does not admit a finite free resolution. Among all such M we select one for 
which the intersection p is maximal in the family of prime ideals obtained as 
above. This is possible in light of one of the basic properties characterizing 
Noetherian rings. 

Let R 0 = R/p so R 0 is entire. Let P 0 = P/pP[x]. Then we may view M 
as an P 0 [x]-module, equal to P 0 /P 0 . Let / 1 ,...,/„bea finite set of generators 
for P 0 , and let / be a polynomial of minimal degree in P 0 . Let K 0 be the 
quotient field of R 0 . By the euclidean algorithm, we can write 

fi = qif + n for i = 1,..., n 

with q i9 r, e X 0 [x] and deg r, < deg/. Let d 0 be a common denominator for 
the coefficients of all q i9 r, . Then d 0 # 0 and 

d 0 fi = Úf + r'i 

where q' ( = </ 0 q, and rj = d 0 rj lie in R 0 [x]. Since deg / is minimal in P 0 it 
follows that r\ = 0 for all i, so 


d 0 P 0 c R 0 [x]/ = (/). 

Let N 0 = P 0 /(/), so N 0 is a module over P 0 [x], and we can also view N 0 
as a module over P[x]. When so viewed, we denote N 0 by N. Let d e R be any 
element reducing to d 0 mod p. Then d $ p since d 0 # 0. The module N 0 has 
a finite filtration such that each factor module of the filtration is isomorphic to 
some P 0 [x]/e 0 where Q 0 is an associated prime of N 0 . Let Q be the inverse 
image of Q 0 in P[x]. These prime ideals Q are precisely the associated primes 
of N in P[x]. Since d 0 kills N 0 it follows that d kills N and therefore d lies in 
every associated prime of N. By the maximality property in the selection of P, 
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it follows that every one of the factor modules in the filtration of N has a finite 
free resolution, and by Theorem 2.7 it follows that N itself has a finite free 
resolution. 

Now we view R 0 [x] as an R[x]-module, via the canonical homomorphism 
«W - *oW = 

By assumption, p has a finite free resolution as R-module, say 
0 —► E n Eq —► p —► 0. 

Then we may simply form the modules E t [x] i n the obvious sense to obtain a 
finite free resolution of p[x] = pR[x]. From the exact sequence 

0 -► p/?[x] -► R[x] -► R 0 [x] -► 0 

we conclude that R 0 [x] has a finite free resolution as K[x]-module. 

Since R 0 is entire, it follows that the principal ideal (/) in R 0 [x] is /?[x]- 
isomorphic to /? 0 M> an d therefore has a finite free resolution as /?[x]-module. 
Theorem 2.7 applied to the exact sequence of /?[x]-modules 

0 —► (/) —► P 0 —► N —► 0 

shows that P 0 has a finite free resolution; and further applied to the exact 
sequence 

0 —► pK[x] —► P —► P 0 ► 0 

shows that P has a finite free resolution, thereby concluding the proof of 
Theorem 2.8. 


§3. UNIMODULAR POLYNOMIAL VECTORS 

Let A be a commutative ring. Let (/ l9 ..., f n ) be elements of A generating 
the unit ideal. We call such elements unimodular. We shall say that they have 
the unimodular extension property if there exists a matrix in GL n (A ) with first 
column { (f u . .. ,/„). If A is a principal entire ring, then it is a trivial exercise to 
prove that this is always the case. Serre originally asked the question whether 
it is true for a polynomial ring k[xj,..., x r ] over a field /c. The problem was 
solved by Quillen and Suslin. We give here a simplification of Suslin’s proof by 
Vaserstein, also using a previous result of Horrocks. The method is by induc- 
tion on the number of variables, in some fashion. 

We shall write / = l (f l9 ... ,/„) for the column vector. We first remark 
that/has the unimodular extension property if and only if the vector obtained 
by a permutation of its components has this property. Similarly, we can make 
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the usual row operations, adding a multiple gf to f (j # í), and /has the uni- 
modular extension property if and only if any one of its transforms by row 
operations has the unimodular extension property. 

We first prove the theorem in a context which allows the induction. 

Theorem 3.1. (Horrocks). Let (o, m) be a local ring and let A = o[x] 
be the polynomial ring in one variable over o. Let f be a unimodular vector 
in A (n) such that some component has teading coefftcient 1. Thenf has the 
unimodular extension property. 

Proof. (Suslin). If n = 1 or 2 then the theorem is obvious even without 
assuming that o is local. So we assume n ^ 3 and do an induction of the 
smallest degree d of a component of/ with leading coefficient 1. First we note 
that by the Euclidean algorithm and row operations, we may assume that f x 
has leading coefficient 1, degree d , and that deg/ < d for j / 1. Since / is 
unimodular, a relation £ g { f = 1 shows that not all coefficients of/ 2 , ...,/„ 
can lie in the maximal ideal m. Without loss of generality, we may assume that 
some coefficient of f 2 does not lie in m and so is a unit since o is local. Write 

/j(x) = x d -f a d _ X x d ~ 1 + • ■ • -f a 0 with a { e o, 

f 2 (x) = b 5 x s + • • • + b 0 with b t 6 o, s ^ d - 1, 

so that some b, is a unit. Let a be the ideal generated by all leading coefficients 
of polynomials g^f^ + g 2 f 2 of degree S d — 1. Then û contains all the co- 
efficients b t , i = 0,..., s. One sees this by descending induction, starting with 
b s which is obvious, and then using a linear combination 

X d ~ s f 2 (x) - b'ffx). 

Therefore a is the unit ideal, and there exists a polynomial g x f x + g 2 f 2 of 
degree ^ d — 1 and leading coefficient 1. By row operations, we may now get 
a polynomial of degree ^ d — 1 and leading coefficient 1 as some component 
in the i-th place for some i # 1, 2. Thus ultimately, by induction, we may 
assume that d = 0 in which case the theorem is obvious. This concludes the 
proof. 

Over any commutative ring A , for two column vectors /, g we write / ~ g 
over A to mean that there exists M e GL n (A) such that 

/= Mg, 

and we say that / is equivalent to g over A. Horrocks’ theorem states that a 
unimodular vector/ with one component having leading coefficient 1 is o[x]- 
equivalent to the first unit vector e 1 . We are interested in getting a similar 
descent over non-local rings. We can write/ = /(x), and there is a natural 
“constant” vector/(0) formed with the constant coefficients. As a corollary of 
Horrocks’ theorem, we get: 
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Corollary 3.2. Let o be a local ring. Let f be a unimodular vector in 
o[x] (n) such that some component has leading coefficient 1. Then f ~ /(0) 
over o[x]. 

Proof. Note that/(0) e o (M) has one component which is a unit. It suffices 
to prove that over any commutative ring R any element c e R (n) such that some 
component is a unit is equivalent over R to e 1 , and this is obvious. 

Lemma 3.3. Let R be an entire ring, and let S be a multiplicative subset. 
Let x, y be independent variables. Iff(x) ~/(0) over S~ x R[x), then there exists 
c e S such thatf(x -I- cy) ~ f(x) over /?[x, y\. 

Proof. Let MeGL„(5 _1 R[x]) be such that f(x) = M(x)f( 0). Then 
M(x)~ l f(x) = /(0) is constant, and thus invariant under translation x h-► x -f y. 
Let 

G(x, y) = M(x)M(x -f y)" 1 . 

Then G(x, y)f(x -f y) = f(x). We have G(x, 0) = / whence 

G(x, y) = I -f yH(x, y) 

with H(x, y) e S~ ^[x, y~\. There exists c e S such that cH has coefficients in 
R. Then G(x, cy) has coefficients in R. Since det Àí(x) is constant in S~ 'R, it 
follows that det Af(x -f cy) is equal to this same constant and therefore that 
det G(x, cy) = 1. This proves the lemma. 

Theorem 3.4. Let R be an entire ring, and let f be a unimodular vector in 
/?[x] (n) , such that one component has leading coefficient 1. Then f(x) ~ /(0) 
over R[x). 

Proof. Let J be the set of elements c e R such that f(x -f cy) is equivalent 
to f(x) over R[x, ý]. Then J is an ideal, for if c e J and ae R then replacing y 
by ay in the definition of equivalence shows that f(x -f cay) is equivalent to 
f(x) over K[x, ay ], so over K[x, y~\. Equally easily, one sees that if c, c' e J 
then c -f c' e J. Now let p be a prime ideal of R. By Corollary 3.2 we know 
that f(x) is equivalent to/(0) over K p [x], and by Lemma 3.3 it follows that 
there exists ceR and c^p such that f(x -f cy) is equivalent to f(x) over 
/?[x, y\. Hence J is not contained in p, and so J is unit ideal in R, so there exists 
an invertible matrix M(x, y) over /?[x, y\ such that 

/(x-f y) = M(x, y)f(x). 

Since the homomorphic image of an invertible matrix is invertible, we substitute 
0 for x in this last relation to conclude the proof of the theorem. 

Theorem 3.5. (Quillen-Suslin). Let k be afield and letfbe a unimodular 
vector in k[x } ,. . . , x r f n \ Then f has the unimodular extension property. 
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Proof. By induction on r. If r = 1 then k[x í '] is a principal ring and the 
theorem is left to the reader. Assume the theorem for r — 1 variables with r ^ 2, 
and put 


R = k[x l9 ... 9 x r - l ]. 

We view / as a vector of polynomials in the last variable x r and want to apply 
Theorem 3.4. We can do so if some component of/has leading coefficient 1 in 
the variable x r . We reduce the theorem to this case as follows. The proof of the 
Noether Normalization Theorem (Chapter VIII, Theorem 2.1) shows that if we 
let 


y>i = Xi — x n f 

then the polynomial vector 

f(x l ,...,x r ) = g(y l ,...,y r ) 

has one component with y r -leading coefficient equal to 1. Hence there exists a 
matrix N(y) = M(x) invertible over R[x r ] = K[j r ] such that 

g(yu , y r ) = N(yi,..., y r )g(y u • • •, y r -„ 0), 

and g(y u ...,y r _ lt 0) is unimodular in fc[y,, ...,y r ~ We can therefore 
conclude the proof by induction. 

We now give other formulations of the theorem. First we recall that a 
module E over a commutative ring A is called stably free if there exists a finite 
free module F such that E © F is finite free. 

We shall say that a commutative ring A has the unimodular column exten- 
sion property if every unimodular vector/ e A (n) has the unimodular extension 
property, for all positive integers n. 

Theorem 3.6. LetA be a commutative ring which has the unimodular column 
extension property. Then every stably free module over A is free. 

Proof. Let E be stably free. We use induction on the rank of the free 
modules F such that E © F is free. By induction, it suffices to prove that if 
E ® A is free then E is free. Let E ® A = A {n) and let 

p : A {n) - A 

be the projection. Let u 1 be a basis of A over itself. Viewing A as a direct 
summand in E ® A = A (n) we write 


u 1 ='(a,„ , a„,) with a n e A. 



850 FINITE FREE RESOLUTIONS 


XXI, §4 


Then u 1 is unimodular, and by assumption u 1 is the first column of a matrix 
M = (ciij) whose determinant is a unit in A. Let 

u j = Me j for j = 1,..., n , 

where is they‘-th unit column vector of A (n \ Note that u l is the first column 
of M. By elementary column operations, we may change M so that u j e E for 
j = 2,..., n. Indeed, if pe j = cu { for j ^ 2 we need only replace e j by e J — ce\ 
Without loss of generality we may therefore assume that w 2 ,..., u n lie in E. 
Since M is invertible over A , it follows that M induces an automorphism of 
A (n) as /1-module with itself by 


Xï-+MX. 

It follows immediately from the construction and the fact that A (n) = E © A 
that M maps the free module with basis { e 2 ,..., e n } onto E. This concludes 
the proof. 

If we now feed Serre’s Theorem 2.9 into the present machinery consisting 
of the Quillen-Suslin theorem andTheorem 3.6, we obtain the alternative version 
of the Quillen-Suslin theorem: 

Theorem 3.7. Let k be a field. Then every finite projective module over the 
polynomial ring k[x u . .., x r ] is free. 


§4. THE KOSZUL COMPLEX 

In this section, we describe a finite complex built out of the alternating 
product of a free module. This gives an application of the alternating product, 
and also gives a fundamental construction used in algebraic geometry, both 
abstract and complex, as the reader can verify by looking at Griffiths-Harris 
[GrH 78], Chapter V, §3; Grothendieck’s [SGA 6]; Hartshorne [Ha 77], Chapter 
III, §7; and Fulton-Lang [FuL 85], Chapter IV, §2. 

We know from Chapter XX that a free resolution of a module allows us to 
compute certain homology or cohomology groups of a functor. We apply this 
now to Hom and also to the tensor product. Thus we also get examples of explicit 
computations of homology, illustrating Chapter XX, by means of the Koszul 
complex. We shall also obtain a classical application by deriving the so-called 
Hilbert Syzygy theorem. 

Let A be a ring (always assumed commutative) and M a module. A sequence 
of elements x x ,..., x r in A is called M-regular if M/(x x ,. . . , x r )M ± 0, if x x 



XXI, §4 


THE KOSZUL COMPLEX 851 


is not divisor of zero in M, and for i ^ 2, x t is not divisor of 0 in 

It is called regular when M = A. 

Proposition 4.1. Let I = (jcj ,..., x r ) be generated by a regular sequence 
in A. Then I/l 2 isfree of dimension r over A/I. 

Proof. Let x f be the class of x, mod / 2 . It suffices to prove that x l9 ..., x r 
are linearly independent. We do this by induction on r. For r = 1, if ax = 0, 
then ax = bx 2 for some b g A 9 so x(a — bx) = 0. Since x is not zero divisor in A 9 
we have a = bx so a = 0. 

Now suppose the proposition true for the regular sequence x í9 ..., x r - v 
Suppose 

YjâiXi = 0 in I/I 2 . 

i = 1 

We may assume that £ a^x, = 0 in A ; otherwise £ flfX, = £ y,x, with g / and 
we can replace a, by a t — y,- without changing a { . 

Since x r is not zero divisor in A/(x l9 ..., x r _!) there exist b t e A such that 

r— 1 r— 1 r—1 

a r*r + Z a i x i = o => a r = X fcjX,- => X ( fl i + è f X r )Xj = 0. 

í=l i=l i=l 


By induction, 

r— 1 

Oj + bjX, e (J = 1,..., r - 1) 

í= 1 

so ûj g / for all j, so ô,- = 0 for all;, thus proving the proposition. 

Let L be complexes, which we write as direct sums 

K = @K P and L = 0L, 

with p , # gZ. Usually, K p = L q = 0 for /?, # < 0. Then the tensor product 
K (x) L is the complex such that 

(K ® L)„ = ©K p ® L q ; 

p + q = n 

and for w g K p9 v g L q the differential is defined by 

d(u ® v) = du (g) v -f (— l) p u (g) dv. 

(Carry out the detailed verification, which is routine, that this gives a complex.) 
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Let A be a commutative ring and xeA. We define the complex K(x ) to have 
K 0 (x) = A, K x (x) = Ae u where is a symbol, Ae x is the free module of rank 1 
with basis {e and the boundary map is defined by de x = x, so the complex 
can be represented by the sequence 

0-► Ae x ——» A -> 0 


0- +K x (x) - *K 0 (x) -»0 

More generally, for elements x u ..., x r e A we define the Xoszul complex 
K(x) = K(x u ..., x r ) as follows. We put: 

K 0 (x) = A; 

K x (x) = free module E with basis {e l9 ..., e r )\ 

K p (x) = free module /\E with basis {e i{ a • • • a e ip }, i x < • • • < i p \ 

K r (x) = free module /\E of rank 1 with basis e x a • • • a e r . 


We define the boundary maps by de { = x t and in general 

d:K p (x)^K p ^(x) 


by 

p 

d(c fl A • • • a e ip ) = z (- iy- l x it e it a • • • a t tj a • • 
A direct verification shows that d 2 = 0, so we have a complex 


a e { 


0 - K r (x) --> K p (x) --> K^(x) - ,4 - 0 

The next lemma shows the extent to which the complex is independent of the 
ideal / = (x Ì9 ... , x r ) generated by (x). Let 


I = x r ) =5 r = (y u ..., y r ) 

be two ideals of A. We have a natural ring homomorphism 

can : A/T -> A/l. 

Let {e' l9 ..., e' r ) be a basis for K x (y\ and let 

= Yj c u x j with c u e A. 


We define f x :K x (y) -> K x (x) by 


fA = £ c u e j 
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and 


/ P = /i a • • • a f u product taken p times. 

Let D = det(c tJ ) be the determinant. Then for p = r we get that 
f r : K r (y) -> K r (x) is multiplication by D. 


Lemma 4.2. Notation as above, the homomorphisms f p define a morphism of 
Koszul complexes: 


>K r (y)- 


fr = D 


>K p (y) 

fp 

>K p (x) 


K x (y) 

/ 

K,(x) 


A- 

id 

A - 


A/I' 


A/I 


0 

0 


and define an isomorphism ifD is a unit inA.for instance if(y)isapermutation 
of (x). 

Proof. By definition 

/(<?;, a ••• A e' ip ) = C iu e^ A ••• A ^X' 

Then 

fdie’i, a • • • a e' ip ) 

= /^E (-1)" - a • • • a Ç/ k a • • • a é-;^ 

= I (- 1)*~ 'yd X A • • • A X A • • • A 

* \j=l / k 

= x (-1)*“ '(jtcijej) A A (jtc iki Xjeìj 

^ v J 

omitted 

= df(e' it a ••• a e; p ) 

using y t = X c i k j x j- This concludes the proof that the / /; define a homomorphism 
of complexes. 

In particular, if (x) and (y) generate the same ideal, and the determinant D 
is a unit (i.e. the linear transformation going from (x) to (y) is invertible over 
the ring), then the two Koszul complexes are isomorphic. 
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The next lemma gives us a useful way of making inductions later. 
Proposition 4.3. There is a natural isomorphism 

K(x u ..., x r ) « K(x { ) (g) • • • ® K(x r ). 

Proof The proof will be left as an exercise. 

Let / = (xx r ) be the ideal generated by x l5 ..., x r . Then directly from 
the definitions we see that the 0-th homology of the Koszul complex is simply 
A/IA. 

More generally, let M be an A-module. Define the Koszul complex of M by 
K(x; M) = Â^Xi,..., x r ; M) = K(x l9 ..., x r ) <g ) A M 
Then this complex looks like 

0 K r (x) <8> A/ —>-> K 2 (x) ® a M^> M (r) M -* 0. 

We sometimes abbreviate H p (x\ M) for H p K(x\ M). The first and last homology 
groups are then obtained directly from the definition of boundary. We get 

H 0 (K(x ; M)) « M//M; 

H r (K(x)\ M) = {v e M such that x{0 = 0 for all i = 1,.. . , r}. 

In light of Proposition 4.3, we study generally what happens to a tensor 
product of any complex with K(x), when x consists of a single element. Let 
y e A and let C be an arbitrary complex of A-modules. We have an exact sequence 
of complexes 

(1) 0 C C ® K(y) ^(C® K(y))/C 0 

made explicit as follows. 


o- >c n+ ,-*(C B+1 ® A)®(C n ® AT,(y)) -*C„® ATj(y)- 


® id 


0- *c m — ^(C„® ^)0(C n _, ® a:,(j) ->C„ ®A:,(y)) -*0 

_ i (g) id 

♦C.-I-*(C,-I ® i4)©'(C„_ 2 ® «-,(>))-*C„- 2 ® AT,(y)-*0 


0 
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We note that C ® K x {y) is just C with a dimension shift by one unit, in other 
words 

(2) (C®K x {y)) n + x = C n ®K x {y). 

In particular, 

(3) H n+l {C ® K{y)/C) ~H n (C). 

Associated with an exact sequence of complexes, we have the homology sequence, 
which in this case yields the long exact sequence 

-►//„+ ,(C)-► H n + ,(C ® K,(y)) 


H n + i(C 0 K(y)/C) " * H„(C) 

u 

H n {C) 


which we write stacked up according to the index: 

(4) H p+l (C) H p+l (C) -> H p+l (C ® K(y)) -> 

-> H p (C) -* H p (C) -* H p (C ® K(y)) -* 

ending in lowest dimension with 

(5) H X {C) H X {C ® K{y)) -> // 0 (C) -+ // 0 (C). 

Furthermore, a direct application of the definition of the boundary map and the 
tensor product of complexes yields: 

The boundary map on H p {C) {p ^ 0) is induced by multiplication by {—\Yy\ 

(6) a = (-1 ) p m{y):H p {C)^H p {C). 

Indeed, write 

(C ® K{y)) p = {C p ® A) © (C p _, ® K x {y)) - C p © C p _,. 

Let (u, vv) e C p ® C^ with v e C p and vv e C^. Then directly from the 
definitions, 

(7) d{v , vv) = (ífo -f (-l)^ _1 yw, Jw). 

To see (6), one merely follows up the definitions of the boundary, taking an 
element w e C p ~ C p © Â^(y), lifting back to (0, w), applying d , and lifting 
back to C p . If we start with a cycle, i.e. dw = 0, then the map is well defined 
on the homology class, with values in the homology. 

Lemma 4.4. Lety e A and let Cbea complex as above. Then m{y) annihilates 
H p {C ® K{y))for all p ^ 0. 

Proof. If ( v , w) is a cycle, i.e. d{v, w) = 0, then from (7) we get at once 
that {yv, yw) = d{ 0, (— \) p v), which proves the lemma. 
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In the applications we have in mind, we let y = x r and 

C = K(x ì9 ...,x r - i; M) = K(x ì9 . . . , x r - j) (8) M. 

Then we obtain: 

Theorem 4.5.(a) There is an exact sequence with maps as above: 

-> H p K(x Ì9 ..., x r - X \ M) -> H p K(x Ì9 ..., x r - X ; M) -> H p K(x { ,..., x r , M) 
-> H x (x i,..., * r ; M) —» // 0 (*i,.. • , * r -i; M) ,..., x r - X \M). 

(b) Every element of I = (x Ì9 ..., x r ) annihilates H n (x; M) for p ^ 0. 

(c) /// = A, then H p (x; M) = 0 for all p^O. 

Proof. This is immediate from Proposition 4.3 and Lemma 4.4. 

We define the augmented Koszul complex to be 

0 -► K r (x\ M) —>-» K x (x\ M) = M (r) —» M —» M//M 0. 

Theorem 4.6. Let M be an A-module. 

(a) Let x x , . . ., x r be a regular sequence for M. Then H p K(x; M) = 0 for 
p > 0. (Of course, H 0 K(x; M) = M/IM.) In other words, the augmented 
Koszul complex is exact. 

(b) Conversely, suppose A is local, and x x , ..., x r lie in the maximal ideal of 
A. Suppose M is finite over A, and also assume that H x K(x; M) = 0. Then 
(x ìf ..., x r ) is M-regular. 

Proof We prove (a) by induction on r. If r = 1 then H x (x\ M) = 0 directly 
from the definition. Suppose r > 1. We use the exact sequence of Theorem 
4.5(a). If p > 1 then H p (x\ M) is between two homology groups which are 0, so 
H p (x\ M) = 0. If p = 1, we use the very end of the exact sequence of Theorem 
4.5(a), noting that m(x r ) is injective, so by induction we findZ/C*; M) = 0 also, 
thus proving (a). 

As to (b), by Lemma 4.4 and the hypothesis, we get an exact sequence 

H x (x x , ...,x r ^-,M )—Ì ..., x r -\, M) —*■ H,{x\ M) = 0, 

so m(x r ) is surjective. By Nakayama’s lemma, it follows that 

H i(jcj, . . . , x r - x \ M) = 0. 

By induction (x Ì9 ..., ^ r -i) is an M-regular sequence. Looking again at the tail 
end of the exact sequence as in (a) shows that x r is M/(x x ,. . . , x^-^M- regular, 
whence proving (b) and the theorem. 

We note that (b), which uses only the triviality of H x (and not all H p ) is 
due to Northcott [No 68], 8.5, Theorem 8. By (a), it follows that H p = 0 for 

p > 0. 
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An important special case of Theorem 4.6(a) is when M = A, in which case 
we restate the theorem in the form: 

Let *!,. .., x r be a regular sequence in A. Then K(x { ,. . . , x r ) is a free 
resolution ofA/í : 

0 - K r (x ) -> Kfx) - /1 - ,4/7 - 0. 

In particular , A/I has Tor -dimension ^ r. 

For the Hom functor, we have: 

Theorem 4.7. Let x ì9 ..., x r be a regular sequence in A. Then there is an 
isomorphism 

(p XtM : H r (Hom(K(x\ M)) - M/IM 
to be described below. 

Proof. The module K r (x ) is 1-dimensional, with basis e x a • • • a e r . 
Depending on this basis, we have an isomorphism 

Hom (K r (x), M) % M, 

whereby a homomorphism is determined by its value at the basis element in M. 
Then directly from the defìnition of the boundary map d r in the Koszul complex, 
which is 

r 

d r : í, A ' • • A í r K X (-ir l í/i A • • • A í; A • • ' A e r 

; =i 

we see that 

H r (Hom(K r (x), M) *s Hom(K r (x ), M)/d r ~ l Hom^.fx), M) 

* M/IM. 


This proves the theorem. 

The reader who has read Chapter XX knows that the i-th homology group 
ofHom(K(x), M) is called Ext l (A/I, M), determined up to a unique isomorphism 
by the complex, since two resolutions ofA/I differ by a morphism of complexes, 
and two such morphisms differ by a homotopy which induces a homology iso- 
morphism. Thus Theorem 4.7 gives an isomorphism 

cp XtM : Ext r (A/I, M) - M/IM. 

In fact, we shall obtain morphisms of the Koszul complex from changing the 
sequence. We go back to the hypothesis of Lemma 4.2. 
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Lemma 4.8. Ifl = (*) = ( y) where (jt), ( y) are two regular sequences , then 
we have a commutative diagram 


Exi r (A/f M)< 




M/IM 

D = det(c„) 

M/IM 


where all the maps are isomorphisms of A/I-modules. 

The fact that we are dealing with >l//-modules is immediate since multiplication 
by an element of A commutes with all homomorphisms in sight, and / an- 
nihilates A/I. 

By Proposition 4.1, we know that I// 2 is a free module of rank r over A/I. 
Hence 


Aw 2 ) 


is a free module of rank 1, with basis jc^ a ••• a x r (where the bar denotes 
residue class mod I 2 ). Taking the dual ofthis exterior product, we see that under 
a change of basis, it transforms according to the inverse of the determinant 
mod I 2 . This allows us to get a canonical isomorphism as in the next theorem. 


Theorem 4.9. Let x Xi ..., x r be a regular sequence in A, and let I = (x). 
Let M be an A-module. Let 

i/j X íM : M/IM - (M/IM) ® /\ r (/// 2 ) dual 

be the embedding determined by the basis (x^ a • • • a x r ) dual of /\ r (/// 2 ) dual . 
Then the composite isomorphism 

Ext r (A/I, M) M/IM ^ (M/IM) ® /\'(I/I 2 ) dua ' 

is a functorial isomorphism , independent of the choice of regular generators 
for I. 

We also have the analogue of Theorem 4.5 in intermediate dimensions. 


Theorem 4.10. Let x ìt ... t x r be an M-regular sequence in A. Let I = (x). 
Then 


Ext l C4//, M) = 0 for i < r. 

Proof. For the proof, we assume that the reader is acquainted with the 
exact homology sequence. Assume by induction that Ext*(^4//, M) = 0 for 



XXI, §4 


THE KOSZUL COMPLEX 859 


i < r — 1. Then we have the exact sequence 

0 = Ext' - \A/1, M/ Xl M) - Ext'(^//, M) Ext \A/1, M) 

for i < r. But x^e I so multiplication by Xj induces 0 on the homology groups, 
which gives Ext l (A/I, M) = 0 as desired. 

Let L N —► N —► 0 be a free resolution of a module N. By definition, 

Tor f(N, M) = i-th homology of the complex L® M. 

This is independent of the choice of L N up to a unique isomorphism. We now 
want to do for Tor what we have just done for Ext. 

Theorem 4.11. Let I = (x ìf ..., x r ) be an ideal of A generated by a regular 
sequence of length r. 

(i) There is a natural isomorphism 

TorfiA/l, A/I) * K aii (I/I 2 ), for i* 0. 

(ii) Let L be a free A/I-module , extended naturally to an A-module. Then 

Torf(L,A/I)*L®/y A/I (IH 2 l for i> 0. 

These isomorphisms will follow from the next considerations. 

First we use again that the residue classes x ì9 ...,x r mod I 2 form a basis of 
I/I 2 over A/I. Therefore we have a unique isomorphism of complexes 

<p x : K(x) ® A/I -> A (I/I 2 ) = © AW/ 2 ) 

with zero differentials on the right-hand side, such that 

e ix a ••• a e ip i ► Xj-j a ••• a x ip . 


Lemma 4.12. Let I = (x) D T = (y) be two ideals generated by regular 
sequences oflength r. Let f : ÂT(y) —> £(x) be the morphism ofKoszul complexes 
defined in Lemma 4.2. Then the following diagram is commutative: 


K(y) ® A/T 


A AirV'/I' 2 ) 


f (g) can 


canonical hom 


<Px 


K(x) ® A/I 


AaiiU/I 2 ) 
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Proof. We have 


<Px ° (/ ® can)(e;, a • • • a e' ip ® 1) 

r r 

= I CltjXj A • • • A I 

j=2 j=l 

= ŷi, A • • • a ŷ lp = can(<p ) ,(e;. i a • • • a e,' p )). 

This proves the lemma. 

In particular, if /' = / then we have the commutative diagram 

K(y) 

J ®d 

K(x) 

which shows that the identification of Tor i(A/I, A/I) with /\‘(/// 2 ) via the 
choices of bases is compatible under one isomorphism of the Koszul complexes, 
which provide a resolution of A/I. Since any other homomorphism of Koszul 
complexes is homotopic to this one, it follows that this identification does not 
depend on the choices made and proves the fìrst part of Theorem 4.11. 

The second part follows at once, because we have 

Tor f(A/l L) = tìi(K(x) ® L) = H&K(x) ® A A/I) ® A/I L 

= /V aii (I/I 2 )®L. 



This concludes the proof of Theorem 4.11. 

Example. Let k be a field and \ct A = /c[x 1? ..., x r ] be the polynomial ring 
in r variables. Let / = (x l5 ..., x r ) be the ideal generated by the variables. Then 
A/I = k , and therefore Theorem 4.11 yields for i ^ 0: 

Tor f(k 9 k) * A Í(I/I 2 ) 

Tor f(L, k)*L® Al(/// 2 ) 

Note that in the present case, we can think of I/I 2 as the vector space over k with 
basis Xj,..., x r . Then A can be viewed as the symmetric algebra SE , where E 
is this vector space. We can give a specific example of the Koszul complex in this 
context as in the next theorem, given for a free module. 
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Theorem 4.13. Let Ebe a finite free moduie of rank r over the ring R. For 
each p = 1,..., r there is a unique homomorphism 

d p : f\ p E®SE^ A P " l£ ® SE 


such that 


di((x , A • • • A x p ) <g> ỳ) 


= £ (- 1)' ‘Oi A • • • A $ A • • • A X p ) <g) (Xj ® >>) 

Ì =1 

where x f e E and y e SE. This gives the resolution 

0 - f\ r E ®SE^f\ r ~ l E®SE^ -► f\°E (x) K - 0 

Proof. The above definitions are merely examples of the Koszul complex 
for the symmetric algebra SE with respect to the regular sequence consisting of 
some basis of E. 

Since d p maps f\ p E ® S q E into f\ p ~ l E ® S q+Í E , we can decompose this 
complex into a direct sum corresponding to a given graded component, and 
hence: 

CoroIIary 4.14. For each integer n ^ 1, we have an exact sequence 

0 - f\ r E ®S n ~ r E --► f\ l E ® S n ~ l E - S n E - 0 

where S j E = 0 for j < 0. 

Finally, we give an application to a classical theorem of Hilbert. The poly- 
nomial ring A = k[_x u ..., x r ] is naturally graded, by the degrees of the homo- 
geneous components. We shall consider graded modules , xvhere the grading is in 
dimensions ^ 0, and xve assume that homomorphisms are graded of degree 0. 

So suppose M is a graded module (and thus M f = 0 for i < 0) and M is finite 
over A. Then we can find a graded surjective homomorphism 

Lq-M-0 

where L 0 is finite free. Indeed, let w l5 ..., w„ be homogeneous generators of M. 
Let e l9 ..., e n be basis elements for a free module L 0 over A. We give L 0 the 
grading such that if a e A is homogeneous of degree d then ae^ is homogeneous of 
degree 

deg ae { = deg a H- deg w f . 

Then the homomorphism of L 0 onto M sending e, i—^w f is graded as desired. 
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The kernel M x is a graded submodule of L 0 . Repeating the process, we can find a 
surjective homomorphism 

L x — M x — 0. 

We continue in this way to obtain a graded resolution of M. We want this 
resolution to stop, and the possibility of its stopping is given by the next theorem. 

Theorem 4.15. (Hilbert Syzygy Theorem). Let kbe a field and 

A = fc[x l5 ..., x r ] 

the polynomial ring in r variables. Let Mbe a graded module over A , and let 
0 —► K —► L r _ j Lq —► M —► 0 

be an exact sequence of graded homomorphisms of graded modules , such that 
L 0 ,..., L r _ ^ ure/ree. Then K is free. If M is in addition finite over A and 
L 0 ,..., L r _ j arefinitefree , í/ien K is finitefiree. 

Proof. From the Koszul complex we know that Tor f (M, fc) = 0 for i > r 
and all M. By dimension shifting, it follows that 

Tor,(K, fc) = 0 for i > 0. 

The theorem is then a consequence of the next result. 

Theorem 4.16. Let F be a graded finite module over A = fc[x l9 ..., x r ]. // 
Tor/F, fc) = 0 then F is free. 

Proof. The method is essentially to do a Nakayama type argument in the 
case of the non-local ring A. First note that 

F ® fc = F/IF 

where / = (x^,..., x r ). Thus F ® k is naturally an A/I = fc-module. Let 
v !,..., v n be homogeneous elements of F whose residue classes mod /F form a 
basis of F//F over fc. Let L be a free module with basis e l5 ..., e n . Let 

L-F 

be the graded homomorphism sending e t i—► for / = 1,..., n. It suffices to 
prove that this is an isomorphism. Let C be the cokernel, so we have the exact 
sequence 

L - F - C - 0. 

Tensoring with fc yields the exact sequence 

L®fc — F®fc — C(g)fc — 0. 



XXI, §4 


THE KOSZUL COMPLEX 863 


Since by construction the map L ® k -> F ® k is surjective, it follows that 
C ®k = 0. But C is graded, so the next lemma shows that C = 0. 

Lemma 4.17. Let N be a graded module over A = k[x l9 ..., x r ]. Let 

I = (xj,..., x r ). IfN/IN = 0 then N = 0. 

Proof This is immediate by using the grading, looking at elements of N 
of smallest degree if they exist, and using the fact that elements of / have degree 
>0. 

We now get an exact sequence of graded modules 

O-L-L^F-O 

and we must show that E = 0. But the exact homology sequence and our as- 
sumption yields 

0 = Tor^F, /c) — L(x)/c — L(x)/c — F(x)/c — 0. 

By construction L (x) /c — F (x) /c is an isomorphism, and hence E (x) k = 0. 
Lemma 4.17 now shows that E — 0. This concludes the proof of the syzygy 
theorem. 

Remark. The only place in the proof where we used that k is a field is in the 
proof of Theorem 4.16 when we picked homogeneous elements v ì9 ..., v n in M 
whose residue classes mod IM form a basis of M/IM over A/IA. Hilbert’s 
theorem can be generalized by making the appropriate hypothesis which allows 
us to carry out this step, as follows. 

Theorem4.18. Let R be a commutative local ring and let A = R[x l5 ..., x r ] 

be the polynomial ring in r variables. Let Mbe a graded finite module over A 9 

projective over R. Let 

0 —► K. —► L r _ i —►•••—► Lq —► M —► 0 

be an exact sequence of graded homomorphisms of graded modules such that 

L 0 ,..., L r _ ^ are finite free. Then K is finite free. 

Proof Replace k by R everywhere in the proof of the Hilbert syzygy 
theorem. We use the fact that a finite projective module over a local ring is free. 
Not a word needs to be changed in the above proof with the following exception. 
We note that the projectivity propagates to the kernels and cokernels in the 
given resolution. Thus F in the statement of Theorem 4.16 may be assumed 
projective, and each graded component is projective. Then F/IF is projective 
over A/IA = R , and so is each graded component. Since a finite projective 
module over a local ring is free, and one gets the freeness by lifting a basis from the 
residue class field, we may pick v u ..., v n homogeneous exactly as we did in the 
proof of Theorem 4.16. This concludes the proof. 
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For exercises 1 through 4 on the Koszul complex, see [No 68], Chapter 8. 

1. Let 0 —> M' —> M —» M" —» 0 be an exact sequence of A-modules. Show that tensoring 
with the Koszul complex K{.x) one gets an exact sequence of complexes, and therefore 
an exact homology sequence 

0 H r K{x\ M') H r K{x\ M) H r K{x\ M") • • • 

-> H p K{x\ M') H p K{x\ M) -► H p K{x\ M") • • • 

-> H 0 K{x\ M') HqK{x\ M) HqK{x\ M") 0 


2. (a) Show that there is a unique homomorphism of complexes 
/: K{x\ M) —» K{x Xì . . . , x r _ x \ M) 
such that for v e M: 


f p (e h a • • • a e ip ® v) = 


I e i, A 
\e,. a 


A e: ® x r v if / _ = r 

tp r p 

A e: ® v if /_ = r. 

i p p 


(b) Show that / is injective if x r is not a divisor of zero in M. 

(c) For a complex C, denote by C(-l) the complex shifted by one place to the left, 
so C(—1)„ = C„_, for all n. Let M = M/x r M. Show that there is a unique 
homomorphism of complexes 

g: K{x ì9 ..., x r . lf 1; M) K{x ì9 .. ., jc r _,; M)(-1) 

such that for v E M: 


_ A * * * A e. 

A • • • A c ip ® v) = | o ' 


' v if i p — r 
if i p < r. 


(d) If x r is not a divisor of 0 in M, show that the following sequence is exact: 

0 —* K(x\ M) -4 K(x u ..., jc r .„ 1;M)4. AT(x .. M)(-l) ^ 0. 

Using Theorem 4.5(c), conclude that for all p ^ 0, there is an isomorphism 
H p K{x\ M) H p K{x u . . . , x r _ x \ M). 


3. Assume A and M Noetherian. Let / be an ideal of A. Let a ,,..., a k be an M-regular 
sequence in /. Show that this sequence can be extended to a maximal M-regular 
sequence a lf ..., in /, in other words an M-regular sequence such that there is 
no M-regular sequence a l9 ..., a q+l in I. 

4. Again assume A and M Noetherian. Let / = (jc, ,. .. , x r ) and let a lt . .., be a 
maximal M-regular sequence in I. Assume IM ± M. Prove that 

H r _ q {x\ M) ± 0 but H p {x\ M) = 0 for p > r - q. 

[See [No 68], 8.5 Theorem 6. The result is similar to the result in Exercise 5, and 
generalizes Theorem 4.5(a). See also [Mat 80], pp. 100-103. The result shows that 
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all maximal M-regular sequences in M have the same length, which is called the 
/-depth of M and is denoted by depth 7 (M). For the proof, let s be the maximal integer 
such that H s K{x\ M) 0. By assumption, H 0 (x\ M) = M//M 0, so s exists. 
We have to prove that q 4- s = r. First note that if q = 0 then s = r. Indeed, if 
q = 0 then every element of / is zero divisor in M, whence / is contained in the 
union of the associated primes of M, whence in some associated prime of M. Hence 
H r (x\ M) =/= 0. 

Next assume q > 0 and proceed by induction. Consider the exact sequence 
0 —> M ^ M —> M/a x M 0 

where the first map is m(a { ). Since / annihilates H p (x\ M) by Theorem 4.5(c), we 
get an exact sequence 

0 -► H p (x\ M) -> H p (x\ M/a x M) H p _ x (x\ M) -> 0. 

Hence H s+ j(jc; M / a x M) =/= 0, but H p (x\M/a x M) = Oforp ^ s -f 2. Fromthe hypothesis 
that a j,..., a q is a maximal M-regular sequence, it follows at once that a 2ì . . . , a q 
is maximal M/ûjM-regular in /, so by induction, q - 1 = r — (s 4- 1) and hence 
q -I- s = r, as was to be shown.] 

5. The following exercise combines some notions of Chapter XX on homology, and 
some notions covered in this chapter and in Chapter X, §5. Let M be an A-module. 

Let A be Noetherian, M finite module over A, and / an ideal of A such that /M ^ M. 
Let r be an integer ^ 1. Prove that the following conditions are equivalent: 

(i) Ext‘(JV, M) = 0 for all i < r and all finite modules N such that supp(N) c= (/). 

(ii) ExV(A/I, M) = 0 for all i < r. 

(iii) There exists a fìnite module N with supp (N) = ££(/) such that 

Ext‘(jV, M) = 0 for all i < r. 

(iv) There exists an M-regular sequence a i9 ..., a r in /. 

[Hint : (i) => (ii) => (iii) is clear. For (iii) => (iv), first note that 

0 = Ext °(N, M) = Hom(N, M). 

Assume supp(N) = 2E(/). Find an M-regular element in /. If there is no such element, 
then / is contained in the set of divisors of 0 of M in A, which is the union of the as- 
sociated primes. Hence / c= P for some associated prime P. This yields an injection 
i4/P c= M, so 


0 ï Hom Áp (A P /PA P , M). 

By hypothesis, N P # 0 so N P /PN P # 0, and N P /PN P is a vector space over A P /PA P , 
so there exists a non-zero A P /PA P homomorphism 

N P /PN P - M P , 

so Hom^iVp, M P ) # 0, whence Hom(N, M) # 0, a contradiction. This proves the 
existence of one regular element a x . 
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Now let Mj = M/ûjM. The exact sequence 

0->M M -* M/ciyM -+ 0 

yields the exact cohomology sequence 

- Ext‘(N, M) - Ext‘(JV, M/ûjM) - Ext i+ *(JV, M) - 

so Ext‘(JV, M/a^M) = 0 for i < r — 1. By induction there exists an M r regular se- 
quence a 2 ,..., a r and we are done. 

Last, (iv) => (i). Assume the existence of the regular sequence. By induction, 
Ext f (JV, ûjM) = 0 for i < r — 1. We have an exact sequence for i < r : 

0 - Ext ‘(N, M) % Ext \N, M) 

But supp(JV) = f£(ann(JV)) C ££(/), so / C rad(ann(JV)), so a x is nilpotent on JV. 
Hence a x is nilpotent on Ext'(JV, M), so Ext'(JV, M) = 0. Done.] See Matsumura’s 
[Mat 70], p. 100, Theorem 28. The result is useful in algebraic geometry, with for 
instance M = A itself. One thinks of A as the affine coordinate ring of some variety, 
and one thinks of the equations a, = 0 as defining hypersurface sections of this variety, 
and the simultaneous equations a x = • • • = a r = 0 as defining a complete intersection. 
The theorem gives a cohomological criterion in terms of Ext for the existence of such 
a complete intersection. 
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The proof which we shall give here follows the classical method of Gelfond 
and Schneider, properly formulated. It is based on a theorem concerning values 
of functions satisfying differential equations, and it had been recognized for some 
time that such values are subject to severe restrictions, in various contexts. 
Here, we deal with the most general algebraic differential equation. 

We shall assume that the reader is acquainted with elementary facts con- 
cerning functions of a complex variable. Let / be an entire function (i.e. a 
function which is holomorphic on the complex plane). For our purposes, we 
say / is of order ^ p if there exists a number C > 1 such that for all large R we 
have 


\m\ á c* p 

whenever \z\ ^ R. A meromorphic function is said to be of order ^ p if it is a 
quotient of entire functions of order ^ p. 

Theorem. Let Kbea finite extension of the rational numbers. Let f l9 ..., f N 
be meromorphic functions of order ^ p. Assume that the field K(f l9 ..., f N ) 
has transcendence degree ^ 2 over K , and that the derivative D = d/dz maps 
the ring K[f í9 ..., /#] into itself. Let w l9 ... ,w m be distinct complex numbers 
not lying among the poles of the /, such that 


/<w v ) e K 


for all i = 1,..., N and v = 1,..., m. Then m ^ 10p[K : Q]. 

Corollary 1. (Hermite-Lindemann). If ot is algebraic (over Q) and # 0, 
then e a is transcendental. Hence n is transcendental. 


867 
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Proof. Suppose that a and e* are algebraic. Let K = Q(a, e a ). The two 
functions z and e z are algebraically independent over K (trivial), and the ring 
K[z , e z ] is obviously mapped into itself by the derivative. Our functions take on 
algebraic values in K at a, 2a,..., mcc for any m, contradiction. Since e 2ni = 1, 
it follows that 2ni is transcendental. 

Corollary 2. (Gelfond-Schneider). If a w algebraic #0, 1 úrm/ if /3 is 

algebraic irrational , í/ien a^ = e^ loga is transcendental. 

Proof. We proceed as in Corollary 1, considering the functions e pt and e f 
which are algebraically independent because /3 is assumed irrational. We look 
at the numbers log a, 2 log a,..., m log a to get a contradiction as in Corollary 1. 

Before giving the main arguments pro ving the theorem, we state some lemmas. 
The first two, due to Siegel, have to do with integral solutions of linear homo- 
geneous equations. 

Lemma 1. Let 


«nXi + + a ln x n = 0 

a r 1*1 + *•* + a rn x n = 0 

be a system of linear equations mth integer coeffìcients a ir and n > r. Let A 
be a number such that \ a^ \ ^ A for all i, j. Then there exists an integral , 
non-trivial solution mth 


\xj\£ 2(2nA) r,in ~ r) . 


Proof. We view our system of linear equations as a linear equation 
L(X) = 0, where L is a linear map, L: Z (n) -► Z (r) , determined by the matrix of 
coefficients. If B is a positive number, we denote by Z {n) (B) the set of vectors X 
in Z (n) such that | X \ ^ B (where | X \ is the maximum of the absolute values 
of the coefficients of 2f). Then L maps Z {n) (B) into Z {r) (nBA). The number of 
elements in Z {n) (B) is B n and ^(2 B -f l) n . We seek a value of B such that 
there will be two distinct elements X , Y in Z {n) (B) having the same image, 
L(X) = L(Y). For this, it will suffice that B n > (2nBA) r , and thus it will suffice 
that 


B = (2nA) r/{n ~ r) . 

We take X — Y as the solution of our problem. 

Let K be a finite extension of Q, and let I K be the integral closure of Z in K. 
From Exercise 5 of Chapter IX, we know that I K is a free module over Z, of 
dimension [ K: Q]. We view K as contained in the complex numbers. If 
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a e K, a conjugate of a will be taken to be an element (xa, where o is an embedding 
of K in C. By the size ofa set ofelements of K we shall mean the maximum ofthe 
absolute values of all conjugates of these elements. 

By the size of a vector X = (x l5 ..., x„) we shall mean the size of the set of its 
coordinates. 

Let o í9 ..., o M be a basis of I K over Z. Let a g I K9 and write 
a = + ••• + a M o) M . 

Let o )\,..., o)' M be the dual basis of o ) l9 ..., o) M with respect to the trace. Then 
we can express the (Fourier) coefhcients a } of a as a trace, 

aj = Tr(aa;'). 

The trace is a sum over the conjugates. Hence the size of these coefficients is 
bounded by the size of a, times a fixed constant, depending on the size of the 
elements co'-. 

Lemma 2. Let Kbe a finite extension of Q. Let 
a n X! + ••• + a ln x„ = 0 

a rl Xi + + a rn x„ = 0 

be a system of linear equations mth coejficients in I K , and n > r. Let Abe a 
number such that size(a^) ^ A , for all i , j. Then there exists a non-trivial 
solution X in I K such that 

size(X) ^ C 1 (C 2 n+) r/(w " r) , 

xvhere C l5 C 2 are constants depending only on K. 

Proof. Let o u ... ,o M be a basis of I K over Z. Each x ; can be written 

X j = Çjl^i + * * • + ÇjM^M 

with unknowns <+ A . Each a^- can be written 

a ij = a^ j i o^ + • • • + a ijM o M 

with integers a ijÀ g Z. If we multiply out the a^-Xy, we find that our linear equa- 
tions with coefficients in I K are equivalent to a system of rM linear equations in 
the nM unknowns Ç jÀ , with coefficients in Z, whose size is bounded by CA , where 
C is a number depending only on M and the size of the elements o À , together with 
the products o À o M , in other words where C depends only on K. Applying 
Lemma 1, we obtain a solution in terms of the £, jÀ , and hence a solution X in I K , 
whose size satisfies the desired bound. 
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The next lemma has to do with estimates of derivatives. By the size of a 
polynomial with coefficients in K, we shall mean the size of its set of coefficients. 
A denominator for a set of elements of K will be any positive rational integer 
whose product with every element of the set is an algebraic integer. We define in 
a similar way a denominator for a polynomial with coefficients in K. We 
abbreviate “denominator” by den. 

Let 

P(Ti,..., T n ) = £ a (v) AÍ (v) (T) 
be a polynomial with complex coefficients, and let 

Q(T u ... 9 T N ) = 'ZP iv) M ix) (T) 

be a polynomial with real coefficients ^O. We say that Q dominates P if 
|a (v) | < j? (v) for all (v). It is then immediately verified that the relation of domi- 
nance is preserved under addition, multiplication, and taking partial derivatives 
with respect to the variables T u ..., T N . 

Lemma 3. Let K be of finite degree over Q. Let f l9 ..., f N be functions, 
holomorphic on a neighborhood of a point vv e C, and assume that D = d/dz 
maps the ring K[f u ..., f N ~\ into itself. Assume that fi(w) e K for all i. Then 
thereexists a number C Y having the folloxving property. Let P(T U ..., T N )be 
a polynomial with coejficients in K, of degree ^ r. Ifxve set f = P(f u ..., f N \ 
then xve have,for all positive integers k, 

siz e(D k f(xv)) S size(P)r*/c! Cj +r 

Furthermore, there is a denominator for D k f(xv) bounded by den (P)C\ +r . 

Proof. There exist polynomials P,(Ti ,..., T N ) with coefficients in K such 
that 

Dfi = Pi(f 

Let h be the maximum of their degrees. There exists a unique derivation D on 
K[T U ..., T n ~\ such that Dfi = PjT^,..., T N ). For any polynomial P we have 

D(P(T ( ,..., T N )) = X (D.PXTj,..., T n ) • PiT u ..., T n ), 

i = 1 

where D u ..., D N are the partial derivatives. The polynomial P is dominated by 
size(P)(l + Tj + • • • + T N ) r , 

and each P, is dominated by size(P,)(l + T x + • • • + T N ) h . Thus DP is dominated 
by 


size(P)C 2 r(l + Ti + • • • + T N ) r+h . 
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Proceeding inductively, one sees that D k P is dominated by 
size(P)C$r*/c!(l 4- Tj • • • + P N ) r+kh . 

Substituting values /j(w) for T i9 we obtain the desired bound on D k f(w). The 
second assertion concerning denominators is proved also by a trivial induction. 

We now come to the main part of the proof of our theorem. Let /, g be two 
functions among f l9 ..., f N which are algebraically independent over K. Let 
r be a positive integer divisible by 2 m. We shall let r tend to infinity at the end 
of the proof. 

Let 


í bijfV 

ij= 1 

have coefficients in K. Let n = r 2 /2m. We can select the not all equal to 0, 
and such that 


D k F(w y ) = 0 

for 0 ^ k < n and v = 1,..., m. Indeed, we have to solve a system of mn linear 
equations in r 2 = 2 mn unknowns. Note that 

mn _ 

2 mn — mn 

We multiply these equations by a denominator for the coefficients. Using the 
estimate of Lemma 3, and Lemma 2, we can in fact take the to be algebraic 
integers, whose size is bounded by 

0(rVCï +r ) g 0(n 2n ) 


for n -► oo. 

Since /, g are àlgebraically independent over K , our function F is not 
identically zero. We let s be the smallest integer such that all derivatives of F 
up to order s — 1 vanish at all points w l9 ..., w OT , but such that D S F does not 
vanish at one of the vv, say w r Then s ^ n. We let 

y = D s P(w!) / 0. 

Then y is an element of X, and by Lemma 3, it has a denominator which is 
bounded by 0(C S ) for s -► oo. Let c be this denominator. The norm of cy from 
K to Q is then a non-zero rational integer. Each conjugate of cy is bounded by 
0(s 5s ). Consequently, we get 

1 ^ \N%cy)\ ^ 0(s 5s ) lK:Q1 - 1 lvl, 


( 1 ) 
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where | y | is the fixed absolute value of y 9 which will now be estimated very well by 
global arguments. 

Let 6 be an entire function of order ^ p, such that 6f and 6g are entire, and 
0( Wl ) # 0. Then 6 2r F is entire. We consider the entire function 

n - ».)' 

v= 1 


Then //(vvj) differs from D^vv^) by obvious factors, bounded by C^s!. By the 
maximum modulus principle, its absolute value is bounded by the maximum of 
H on a large circle of radius R. If we take R large, then z — w v has approximately 
the same absolute value as R , and consequently, on the circle of radius R , H(z) 
is bounded in absolute value by an expression of type 

s 3s Cj rRP 

R ms 


We select R = s 1/2p . We then get the estimate 


\y\ú 


S* S C S 6 

S ms/2p‘ 


We now let r tend to infinity. Then both n and s tend to infinity. Combining this 
last inequality with inequality (1), we obtain the desired bound on m. This 
concludes the proof. 

Of course, we made no effort to be especially careful in the powers of s 
occurring in the estimates, and the number 10 can obviously be decreased by 
exercising a little more care in the estimates. 

The theorem we proved is only the simplest in an extensive theory 
dealing with problems of transcendence degree. In some sense, the theorem is 
best possible without additional hypotheses. For instance, if P(t ) is a polynomial 
with integer coefficients, then e P{t) will take the value 1 at all roots of P, these being 
algebraic. Furthermore, the functions 


are algebraically independent, but take on values in Q(e) for all integral values 
of í. 

However, one expects rather strong results of algebraic independence to hold. 
Lindemann proved that if ol ì9 ..., a„ are algebraic numbers , linearly independent 
over Q, then 


e 


<*i 


, e an 


are algebraically independent . 
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More generally, Schanuel has made the following conjecture: If a^,..., a„ 
are complex numbers, linearly independent over Q, then the transcendence 
degree of 


oc 1 ,...,ot n ,e ai 9 


should be ^ n. 

From this one would deduce at once the algebraic independence of e and n 
(looking at 1, 2ni, e , e 2ni \ and all other independence statements concerning the 
ordinary exponential function and logarithm which one feels to be true, for 
instance, the statement that n cannot lie in the field obtained by starting with the 
algebraic numbers, adjoining values of the exponential function, taking algebraic 
closure, and iterating these two operations. Such statements have to do with 
values of the exponential function lying in certain fields of transcendence degree 
< n, and one hopes that by a suitable deepening of Theorem 1, one will reach 
the desired results. 
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§1. DENUMERABLE SETS 

Let n be a positive integer. Let J n be the set consisting of all integers /c, 
1 ^ k ^ n. If S is a set, we say that S has n elements if there is a bijection between 
S and J n . Such a bijection associates with each integer k as above an element of 5, 
say k\-+a k . Thus we may use J n to “count ” S. Part of what we assume about the 
basic facts concerning positive integers is that if S has n elements, then the integer 
n is uniquely determined by S. 

One also agrees to say that a set has 0 elements if the set is empty. 

We shall say that a set S is denumerable if there exists a bijection of S with the 
set of positive integers Z + . Such a bijection is then said to enumerate the set S. 
It is a mapping 


n i—► a n 

which to each positive integer n associates an element of 5, the mapping being 
injective and surjective. 

If D is a denumerable set, and /: S -► D is a bijection of some set S with D, 
then S is also denumerable. Indeed, there is a bijection g : D -> Z + , and hence 
g o f is a bijection of S with Z + . 

Let T be a set. A sequence of elements of T is simply a mapping of Z + into T. 
If the map is given by the association n\—>x n9 we also write the sequence as 
or also {x l5 x 2 , • •For simplicity, wealso write {x n } for the sequence. 
Thus we think of the sequence as prescribing a first, second,..., n-th element of 
T. We use the same braces for sequences as for sets, but the context will always 
make our meaning clear. 


875 
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Examples. The even positive integers may be viewed as a sequence {x„} if 
we put x n = 2n for n = 1,2, .... The odd positive integers may also be viewed 
as a sequence {y n } if we put y n = 2n — 1 for n = 1, 2,... . In each case, the 
sequence gives an enumeration of the given set. 

We also use the word sequence for mappings of the natural numbers into a set, 
thus allowing our sequences to start from 0 instead of 1. If we need to specify 
whether a sequence starts with the 0-th term or the first term, we write 

or {X„} ngl 

according to the desired case. Unless otherwise specified, however, we always 
assume that a sequence will start with the first term. Note that from a sequence 
{x n } n ^ 0 we can define a new sequence by letting y n = for n ^ 1. Then 
y\ = yi — • • • • Thus there is no essential difference between the two 

kinds of sequences. 

Given a sequence {x„}, we call x n the n- th term of the sequence. A sequence 
may very well be such that all its terms are equal. For instance, if we let x M = 1 
for all n ^ 1, we obtain the sequence {1, 1, 1,...}. Thus there is a difference 
between a sequence of elements in a set T, and a subset of T. In the example just 
given, the set of all terms of the sequence consists of one element, namely the 
single number 1. 

Let {xj, x 2 ,...} be a sequence in a set S. By a subsequence we shall mean a 
sequence {x ni , x„ 2 ,...} such that n x < n 2 < ■ • •. For instance, if {x n } is the 
sequence of positive integers, x„ = n , the sequence of even positive integers {x 2n } 
is a subsequence. 

An enumeration of a set S is of course a sequence in S. 

A set is finite if the set is empty, or if the set has n elements for some positive 
integer n. If a set is not finite, it is called infìnite. 

Occasionally, a map of J n into a set T will be called a finite sequence in T. 
A finite sequence is written as usual, 

.JtJ or {Xí} ì= i. 

When we need to specify the distinction between finite sequences and maps of 
Z + into T, we call the latter infinite sequences. Unless otherwise specified, we 
shall use the word sequence to mean infinite sequence. 

Proposition 1.1. Let D be an infinite subset ofZ + . Then D is denumerable , 

and in fact there is a unique enumeration of D, say {k Ì9 k 2 , ...} such that 

ki < k 2 < • • • < k n < k n+ ! < • • • . 

Proof. We let k x be the smallest element of D. Suppose inductively that we 
have defined k x < • • • < /c n , in such a way that any element k in D which is not 
equal tok l9 ... 9 k n is> k n . We define k n+1 to be the smallest element of D which 
is > k n . Then the map n i—► /c n is the desired enumeration of D. 
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Coroilary 1.2. Let S be a denumerable set and D an infinite subset of S. 

Then D is denumerable. 

Proofi Given an enumeration of 5, the subset D corresponds to a subset of 
Z + inthisenumeration. UsingProposition 1.1, weconcludethat wecanenumer- 
ate D. 

Proposition 1.3. Every infinite set contains a denumerable subset. 

Proof. Let S be an infinite set. For every non-empty subset T of S, we 
select a definite element a T in T. We then proceed by induction. We let x x be the 
chosen element a s . Suppose that we have chosen x Í9 ... ,x n having the property 
that for each k = 2,..., n the element x k is the selected element in the subset 
which is the complement of {x u ..., x fc _ t }. We let x n+ j be the selected element 
in the complement of the set {x l5 ..., x„}. By induction, we thus obtain an 
association n ► x„ for all positive integers n, and since x n / x k for all k < n it 
follows that our association is injective, i.e. gives an enumeration of a subset of S. 

Proposition 1.4. Let D be a denumerable set , and f: D -► S a surjective 

mapping. Then S is denumerable or finite. 

Proof. For each yeS , there exists an element x y eD such that f(x y ) = y 
because/ is surjective. The association y i—► x v is an injective mapping of S into 
D, because if 


y, z e S and x y = x z 


then 


y = fi.Xy) = f(x 2 ) = Z. 

Let g(y) = x y . The image of g is a subset of D and D is denumerable. Since g 
is a bijection between S and its image, it follows that S is denumerable or finite. 

Proposition 1.5. Let Dbea denumerable set. Then D x D (the set of all pairs 
(x, y) mth x, yeD) is denumerable. 

Proof. There is a bijection between D x D and Z + x Z + , so it will suffice to 
provethatZ + x Z + is denumerable. Consider the mapping of Z + x Z + ->Z + 
given by 


(m, n)y-+2 n 3 m . 

It is injective, and by Proposition 1.1, our result follows. 

Proposition 1.6. Let {D l5 D 2 , . . .} be a sequence of denumerable sets. Let S 
be the union of all sets D t (i = 1,2,.. .). Then S is denumerable. 
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Proof For each i = 1 , 2,... we enumerate the elements of D i9 as indicated 
in the following notation: 

D i * {x^ i, x 12 , x 13 ,...} 

D 2 : {*21> X 22> X 23> • • •} 

Dï- {Xn,x i29 x i39 ...} 


The map/:Z + xZ + ->D given by 


f(Uj) = Xij 


is then a surjective map of Z + x Z + onto S. By Proposition 1.4, it follows that 
S is denumerable. 

Corollary 1.7. Let F bea non-empty finite set and D a denumerable set. Then 
F x D is denumerable. If S l9 S 2f ... are a sequence of sets 9 each of which is 
finite or denumerable , then the union S^ u S 2 u • • • is denumerable or finite. 

Proof. There is an injection of F into Z + and a bijection of D with Z + . Hence 
there is an injection of F x Z + into Z + x Z + and we can apply Corollary 1.2 
and Proposition 1.6 to prove the first statement. One could also define a sur- 
jective map of Z + x Z + onto F x D. (Cf. Exercises 1 and 4.) As for the second 
statement, each finite set is contained in some denumerable set, so that the second 
statement follows from Proposition 1.1 and 1.6. 

For convenience, we shall say that a set is countable if it is either finite or 
denumerable. 


§2. ZORN'S LEMMA 

In order to deal efficiently with infinitely many sets simultaneously, one needs 
a special property. To state it, we need some more terminology. 

Let S be a set. An ordering (also called partial ordering) of S is a relation, 
written x ^ y 9 among some pairs ofelements ofS, having the following properties. 

ORD 1. We have x ^ x. 

ORD 2. Ifx ^ y and y ^ z then x ^ z. 

ORD 3. Ifx ^ y and y ^ x then x = y. 
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Wesometimeswritey ^ xforx rg y. Notethat wedon’t requirethattherelation 
x ^ y or y ^ x hold for every pair of elements (x, y) of S. Some pairs may not be 
comparable. Ifthe ordering satisfies this additional property, then we say that it 

is a total ordering. 

Example 1. Let G be a group. Let S be the set of subgroups. If //, H' are 
subgroups of G, we define 


H SH' 

if H is a subgroup of H'. One verifies immediately that this relation defines an 
ordering on S. Given two subgroups //, H' of G, we do not necessarily have 
H ^ H' or H' ^ H. 

Example 2. Let R be a ring, and let S be the set of left ideals of R. We define 
an ordering in S in a way similar to the above, namely if L, L' are left ideals of R, 
we define 


L^L' 


if L c: L'. 

Example 3. Let X be a set, and S the set of subsets of X. If Y , Z are subsets 
of X , we define Y ^ Z if Y is a subset of Z. This defines an ordering on S. 

In all these examples, the relation of ordering is said to be that of inclusion. 

In an ordered set, if x ^ y and x/ywe then write x < y. 

Let A be an ordered set, and B a subset. Then we can define an ordering on B 
by defining x ^ y for x, y e B to hold if and only if x ^ y in A. We shall say that 
R 0 is the ordering on B induced by R , or is the restriction to B of the partial 
ordering of A. 

Let S be an ordered set. By a least element of S (or a smallest element) one 
means an element a e S such that a ^ x for all xeS. Similarly, by a greatest 
element one means an element b such that x ^ b for all x e 5. 

By a maximal element m of S one means an element such that if x e 5 and 
x ^ m,thenx = m. Notethatamaximalelementneednotbeagreatestelement. 
There may be many maximal elements in 5, whereas if a greatest element exists, 
then it is unique (proof ?). 

Let S be an ordered set. We shall say that S is totally ordered if given x, y e S 
we have necessarily x ^ y or y ^ x. 

Example 4. The integers Z are totally ordered by the usual ordering. So 
are the real numbers. 

Let S be an ordered set, and T a subset. An upper bound of T (in S) is an 
element beS such that x ^ b for all xeT. A least upper bound of T in S is an 
upper bound b such that, if c is another upper bound, then b ^ c. We shall say 
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that S is inductively ordered if every non-empty totally ordered subset has an 
upper bound. 

We shall say that S is strictly inductively ordered if every non-empty totally 
ordered subset has a least upper bound. 

In Examples 1, 2, 3, in each case, the set is strictly inductively ordered. To 
prove this, let us take Example 2. Let T be a non-empty totally ordered subset 
ofthe set ofsubgroups of G. This meansthatif //,//'g T,then// <= H'orH' cz H. 
Let U be the union of all sets in T. Then: 

1. U is a subgroup. Proof: If x, ye U , there exist subgroups H, H' eT 
such that x e H and y e H'. If, say, H <= H' , then both x,yeH' and hence 
xyeH'. Hence xyeU. Also, x _ 1 g//', so x~ l eU. Hence U is a 
subgroup. 

2. U is an upper bound for each element of T. Proof: Every H eT is con- 
tained in U, so H ^ U for all H e T. 

3. U is a least upper bound for T. Proof : Any subgroup of G which 
contains all the subgroups H e T must then contain their union U. 

The proof that the sets in Examples 2, 3 are strictly inductively ordered is 
entirely similar. 

We can now state the property mentioned at the beginning of the section. 

Zorn’s Lemma. Let S be a non-empty inductively ordered set. Then there 

exists a maximal element in S. 

As an example of Zorn’s lemma, we shall now prove the infinite version of a 
theorem given in Chapters 1, §7, and XIV, §2, namely: 

Let R be an entire, principal ring and let Ebe a free module over R. Let F bea 

submodule. Then F is free. In fact, if is a basis for E, and F # {0}, 

then there exists a basis for F indexed by a subset of I. 

Proof. For each subset J of / we let Ej be the free submodule of E generated 
by all Vj,j g J, and we let Fj — Ej n F. We let S be the set of all pairs (Fj, vv) 
where J is a subset of /, and w : J' -► Fj is a basis of Fj indexed by a subset J' of J. 
We write Wj instead of w(j) for j g J'. If (Fj, vv) and (F K , u) are such pairs, we 
define (F Jy w) ^ (F K , u) if J <= K, if J' <= K', and if the restriction of u to J is 
equal to vv. (In other words, the basis u for F K is an extension of the basis vv for 
Fj.) This defines an ordering on S, and it is immediately verified that S is in fact 
inductively ordered, and non-empty (say by the finite case of the result). We can 
therefore apply Zorn’s lemma. Let (Fj, vv) be a maximal element. We contend 
that J = I (this will prove our result). Suppose J # / and let k g / but k$J. Let 
K = J u {k}. If 


Fj U {k) n F — Fj, 
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then ( F k , vv) is a bigger pair than (Fj, w) contradicting the maximality assump- 
tion. Qtherwise there exist elements of F K which can be written in the form 

cv k -f y 

with some yeEj and c e R, c # 0. The set of all elements ce R such that there 
exists y e Ej for which cv k + y e F is an ideal. Let a be a generator of this ideal, 
and let 


w k = av k + y 


be an element of F, with y e Ej. If z e F K then there exists b e R such that 
z — bw k e Ej. But z — bw k e F, whence z — bw k e Fj. It follows at once that 
the family consisting of Wj (j e J) and w k is a basis for F K , thus contradicting the 
maximality again. This proves what we wanted. 

Zorn’s lemma could be just taken as an axiom of set theory. However, it is 
not psychologically completely satisfactory as an axiom, because its statement 
is too involved, and one does not visualize easily the existence of the maximal 
element asserted in that statement. We show how one can prove Zorn’s lemma 
from other properties of sets which everyone would immediately grant as ac- 
ceptable psychologically. 

From now on to the end of the proof of Theorem 2.1, we let A be a non- 
empty partially ordered and strictly inductively ordered set. We recall that 
strictly inductively ordered means that every nonempty totally ordered subset 
has a least upper bound. We assume given a map /: A -► A such that for all 
x e A we have x ^ /(x). We could call such a map an increasing map. 

Let a e A. Let B be a subset of A. We shall say that B is admissible if: 

1. Fcontainsa. 

2. We have f(B) c B. 

3. Whenever T is a non-empty totally ordered subset of F, the least upper 
bound of T in A lies in B. 

Then B is also strictly inductively ordered, by the induced ordering of A. We 
shall prove: 

Theorem 2.1. (Bourbaki). Let A be a non-empty partially ordered and 

strictly inductively ordered set. Let f:A^A be an increasing mapping. 

Then there exists an element x 0 e A such that f(x 0 ) = x 0 . 

Proof. Suppose that A were totally ordered. By assumption, it would have 
a least upper bound b e A, and then 


b s m ^ b 9 
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so that in this case, our theorem is clear. The whole problem is to reduce the 
theorem to that case. In other words, what we need to find is a totally ordered 
admissible subset of A. 

If we throw out of A all elements xe A such that x is not a, then what 
remains is obviously an admissible subset. Thus without loss of generality, we 
may assume that A has a least element a, that is a ^ x for all xeA. 

Let M be the intersection of all admissible subsets of A. Note that A itself is 
an admissible subset, and that all admissible subsets of A contain a, so that M is 
not empty. Furthermore, M is itself an admissible subset of A. To see this, let 
x e M. Then x is in every admissible subset, so /(x) is also in every admissible 
subset, and hence /(x) e M. Hence /(M) <= M. If T is a totally ordered non- 
empty subset of M, and b is the least upper bound of T in A , then b lies in every 
admissible subset of A , and hence lies in M. It follows that M is the smallest 
admissible subset of A , and that any admissible subset of A contained in M is 
equal to M. 

We shall prove that M is totally ordered, and thereby prove Theorem 2.1. 

[First we make some remarks which don’t belong to the proof, but will help 
in the understanding of the subsequent lemmas. Since aeM , we see that 
/(a) e M, / o f ( a ) e M, and in general /”(a) e M. Furthermore, 


a á f(a) á f 2 (a) á • • • • 


If we had an equality somewhere, we would be finished, so we may assume that 
the inequalities hold. Let D 0 be the totally ordered set {/ n (a)}„^ 0 . Then D 0 
looks like this: 


a < f(a ) < f 2 (a) < < f n (á) < 

Let a x be the least upper bound of D 0 . Then we can form 

«i </(0i)</ 2 («i)<- 

in the same way to obtain D u and we can continue this process, to obtain 

D\, D 2 , • • • • 

It is clear that D l9 D 2 ,... are contained in M. If we had a precise way of ex- 
pressing the fact that we can establish a never-ending string of such denumerable 
sets, then we would obtain what we want. The point is that we are now trying to 
prove Zorn’s lemma, which is the natural tool for guaranteeing the existence of 
such a string. However, given such a string, we observe that its elements have 
two properties: If c is an element of such a string and x < c, then /(x) ^ c. 
Furthermore, there is no element between c and /(c), that is if x is an element of 
the string, then x ^ c or /(c) ^ x. We shall now prove two lemmas which show 
that elements of M have these properties.] 
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Letc e M. Weshallsay that c isanextreme pointofM ifwhenever xeM and 
x < c, then/(x) < c. For each extreme point ceM we let 

M c = set of x e M such that x < c or f(c)<x. 

Note that M c is not empty because a is in it. 


Lemma 2.2. We have M c — M for every extreme point c of M. 

Proof It will suffice to prove that M c is an admissible subset. Let x e M c . 
If x < c then /(x) < cso f(x)e M c . If x = c then /(x) = /(c) is again in M c . 
If /(c) ^ x, then f(c) < x < /(x), so once more /(x)eM c . Thus we have 
proved that/(M c ) c= M c . 

Let T be a totally ordered subset of M c and let b be the least upper bound of 
T in M. If all elements x e T are < c, then b < c and be M c . If some xeT is 
suchthat/(c) ^ x,then f(c)<x< h,andsoòisinM c . This proves our lemma. 


Lemma 2.3. Every element of M is an extreme point. 

Proof. Let E be the set of extreme points of M. Then E is not empty because 
a e E. It will suffice to prove that E is an admissible subset. We first prove that 
/ maps E into itself. Let ceE. Let x e M and suppose x < /(c). Wemustprove 
that/(x) ^/(c). By Lemma 2.2, M = M c , and hence we have x < c, or x = c, 
or /(c) ^ x. This last possibility cannot occur because x < /(c). If x < c 
then 


/(x) ^ c ^ /(c). 

If x = c then /(x) = /(c), and hence f(E) c= E. 

Next let T be a totally ordered subset of E. Let b be the least upper bound 
of T in M. We must prove that b e E. Let x e M and x < b. If for all c e T we 
have/(c) ^ x, then c =/(c) ^ x implies that x is an upper bound for T, whence 
b ^ x, which is impossible. Since M c = M for all c e E, we must therefore 
have x ^ c for some c e T. If x < c, then/(x) ^ c ^ b, and if x = c, then 

c = x < b. 

Since c is an extreme point and M c = M, we get/(x) ^ This proves that 
b e E, that E is admissible, and thus proves Lemma 2.3. 

We now see trivially that M is totally ordered. For let x,yeM. Then x is an 
extreme point of M by Lemma 2, and yeM x soy<xor 

* è f(x) < y. 


thereby proving that M is totally ordered. As remarked previously, this con- 
cludes the proof of Theorem 2.1. 
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We shall obtain Zorn’s lemma essentially as a corollary of Theorem 2.1. 
We first obtain Zorn’s lemma in a slightly weaker form. 

Corollary 2.4. Let Abea non-empty strictly inductively ordered set. Then A 

has a maximal element. 

Proof. Suppose that A does not have a maximal element. Then for each 
x e A there exists an element y x e A such that x < y x . Let f:A —> A be the map 
such that f(x) = y x for all x e A. Then A, f satisfy the hypotheses of Theorem 
2.1 and applying Theorem 2.1 yields a contradiction. 

The only difference between Corollary 2.4 and Zorn’s lemma is that in 
Corollary 2.4, we assume that a non-empty totally ordered subset has a least 
upper bound, rather than an upper bound. It is, however, a simple matter to 
reduce Zorn’s lemma to the seemingly weaker form of Corollary 2.4. We do 
this in the second corollary. 

Corollary 2.5. (Zorn’s lemma). Let S be a non-empty inductively ordered 

set. Then S has a maximal element. 

Proof. Let A be the set of non-empty totally ordered subsets of S. Then A 
is not empty since any subset of S with one element belongs to A. If X, Y e A, 
we define X < Y to mean X c Y. Then A is partially ordered, and is in fact 
strictly inductively ordered. ForletT = {2Q Í6/ beatotally orderedsubsetoM. 
Let 


2= U Xi- 

iel 

Then Z is totally ordered. To see this, let x, y e Z. Then x e X { and y e Xj for 
some i, j e I. Since T is totally ordered, say X { <= X } . Then x, y e Xj and since 
Xj is totally ordered, x ^ y or y < x. Thus Z is totally ordered, and is obviously 
a least upper bound for T in A. By Corollary 2.4, we conclude that A has a 
maximal element X 0 . This means that X 0 is a maximal totally ordered subset of 
S (non-empty). Let m be an upper bound for X 0 in 5. Then m is the desired 
maximal element of S. For if x e S and m < x then X 0 u {x} is totally ordered, 
whence equal to X 0 by the maximality of X 0 . Thus xel 0 and x < m. Hence 
x = m, as was to be shown. 


§3. CARDINAL NUMBERS 

Let A, B be sets. We shall say that the cardinality of A is the same as the 
cardinality of B , and write 

card(4) = card(B) 
if there exists a bijection of A onto B. 
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We say card(/l) g card(B) if there exists an injective mapping (injection) 
/: A -> B. We also write card(B) ^ card (A) in this case. It is clear that if 
card(y4) ^ card(B) and card(B) ^ card(C), then card(/l) ^ card(C). 

This amounts to saying that a composite of injective mappings is injective. 
Similarly, if card(/l) = card(£) and card(B) = card(C) then card (A) = card(C). 

This amounts to saying that a composite of bijective mappings is bijective. 
We clearly have card(A) = card(A). Using Zom’s lemma, it is easy to show (see 
Exercise 14) that 

card(A ^ card (B) or card(B) ^ card(A). 

Let f: A -► B be a surjective map of a set A onto a set B. Then 

card (B) ^ card(A). 

This is easily seen, because for each yeB there exists an element xeA, 
denoted by x y , such that f(x y ) = y. Then the association y x y is an injective 
mapping of B into A, whence by definition, card(fí) ^ card(A). 

Given two nonempty sets A, B we have card(A) = card(#) or card(5) = card(A). 

This is a simple application of Zorn’s lemma. We consider the family of pairs 
(S,f) where S is a subset of A and/: S —> B is an injective mapping. From the 
existence of a maximal element, the assertion follows at once. 

Theorem 3.1. (Schroeder-Bernstein). Let A, B be sets, and suppose that 
card(A) ^ card(B), and card(B) ^ card(A). Then 

card(A) = card(B). 

Proof. Let 

f.A^B and g : B -► A 

be injections. We separate A into two disjoint sets A x and A 2 . We let A x consist 
of all x e A such that, when we lift back x by a succession of inverse maps, 

x, g-\x), f~ l o g~\x), g~ l °/~ 1 ° g~ \x), ... 

then at some stage we reach an element of A which cannot be lifted back to B by 
g. We let A 2 be the complement of A t , in other words, the set of x e A which can 
be lifted back indefinitely, or such that we get stopped in B (i.e. reach an element 
of B which has no inverse image in A by/). ThenA = A x u A 2 . Weshalldefine 
a bijection h of A onto B. 

If x e A l5 we define h(x) = f(x). 

If x £ A 2 , we define h(x) = g~ *(x) = unique element yeB such that 

g(y) = x. 

Then trivially, h is injective. We must prove that h is surjective. Let be B. 
If, when we try to lift back b by a succession of maps 

f~' « g~ l o f~ l o g ~ l o f~\b) 
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we can lift back indefínitely, or if we get stopped in B, then g ( b) belongs to A 2 
and consequendy b = h(g(b)), so b lies in the image of h. On the other hand, if we 
cannot lift back b indefinitely, and get stopped in A , then f~ x (b) is defined 
(i.e., b is in the image of/), and/ _1 (í?) lies in A t . In this case, b = H(f~ l (b)) 
is also in the image of h, as was to be shown. 

Next we consider theorems concerning sums and products of cardinalities. 

We shall reduce the study of cardinalities of products of arbitrary sets to the 
denumerable case, using Zorn’s lemma. Note first that an infinite set A always 
contains a denumerable set. Indeed, since A is infinite, we can first select an 
element a x e A, and the complement of { a is infinite. Inductively, if we have 
selected distinct elements a l9 ...,a„ in A, the complement of { a l9 is 

infinite, and we can select a n+1 in this complement. In this way, we obtain a 
sequence of distinct elements of A, giving rise to a denumerable subset of A. 

Let A be a set. By a covering of A one means a set T of subsets of A such that 
the union 


U c 

Cer 

of all the elements of T is equal to A. We shall say that T is a disjoint covering if 
whenever C, C' e T, and C # C, then the intersection of C and C is empty. 

Lemma 3.2. Let A be an infinite set. Then there exists a disjoint covering of 
A by denumerable sets. 

Proof. Let S be the set whose elements are pairs (B, T) consisting of a 
subset B of A, and a disjoint covering of B by denumerable sets. Then S is not 
empty. Indeed, since A is infinite, A contains a denumerable set D, and the pair 
(D, {D}) is in S. If (B, T) and ( B' , T') are elements of 5, we define 

(b, d s (b\ n 

to mean that B c= B' , and T c= P. Let T be a totally ordered non-empty subset 
of S. We may write T = {(£, , r,)}^/ for some indexing set /. Let 

and T = (J r 4 . 

iel i e/ 


If C, C e T, C ^ C', then there exists some indices i, j such that C e T, and 
C' e Vj. Since T is totally ordered, we have, say, 

(B h r f ) ^ (Bj, r j). 

Hence in fact, C, C are both elements of T,-, and hence C, C have an empty 
intersection. On the other hand, if x e B, then x e £, for some i, and hence there 
is some C e T, such that x e C. Hence T is a disjoint covering of B. Since the 
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elements of each T, are denumerable subsets of A, it follows that V is a disjoint 
covering of B by denumerable sets, so (£, T) is in 5, and is obviously an upper 
bound for T. Therefore S is inductively ordered. 

Let (M, A) be a maximal element of S, by Zorn’s lemma. Suppose that 
M A. If the complement of M in A is infinite, then there exists a denumerable 
set D contained in this complement. Then 

(M u D, A u {D}) 

is a bigger pair than (M, A), contradicting the maximality of (M, A). Hence the 
complement of M in A is a finite set F. Let D 0 be an element of A. Let 

D x = D 0 u F. 

Then D { is denumerable. Let A^ be the set consisting of all elements of A, except 
D 0 , together with D v Then A^ is a disjoint covering of A by denumerable sets, 
as was to be shown. 

Theorem 3.3. Let A be an infinite set , and let Dbe a denumerable set. Then 

card(/l x D) = card(/l). 

Proof. By the lemma, we can write 

A = {JDì 

iel 

as a disjoint union of denumerable sets. Then 

A x D = (J (Pi x D). 

Ì 6 / 

For each i e /, there is a bijection of D t x D on D, by Proposition 1.5. Since the 
sets x D are disjoint, we get in this way a bijection of A x Don A, as desired. 

Corollary 3.4. If F is a finite non-empty set , then 
card(/l x F) = card(/l). 


Proo/ We have 

card(^) ^ card(/l x F) ^ card(/l x D) = card/4). 

We can then use Theorem 3.1 to get what we want. 

Coroilary 3.5. Let A , B be non-empty sets , /1 infinite , and suppose 


card(B) ^ card(^). 
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Then 


card(/l u B) = card(,4). 

Proof. We can write A u B = A u C for some subset C of B , such that C 
and ,4 are disjoint. (We let C be the set ofall elements of B which are not elements 
of A.) Then card(C) ^ card(^l). We can then construct an injection of A u C 
into the product 


A x {1,2} 

of A with a set consisting of 2 elements. Namely, we have a bijection of A with 
A x {1} in the obvious way, and also an injection of C into A x {2}. Thus 

card(,4 u C) ^ card(/l x {1, 2}). 

We conclude the proof by Corollary 3.4 and Theorem 3.1. 

Theorem3.6. Let A be an infinite set. Then 

card(/l x A) = card(/l). 

Proofi Let S be the set consisting of pairs (£, /) where B is an infinite subset 
of A, and /: B x B is a bijection of B onto B x B. Then S is not empty because if 
D is a denumerable subset of A, we can always find a bijection of D on D x D. 
If (B, /) and (B', /') are in 5, we define (£, /) ^ (B', /') to mean B c= B\ and the 
restriction of/' to B is equal to /. Then 5 is partially ordered, and we contend 
that 5 is inductively ordered. Let 7 be a non-empty totally ordered subset of 5, 
and say T consists of the pairs (B i9 /) for i in some indexing set /. Let 

M = U 

i e / 

We shall define a bijection g: M ^ M x M. If x e M, then x lies in some 
We define g(x) = /(x). This value /(x) is independent of the choice of B, in 
which x lies. Indeed, if x e Bj for some j e /, then say 

(Bi, /) ^ (Bj,fj). 

By assumption, <= B j9 and /(x) = /(x), so g is well defined. To show g is 
surjective, let x, yeM and (x, y)e M x M. Then x e B t for some i e I and 
y e Bj for some j e /. Again since T is totally ordered, say (B t , /) ^ (B j9 /). Thus 
Bi cz Bj , and x, y e Bj. There exists an element b e Bj such that 

fj<b) = (x, y) G Bj X Bj. 

By definition, g(b) = (x, y), so g is surjective. We leave the proof that g is 
injective to the reader to conclude the proof that g is a bijection. We then see 
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that (M, g) is an upper bound for T in S , and therefore that S is inductively 
ordered. 

Let (M, g) be a maximal element of 5, and let C be the complement of M in A. 
If card(C) ^ card(M), then 

card(M) ^ card(/l) = card(M u C) = card(M) 

by Corollary 3.5, and hence card(M) = card(/l) by Bernstein’s Theorem. Since 
card(M) = card(M x M), we are done with the proof in this case. If 

card(M) ^ card(C), 

then there exists a subset M { of C having the same cardinality as M. We consider 
(M u Mj) x (M u Mj) 

= (M x M) u (AÍ! x M) u (M x MJ u (Mj x Mj). 

By the assumption on M and Corollary 3.5, the last three sets in parentheses on 
the right of this equation have the same cardinality as M. Thus 

(MuMO x (M uMj) = (M x M)uM 2 

where M 2 isdisjoint from M x M, and has the same cardinality as M. Wenow 
define a bijection 


flfi: M u M x - (M u Mj) x (M u M t ). 

Weletgr/x) = gf(x) ifxeM,and we letgr t on Mj beanybijection ofM^ onM 2 . 
In this way we have extended g io M u M u and the pair (M u M b gfj) is in 5, 
contradicting the maximality of (M, g). The case card(M) ^ card(C) therefore 
cannot occur, and our theorem is proved (using Exercise 14 below). 

Corollary 3.7. IfA is an infinite set , and A in) = A x • • • x A is the product 
tahen n times , í/ien 


card(/l ( " ) ) = card(,4). 


Proofi Induction. 

Corollary 3.8. If A x , ... , A n are non-empty sets mth A n infinite , and 

card^,) ^ card04„) 


/or i = 1 ,..., n, í/ien 


card^! x • • • x ,4„) = card(,4„). 
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Proof. We have 

card(,4 n ) ^ card(^j x • • • x A n ) ^ card(>l n x • • • x A n ) 

and we use Corollary 3.7 and the Schroeder-Bernstein theorem to conclude the 
proof. 

CoroOary 3.9. Let A be an infinite set 9 and let O be the set of finite subsets 
of A. Then 


card(O) = card(/l). 

Proof. Let be the set of subsets of A having exactly n elements, for each 
integer n = 1,2,.... We first show that card(O n ) ^ card(>l). If F is an element 
of O n , we order the elements of F in any way, say 

F = {x„ ..., x B }. 

and we associate with F the element (x l9 ..., x n ) e A {n \ 

F h->(x 1 ,...,X n ). 

If G is another subset of A having n elements, say G = {y u .. ., y n }, and G / F, 
then 


(*i, ••■,*».) * (yi> • • •. y n \ 

Hence our map 

Fi->(xi, ...,x n ) 

of 0 M into A in) is injective. By Corollary 3.7, we conclude that 

card(O n ) ^ card(/l). 

Now O is the disjoint union of the O n for n = 1,2,... and it is an exercise to 
show that card(O) ^ card(/l) (cf. Exercise 1). Since 

card(/l) ^ card(O), 

because in particular, card(Oj) = card(/l), we see that our corollary is proved. 

In the next theorem, we shall see that given a set, there always exists another 
set whose cardinality is bigger. 

Theorem 3.10. Let A be an infinite seí, and T the set consisting of two 
elements {0, 1}. Let M be the set of all maps of A into T. Then 


card(^) ^ card(M) and card(^) ^ card(M). 
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Proof. For each x e A we let 

f x ' A -> {0, 1} 

be the map such that f x (x ) = 1 and f x (y ) = 0 if y # x. Then x i—► f x is obviously 
an injection of A into M, so that card(/l) ^ card(M). Suppose that 

card(/l) = card(M). 

Let 

x^g x 

be a bijection between A and M. We define a map h: A -► {0, 1} by the rule 

/i(x) = 0 if gf x (x) = 1, 

/î(x) = 1 if g x (x) = o. 

Then certainly h / g x for any x, and this contradicts the assumption that x\-+g x 
is a bijection, thereby proving Theorem 3.10. 

Corollary 3.11. Let A be an infinite set , and let S be the set of all subsets of A. 
Then card(^l) rg card(S) and card(/l) / card(S). 

Proof. We leave it as an exercise. [// int : If B is a non-empty subset of A, 
use the characteristic function cp B such that 

cp B (x) =1 if xe B, 

cp B (x ) = 0 if x$B. 

What can you say about the association B i—► cp B ( ì~\ 


§4. VVELL-ORDERING 

An ordered set A is said to be we!l-ordered if it is totally ordered, and if every 
non-empty subset B has a least element, that is, an element aeB such that 
a ^ x for all xeB. 

Example 1. The set of positive integers Z + is well-ordered. Any finite set 
can be well-ordered, and a denumerable set D can be well-ordered: Any bijection 
of D with Z + will give rise to a well-ordering of D. 

Example 2. Let S be a well-ordered set anc} let b be an element of some set, 
b$S. LetA = Su {b}. Wedefinex ^ bforallxGS. Then A is totally ordered, 
and is in fact well-ordered. 
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Proof. Let B be a non-empty subset of A. If B consists of b alone, then b is a 
least element of B. Otherwise, B contains some element ae A. Then B n A is not 
empty, and hence has a least element, which is obviously also a least element for 
B. 


Theorem4.1. Every non-empty set can be well-ordered. 

Proof. Let A be a non-empty set. Let S be the set of all pairs (X, co), where 
X is a subset of A and co is a well-ordering of X. Note that S is not empty because 
any single element of A gives rise to such a pair. If (X , co) and ( X' , co') are such 
pairs, we define (X, w) ^ (X', w') if X C X', if the ordering induced on X by 
o/ is equal to cu, and if X is the initial segment of X'. It is obvious that this 
defines an ordering on 5, and we contend that S is inductively ordered. Let 
{(X z , cu z )} be a totally ordered non-empty subset of S. Let X = U X/. If cz, b e X, 
tfien a, 6 lie in some X h and we define a ^ b inX if a ^ b with respect to the 
ordering cu z . This is independent of the choice of i (immediate from the assumption 
of total ordering). In fact, X is well ordered, for if Y is a non-empty subset of 
X, then there is some element y e Y which lies in some Xj. Let c be a least 
element of Xj fl Y. One verifies at once that c is a least element of Y. We can 
therefore apply Zom’s lemma. Let (X, co) be a maximal element in S. If X A, 
then, using Example 2, we can define a well-ordering on a bigger subset than 
X, contradicting the maximality assumption. This proves Theorem 4.1. 

Note. Theorem 4.1 is an immediate and straightforward consequence of 
Zorn’s lemma. Usually in mathematics, Zorn’s lemma is the most efficient tool 
when dealing with infinite processes. 


EXERCISES 


1. Prove the statement made in the proof of Corollary 3.9. 

2. If A is an infinite set, and is the set of subsets of A having exactly n elements, show that 

card(^) ^ card^,,) 

for n ^ 1. 

3. Let Ai be infinite sets for i = 1,2,... and assume that 


card^) ^ card(/l) 
for some set A, and all i. Show that 


card 



^ card(A). 
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4. Let tfbea subfield of the complex numbers. Show that for each integer n ^ 1, the 
cardinality of the set of extensions of K of degree n in C is ^ card(À). 

5. Let K be an infinite field, and E an algebraic extension of K. Show that 

card(£) = card(X). 

6. Finish the proof of the Corollary 3.11. 

7. If A , B are sets, denote by M(A, B) the set of all maps of A into B. If B, B' are sets with 
the same cardinality, show that M(A , B) and M(A , B') have the same cardinality. If 
/4, A' have the same cardinality, show that M(A , B) and M(4', B) have the same 
cardinality. 

8. Let A be an infinite set and abbreviate card(A) by a. If B is an infinite set, abbreviate 
card(fi) by /3. Define a/3 to be card(A x B). Let B' be a set disjoint from A such that 
card(fi) = card(B'). Define a + /3 to be card(A u B r ). Denote by B A the set of all maps 
of A into B , and denote card(5 A ) by /3 a . Let C be an infinite set and abbreviate card(C) 
by y. Prove the following statements: 

(a) ot(p + y) = otfi + oty. 

(b) OLp = pOL. 

(c) ot p+y = a^a*. 

9. Let K be an infinite field. Prove that there exists an algebraically closed field K a 
containing K as a subfield, and algebraic over K. [Hint: Let fì be a set of cardinality 
strictly greater than the cardinality of K , and containing K. Consider the set S of all 
pairs (£, cp) where E is a subset of fì such that K c= £, and cp denotes a law of addition 
and multiplication on E which makes E into a field such that K is a subfield, and E is 
algebraic over K. Define a partial ordering on S in an obvious way; show that S is 
inductively ordered, and that a maximal element is algebraic over K and algebraically 
closed. You will need Exercise 5 in the last step.] 

10. Let K be an infìnite fìeld. Show that the field of rational functions K(t) has the same 
cardinality as K. 

11. Let J n be the set of integers {1,..., n}. Let Z + be the set of positive integers. Show 
that the following sets have the same cardinality: 

(a) The set of all maps M(Z + , J n ). 

(b) The set of all maps M(Z + , J 2 )- 

(c) The set of all real numbers x such that 0 ^ x < 1. 

(d) The set of all real numbers. 

12. Show that M(Z + , Z + ) has the same cardinality as the real numbers. 

13. Let S be a non-empty set. Let S' denote the product S with itself taken denumerably 
many times. Prove that (S')' has the same cardinality as S'. [Given a set S whose 
cardinality is strictly greater than the cardinality of R, I do not know whether it is 
always true that card S = card S '.] Added 1994: The grapevine communicates to me 
that according to Solovay, the answer is “no.” 

14. Let A, B be non-empty sets. Prove that 

card(A) i card(fî) or card(fí) ^ card(A). 

[Hint: consider the family of pairs (C, /) where C is a subset of A and /: C —> 23 is 
an injective map. By Zom’s lemma there is a maximal element. Now finish the proof]. 
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period, 8, 23 
set, 876 

under a place, 349 
infinitely 
large, 450 
small, 450 
injective 


map, ix 

module, 782, 830 
resolution, 788, 801, 819 
inner automorphism, 26 
inseparable 
degree, 249 
extension, 247 
integers mod n, 94 
integral, 334, 351, 352, 409 
closure, 336, 409 
domain, 91 
equation, 334 
extension, 340 
homomorphism, 337 
map, 357 
root test, 185 

valued polynomials, 216, 435 
integrally closed, 337 
integrality criterion, 352, 409 
invariant 
bases, 550 
submodule, 665 
invariant 

of linear map, 557, 560 
of matrix, 557 
of module, 153, 557, 563 
of submodule, 153, 154 
inverse, ix, 7 

inverse limit, 50, 51, 161, 163, 169 
of Galois groups, 313, 328 
inverse matrix, 518 
invertible, 84 
Irr(z,fc,X), 224 
irreducible 

algebraic set, 382, 408 
character, 669, 696 
element, 111 
module, 554 
polynomial, 175, 183 
polynomial of a field element, 224 
irrelevant prime, 436 
isolated prime, 422 
isometry, 572 
isomorphism, 10, 54 

of representations, 56, 667 
isotropy group, 27 
Iss’sa-Hironaka theorem, 498 

Jacobson 

density, 647 
radical, 658 
Jordan-Hòlder, 22, 156 
Jordan canonical form, 559 
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A'-family, 771 
A'-theory, 139, 771-782 
kemel 

of bilinear map, 48, 144, 522, 572 
of homomorphism, 11, 133 
Kolchin’s theorem, 661 
Koszul complex, 853 
Krull 

theorem, 429 
topology, 329 
Krull-Remak-Schmidt, 441 
Kummer extensions 

abelian, 294-296, 332 
non-abelian, 297, 304, 326 


L-functions, 727 

lambda operation, 217 

lambda-ring, 218, 780 

Langlands conjectures, 316, 319 

lattice, 662 

law of composition, 3 

Lazard’s theorem, 639 

leading coeflfìcient, 100 

least 

common multiple, 113 
element, 879 
upper bound, 879 
left 

coset, 12 

derived functor, 791 
exact, 790 
ideal, 86 
module, 117 
length 

of complex, 765 
of fìltration, 433 
of module, 433, 644 
Lie algebra, 548 
lie above 
prime, 338 
valuation ring, 350 
lifting, 227 
linear 

combination, 129 
dependence, 130 
independence, 129, 150, 283 
map, 119 
polynomial, 100 
linearly disjoint, 360 
local 

degree, 477 
homomorphism, 444 


norm, 478 
parameter, 487 
ring, 110, 425, 441 
uniformization, 498 
localization, 110 
locally nilpotent, 418 
logarithm, 497, 597 
logarithmic derivative, 214, 375 

Mackey’s theorems, 694 
MacLane’s criterion, 364 
mapping cylinder, 838 
Maschke’s theorem, 666 
Mason-Stothers theorem, 194, 220 
matrix, 503 

of bilinear map, 528 
over non-commutative ring, 641 
maximal 

abelian extension, 269 
archimedean, 450 
element, 879 
ideal, 92 

metric linear map, 573 
minimal polynomial, 556, 572 
Mittag-Leffler condition, 164 
modular forms, 318, 319 
module, 117 

over principal ring, 146, 521 
modulo an ideal, 90 
Moebius inversion, 116, 254 
monic, 175 
monoid, 3 

algebra, 106, 126 
homomorphism, 10 
monomial, 101 
monomorphism, 120 
Morita’s theorem, 660 
morphism, 53 
of complex, 765 
of functor, 65, 625, 800 
or representation, 125 
multilinear map, 511, 521, 602 
multiple root, 178, 247 
multiplicative 
function, 116 
subgroup of a field, 177 
subset, 107 
multiplicity 

of character, 670 
of root, 178 
of simple module, 644 

Nakayama’s lemma, 424, 661 
natural transformation, 65 
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negative, 449 
defìnite, 578 

Newton approximation, 493 
nilpotent, 416, 559, 569 
Noether normalization, 357 
Noetherian, 186, 210, 408-409, 415, 427 
graded ring, 427 
module, 413 

non-commutative variables, 633 
non-degenerate, 522, 572 
non-singular, 523, 529 
norm, 284, 578, 637 
on a vector space, 469 
on a fìnitely generated abelian group, 166 
normal 

basis theorem, 312 
endomorphism, 597 
extension, 238 
subgroup, 14 
tower, 18 
normalizer, 14 
Northcott theorems, 864 
null 

sequence, 52 
space, 586 

nullstellensatz, 380, 383 


occur, 102, 176 
odd permutation, 31 
one-dimensional 
character, 671 
representation, 671 
open complex, 761 
open set, 406 
operate 

on a module, 664 
on an object, 55 
on a set, 25, 76 
orbit, 28 

decomposition formula, 29 
order 

of a group, 12 
at p, 113, 488 
at a valuation, 488 
of a zero, 488 
ordering, 449, 480, 878 
ordinary tensor product, 630 
orthogonal 

basis, 572-585 
element, 48, 144, 572 
group, 535 
map, 535 
sum, 572 


orthogonality relations, 677 
orthogonalization, 579 
orthonormal, 577 
over a map, 229 


p-adic 

integers, 51, 162, 169, 488 
numbers, 488 
p-class, 706 
p-conjugate, 706 
p-divisible, 50 
p-elementary, 705 
p-group, 33 
p-regular, 705 
p-singular, 705 
p-subgroup, 33 
pairing, 48 

parallelogram law, 598 
partial fractions, 187 
partition, 79 
function, 211 
perfect, 252 
period, 23, 148 

periodicity of Clifford algebra, 758 
permutation, 8, 30 
perpendicular, 48, 144, 522 
Pfaffian, 589 

Pic or Picard group, 88, 126 
place, 349, 482 
Poincaré series, 211, 431 
point 

of algebraic set, 383 
in a fìeld, 408 
polar decomposition, 584 
polarization identity, 580 
pole, 488 
polynomial, 97 
algebra, 97, 633 
function, 98 
invariants, 557 
irreducible, 175, 183 
Noetherian, 185 
Pontijagin dual, 145 
positive, 449 

defìnite, 578, 583 
power map, 10 
power series, 205 
factorial, 209 
Noetherian, 210 
primary 

decomposition, 422 
ideal, 421 
module, 421 
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pnme 

element, 113 
field, 90 
ideal, 92 
ring, 90 
primitive 

element, 243, 244 
group, 80 
operation, 79 
polynomials, 181, 182 
power series, 209 
root, 301 

root of unity, 277, 278 
principal 

homomorphism, 418 
ideal, 86, 88 
module, 554, 556 
representation, 554 
ring, 86, 146, 521 
product 

in category, 58 
of groups, 9 
of modules, 127 
of rings, 91 
profinite, 51 
projection, 388 
projective 

module, 137, 168, 848, 850 
resolution, 763 
space, 386 
proper, ix 

congruence, 492 
pull-back, 61 
purely inseparable 
element, 249 
extension, 250 
push-out, 62, 81 


quadratic 

extension, 269 
form, 575 
map, 574 
symbol, 281 

quadratically closed, 462 
quatemions, 9, 545, 723, 758 
Quillen-Suslin theorem, 848 
quotient 
field, 110 
ring, 107 

radical 

of an ideal, 388, 417 


of a ring, 661 
of an integer, 195 
Ramanujan power series, 212 
ramification index, 483 
rank, 42, 46 

of a matrix, 506 
rational 

conjugacy class, 276, 326, 725 
element, 714 
function, 110 
real, 451 
closed, 451 
closure, 452 
place, 462 
zero, 457 
reduced 

decomposition, 422, 443 
polynomial, 177 
reduction 

criterion, 185 

map, 99, 102 

modulo an ideal, 446, 623 

mod p, 623 

refinement of a tower, 18 
regular 

character, 675, 699 
extension, 366 
module, 699, 829 
representation, 675, 829 
sequence, 850 
relations, 68 

relative invariant, 171, 327 
relatively prime, 113 
representation, 55, 124, 126 
functor, 64 

of a group, 55, 317, 664 
of a ring, 553 
space, 667 
residue class, 91 
degree, 422, 483 
ring, 91 

resolution, 763, 798 
resultant, 200, 398, 410 
system, 403 
variety, 393 
Ribet, 319 

Rieffel’s theorem, 655 
Riemann surface, 275 
Riemann-Roch, 212, 218, 220, 258 
right 

coset, 12, 75 
derived functor, 791 
exact functor, 791, 798 
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right ( continued) 
ideal, 66 
module, 117 
rigid, 275 

rigidity theorem, 276 
ring, 83 

homomorphism, 88 
of fractions, 107 
root, 175 

of unity, 177, 276 
ro\v 

operation, 154 
'rank, 506 
vector, 503 

1 S 3 and S^, 722 
scalar product, 571 
Schanuel 

conjecture, 873 
lemma, 841 
Schreier’s theorem, 22 
Schroeder-Bemstein theorem, 885 
Schur 

Galois groups, 274 
lemma, 643 

Schwarz inequality, 578, 580 
section, 64, 792 
self-adjoint, 581 
semidirect product, 15, 76 
semilinear, 532 
seminorm, 166, 475 
semipositive, 583, 597 
semisimple 

endomorphism, 569, 661 
module, 554, 647, 659 
representation, 554, 712 
ring, 651 
separable 

closure, 243 
degree, 239 
element, 240 
extension, 241, 658 
polynomial, 241 
separably generated, 363 
separating transcendence basis, 363 
sequence, 875 
Serre’s conjecture, 848 
theorem, 844 
sesquilinear form, 532 
Shafarevich conjecture, 314 
sheaf, 792 

sign of a permutation, 31, 77 


simple 

character, 669 
group, 20 

module, 156, 554, 643 
ring, 653, 655 
root, 247 

simplicity of SL n , 539, 542 
size of a matrix, 503 
skew symmetric, 526 
SL 2 , 69, 537, 539, 546 

generators and relations, 69, 70, 537 
SL n , 521, 539, 541, 547 
snake lemma, 158, 169, 614-621 
Snyder’s proof, 220 
solvable 

extension, 291, 314 
group, 18, 293, 314 
by radicals, 292 
spec of a ring, 405, 410 
special linear group, 14, 52, 59, 69, 541, 546, 
547 

specializing, 101 
specialization, 384 
spectral 

sequence, 815-825 
theorem, 581, 583, 585 
split exact sequence, 132 
splitting field, 235 
square 

matrix, 504 
group, 9, 77, 270 
root of operator, 584 
stably free, 840 
dimension, 840 
stably isomorphic, 841 
stalk, 161 
standard 

complex, 764 
altemating matrix, 587 
Steinberg theorem, 726 
Stewart-Tijdeman, 196 
strictly inductively ordered, 881 
stripping functor, 62 
Sturm’s theorem, 454 
subgroup, 9 
submodule, 118 
submonoid, 6 
subobject, 134 
subring, 84 
subsequence, 876 
subspace, 141 
substituting, 98, 101 



INDEX 913 


super 

algebra, 632 
commutator, 757 
product, 631, 751 
tensor product, 632, 751 
supersolvable, 702 
support, 419 
suijective, ix 
Sylow group, 33 
Sylvester’s theorem, 577 
symmetric 
algebra, 635 

endomorphism, 525, 585, 597 
form, 525, 571 
group, 29, 269, 272-274 
matrix, 530 
multilinear map, 635 
polynomial, 190, 217 
product, 635, 781, 861 
symplectic, 535 
basis, 599 

syzygy theorem, 862 
Szpiro conjecture, 198 

Taniyama-Shimura conjecture, 316, 319 
Tate group, 50, 163, 169 
limit, 598 
Taylor series, 213 
tensor, 581, 628 
algebra, 633 
exact, 612 
product, 602, 725 
product of complexes, 832, 851 
product representation, 725, 799 
Tits construction of free group, 81 
tor (for torsion), 42, 47, 149 
Tor, 622, 791 
dimension, 622 
Tomheim proof, 471 
torsion 

ffee, 45, 147 
module, 147, 149 
total 

complex, 815 
degree, 103 
totally ordered, 879 
tower 

of fields, 225 
of groups, 18 
trace 

of element, 284, 666 
of linear map, 511, 570 
of matrix, 505, 511 


transcendence 
basis, 356 
degree, 355 
of e, 867 

transcendental, 99 
transitive, 28, 79 
translation, 26, 227 
transpose 

of bifunctor, 808 
of linear map, 524 
of matrix, 505 
transposition, 13 
transvection, 542 
trigonometric degree, 115 
polynomial, 114, 115 
trivial 

character, 282 
operation, 664 
representation, 664 
subgroup, 9 
valuation, 465 
two-sided ideal, 86, 655 
type 

of abelian group, 43 
of module, 149 

unimodular, 846 

extension property, 849 
unipotent, 714 

unique factorization, 111, 116 
uniquely divisible, 575 
unit, 84 

element, 3, 83 
ideal, 87 
unitary, 535, 583 
universal, 37 

delta-functor, 800 
derivation, 746 
universally 
attracting, 57 
repelling, 57 
upper bound, 879 
upper diagonal group, 19 

valuation, 465 
valuation ring, 348, 481 

determined by ordering, 450, 452 
value group, 480 

Vandermonde determinant, 257-259, 516 
vanishing ideal, 38 
variable, 99, 104 
variation of signs, 454 
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variety, 382 
vector space, 118, 139 
volume, 735 

Waming’s theorem, 214 
Wedderbum’s theorem, 649 
Weierstrass 
degree, 208 
polynomial, 208 
preparation theorem, 208 
weight, 191 

well-behaved, 410, 478 
well-defined, x 
well-ordering, 891 
Weyl group, 570 


Witt group, 594, 599 
theorem, 591 
vector, 330, 492 
Witt-Grothendieck group, 595 

Zariski-Matsusaka theorem, 372 
Zariski topology, 407 
Zassenhaus lemma, 20 
zero 

divisor, 91 
element, 3 
of ideal, 390, 405 
of polynomial, 102, 175, 379, 390 
zeta function, 211, 212, 255 
Zom’s lemma, 880, 884 
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